Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Shiyu coder Kronos Single Series Prediction
- Workflow:Isaac sim IsaacGymEnvs Policy Inference and Evaluation
- Workflow:Bentoml BentoML Multi Model Composition
- Workflow:Explodinggradients Ragas Test Data Generation
- Workflow:Microsoft LoRA GPT2 NLG Finetuning
- Workflow:Snorkel team Snorkel Multitask Classification
- Workflow:Apache Airflow Scheduler Operation and Task Execution
- Workflow:Infiniflow Ragflow Knowledge Base Document Ingestion
- Workflow:Openclaw Openclaw Agent Message Loop
- Workflow:Fastai Fastbook Collaborative Filtering
Principles
- Principle:Tensorflow Tfjs Pretrained Model Loading
- Principle:Ollama Ollama GGUF Model Conversion Mistral
- Principle:Huggingface Datasets Dataset Filtering
- Principle:Fastai Fastbook Classifier Data Preparation
- Principle:Dagster io Dagster Project Scaffolding
- Principle:Tensorflow Serving Thread Pool Management
- Principle:Apache Kafka Topic Deletion
- Principle:Junyanz Pytorch CycleGAN and pix2pix Dataset Pair Alignment
- Principle:Ggml org Llama cpp GGUF Quantization
- Principle:Microsoft Autogen Handoff Termination
Implementations
- Implementation:Apache Druid Tuning Config Form
- Implementation:FMInference FlexLLMGen Data Wrangling Install
- Implementation:Predibase Lorax Watermark Logits Processor
- Implementation:Speechbrain Speechbrain Prepare KsponSpeech LM
- Implementation:Apache Druid Publication Config Form
- Implementation:Open compass VLMEvalKit TableVQABench Utils
- Implementation:PeterL1n BackgroundMattingV2 Torch checkpoint ops
- Implementation:Princeton nlp SimPO Conda Environment Create
- Implementation:Google deepmind Mujoco mju Halton
- Implementation:Pyro ppl Pyro BART Forecast
Heuristics
- Heuristic:DataExpert io Data engineer handbook Docker Volume Persistence Management
- Heuristic:CrewAIInc CrewAI Rate Limiting Strategy
- Heuristic:Openclaw Openclaw Warning Suppression For Known Deprecations
- Heuristic:Hiyouga LLaMA Factory Gradient Checkpointing Memory Optimization
- Heuristic:Scikit learn Scikit learn Random State Management
- Heuristic:Online ml River HST Feature Scaling Requirement
- Heuristic:Apache Spark Serialization Optimization
- Heuristic:Mlc ai Mlc llm OpenCL Memory Floor Workaround
- Heuristic:Triton inference server Server Documentation Standards
- Heuristic:Facebookresearch Audiocraft Audio Normalization Strategies
Environments
- Environment:Zai org CogVideo Video Captioning Environment
- Environment:NVIDIA TransformerEngine GPU Compute Capability
- Environment:Zai org CogVideo SAT Framework Environment
- Environment:LMCache LMCache CUDA GPU Runtime
- Environment:Microsoft DeepSpeedExamples VisualChat Training Environment
- Environment:Pytorch Serve CUDA GPU Environment
- Environment:Intel Ipex llm RAG LlamaIndex Environment
- Environment:DistrictDataLabs Yellowbrick Optional NLP Dependencies
- Environment:Apache Shardingsphere Etcd Cluster Coordination
- Environment:Openai Whisper Numba