Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit Optimizer Training
- Workflow:NVIDIA NeMo Aligner Supervised Fine Tuning
- Workflow:Apache Hudi Flink MOR Compaction
- Workflow:Anthropics Anthropic sdk python Structured Output Extraction
- Workflow:Spcl Graph of thoughts GoT Sorting Pipeline
- Workflow:Fastai Fastbook Collaborative Filtering
- Workflow:Huggingface Trl Supervised Finetuning
- Workflow:ARISE Initiative Robosuite Domain Randomization Training
- Workflow:Hiyouga LLaMA Factory Model Inference and Serving
- Workflow:Haosulab ManiSkill Imitation Learning Pipeline
Principles
- Principle:Hpcaitech ColossalAI Sequence Packing Dataset
- Principle:Apache Hudi Query Type Definition
- Principle:Eventual Inc Daft Column Input Normalization
- Principle:Guardrails ai Guardrails RailSpecification
- Principle:Microsoft LoRA NLU Environment Setup
- Principle:Unslothai Unsloth MoE Kernel Autotuning
- Principle:PeterL1n BackgroundMattingV2 Dataset composition
- Principle:FlowiseAI Flowise Speech Processing
- Principle:Apache Beam Classpath Packaging
- Principle:Microsoft Onnxruntime Distributed Training Loop
Implementations
- Implementation:PrefectHQ Prefect Materialize Decorator
- Implementation:Onnx Onnx Checker Cpp API
- Implementation:Infiniflow Ragflow Config Utils
- Implementation:Lance format Lance PrimitiveBlobDecoding
- Implementation:Kserve Kserve LLMIsvc Manager Deployment
- Implementation:Open compass VLMEvalKit VGRPBench Skyscraper
- Implementation:Isaac sim IsaacGymEnvs Load Asset Meshes In Warp
- Implementation:Evidentlyai Evidently LLM Judge Descriptors
- Implementation:Kserve Kserve LLM Prefill Template
- Implementation:Deepspeedai DeepSpeed PipelineEngine Init
Heuristics
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
- Heuristic:Princeton nlp SimPO BOS Token Handling
- Heuristic:Cleanlab Cleanlab Label Quality Scoring Method Selection
- Heuristic:Mit han lab Llm awq GPU Memory Management Patterns
- Heuristic:Gretelai Gretel synthetics Gumbel Softmax NaN Retry
- Heuristic:Deepseek ai Janus Bfloat16 Dtype Selection
- Heuristic:Datahub project Datahub Docker Memory Preflight
- Heuristic:Unstructured IO Unstructured Golden File Diff
- Heuristic:Onnx Onnx Opset Version Selection
- Heuristic:Vespa engine Vespa Maven Parallel Build Optimization
Environments
- Environment:Cypress io Cypress Linux Display Server
- Environment:Explodinggradients Ragas LLM Provider Environment
- Environment:Openai Evals Optional Provider APIs
- Environment:EvolvingLMMs Lab Lmms eval GPU Compute Environment
- Environment:Hiyouga LLaMA Factory Optional Inference Backends
- Environment:Facebookresearch Habitat lab SLURM Distributed Environment
- Environment:Sgl project Sglang Prometheus
- Environment:Testtimescaling Testtimescaling github io Python 3 Runtime
- Environment:Apache Paimon Python Core Runtime
- Environment:Zai org CogVideo SAT Framework Environment