Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Airflow Kubernetes Deployment via Helm
- Workflow:Microsoft Onnxruntime ORTModule Training
- Workflow:Ggml org Llama cpp HF to GGUF Model Conversion
- Workflow:Huggingface Datatrove Minhash Deduplication
- Workflow:Testtimescaling Testtimescaling github io Automated Citation Tracking
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
- Workflow:Huggingface Open r1 GRPO Reasoning Training
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit Optimizer Training
- Workflow:Facebookresearch Audiocraft Model Export And Deployment
- Workflow:Apache Paimon Table Read Write
Principles
- Principle:Openai Openai agents python Agent Level Guardrail Definition
- Principle:Langgenius Dify BrandingConfiguration
- Principle:Vespa engine Vespa Script Resolution
- Principle:Sgl project Sglang HTTP Server Deployment
- Principle:ContextualAI HALOs LM Eval Benchmarking
- Principle:Turboderp org Exllamav2 Layer Quantization
- Principle:LaurentMazare Tch rs Tensor Format Conversion
- Principle:SeleniumHQ Selenium Code Formatting Pipeline
- Principle:Hpcaitech ColossalAI Distributed Model Inference
- Principle:Heibaiying BigData Notes MapReduce Map Phase
Implementations
- Implementation:Hiyouga LLaMA Factory V1 Kernel Base
- Implementation:Nightwatchjs Nightwatch Namespaced Command Pattern
- Implementation:OpenHands OpenHands DaytonaRuntime Close
- Implementation:Haosulab ManiSkill UnitreeH1WithHands
- Implementation:Mlc ai Mlc llm Event Trace Recorder Header
- Implementation:BerriAI Litellm Audit Logging Endpoints
- Implementation:Risingwavelabs Risingwave FragmentDependencyGraph
- Implementation:Open compass VLMEvalKit MaCBench
- Implementation:Microsoft Playwright BidiPdf
- Implementation:FMInference FlexLLMGen DeepSpeed LR Schedules
Heuristics
- Heuristic:PrefectHQ Prefect Retry Backoff Strategy
- Heuristic:Isaac sim IsaacGymEnvs Factory Velocity Limits
- Heuristic:Microsoft Autogen Parallel Tool Call Safety
- Heuristic:Allenai Open instruct Disable Dropout In RL
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
- Heuristic:Bitsandbytes foundation Bitsandbytes Compressed Statistics Double Quantization
- Heuristic:Princeton nlp Tree of thought llm Global State Token Counting
- Heuristic:Princeton nlp SimPO Multi Seed Diversity
- Heuristic:Iterative Dvc Path Performance Optimization
- Heuristic:Farama Foundation Gymnasium Seeding Determinism Best Practices
Environments
- Environment:Huggingface Transformers Flash Attention 2 Env
- Environment:Lance format Lance Rust Toolchain
- Environment:InternLM Lmdeploy Build From Source
- Environment:Datahub project Datahub Docker Quickstart Environment
- Environment:Langfuse Langfuse Docker Infrastructure
- Environment:LMCache LMCache VLLM Serving Engine
- Environment:Duckdb Duckdb Release Publishing Env
- Environment:Unstructured IO Unstructured Ingest CLI
- Environment:Lm sys FastChat GPU CUDA Inference
- Environment:BerriAI Litellm Docker Deployment