Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki: best practices and expert-level knowledge for machine learning and data engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:InternLM Lmdeploy W8A8 SmoothQuant Quantization
- Workflow:Microsoft Agent framework Graph Based Workflow Execution
- Workflow:Treeverse LakeFS S3 Gateway Integration
- Workflow:Eric mitchell Direct preference optimization DPO Preference Training
- Workflow:Neuml Txtai Pipeline Workflow Chaining
- Workflow:Microsoft Agent framework Agent With Tool Approval
- Workflow:Togethercomputer Together python Chat Completion
- Workflow:OpenBMB UltraFeedback GPT4 Preference Annotation
- Workflow:Ucbepic Docetl Playground Interactive Development
- Workflow:Huggingface Datasets Dataset Loading and Exploration
Principles
- Principle:Mistralai Client python Azure Client Initialization
- Principle:Arize ai Phoenix Experiment Evaluator Definition
- Principle:Deepset ai Haystack Cross Encoder Reranking
- Principle:Norrrrrrr lyn WAInjectBench Model Serialization
- Principle:Bitsandbytes foundation Bitsandbytes 8bit Quantization Configuration
- Principle:Apache Flink Checkpoint Position Tracking
- Principle:Mistralai Client python Function Dispatch
- Principle:Protectai Llm guard Output Relevance Checking
- Principle:Huggingface Datasets Hub Dataset Deletion
- Principle:Anthropics Anthropic sdk python Real time Content Processing
Implementations
- Implementation:Facebookresearch Habitat lab HitlTutorial
- Implementation:Open compass VLMEvalKit OpenFlamingo
- Implementation:Speechbrain Speechbrain Prepare UrbanSound8k
- Implementation:BerriAI Litellm S3 Cache
- Implementation:TobikoData Sqlmesh WebClient OpenAPI Spec
- Implementation:Haosulab ManiSkill AgentRegistration
- Implementation:Isaac sim IsaacGymEnvs DR YAML Configuration
- Implementation:Microsoft Playwright Client WebSocket
- Implementation:FlagOpen FlagEmbedding AbsEmbedder Encode
- Implementation:Explodinggradients Ragas Generate Personas From KG
Heuristics
- Heuristic:Vespa engine Vespa Maven Parallel Build Optimization
- Heuristic:DataExpert io Data engineer handbook Flink Checkpointing Interval Tuning
- Heuristic:Protectai Llm guard ONNX Runtime Optimization
- Heuristic:MaterializeInc Materialize CI Agent Prioritization
- Heuristic:Volcengine Verl Sequence Length Balancing
- Heuristic:Hiyouga LLaMA Factory Mixed Precision Training Tips
- Heuristic:Vllm project Vllm GPU Memory Utilization Tuning
- Heuristic:Kornia Kornia Numerical Stability Patterns
- Heuristic:Sail sg LongSpec Tree Shape Configuration
- Heuristic:Hpcaitech ColossalAI Warning Deprecated Ray Detached PPO
Environments
- Environment:NVIDIA NeMo Curator Video Codec Stack
- Environment:Intel Ipex llm Windows Environment
- Environment:Ucbepic Docetl Frontend Node Environment
- Environment:Eventual Inc Daft AI Provider Dependencies
- Environment:ThreeSR Awesome Inference Time Scaling Python Runtime Environment
- Environment:Pyro ppl Pyro Distributed Training
- Environment:Sgl project Sglang OpenAI
- Environment:Vllm project Vllm NVIDIA CUDA
- Environment:Langgenius Dify Vector Database Environment
- Environment:Datahub project Datahub Python 3 10 Ingestion Environment