Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Vllm project Vllm Multi LoRA Serving
- Workflow:Iterative Dvc Data Tracking
- Workflow:ChenghaoMou Text dedup MinHash LSH Deduplication
- Workflow:Apache Beam Portable Pipeline Submission
- Workflow:ContextualAI HALOs Online Iterative Alignment
- Workflow:NVIDIA TransformerEngine Accelerate HF Llama With TE
- Workflow:Microsoft Semantic kernel Plugin Integration And Function Calling
- Workflow:Facebookresearch Habitat lab HITL Interactive Evaluation
- Workflow:Vibrantlabsai Ragas Testset Generation
- Workflow:Huggingface Datatrove Minhash Deduplication
Principles
- Principle:Apache Beam Model Enforcement
- Principle:FMInference FlexLLMGen Optimized Inference Engine
- Principle:Pola rs Polars Multi Format Data Reading
- Principle:Sgl project Sglang Engine Lifecycle Management
- Principle:Openai Whisper Full Transcription
- Principle:SeleniumHQ Selenium CDP Session Cleanup
- Principle:Online ml River Online Random Forest Variants
- Principle:Apache Flink Split Based Record Reading
- Principle:Getgauge Taiko Input Clearing
- Principle:NVIDIA DALI Image Decoding
Implementations
- Implementation:CarperAI Trlx NeMo PPO Model
- Implementation:Huggingface Peft Prepare Model For Kbit Training
- Implementation:Ucbepic Docetl Blocking Utils
- Implementation:Microsoft Semantic kernel IFunctionInvocationFilter
- Implementation:Helicone Helicone Filter Types
- Implementation:Langfuse Langfuse Scores Repository
- Implementation:Datajuicer Data juicer VggtMapper
- Implementation:Vespa engine Vespa IndexingProcessor Process
- Implementation:SeleniumHQ Selenium DevTools Event
- Implementation:Pyro ppl Pyro Poutine Runtime
Heuristics
- Heuristic:Datahub project Datahub Emitter Selection Strategy
- Heuristic:Romsto Speculative Decoding KV Cache Instability
- Heuristic:Lucidrains X transformers Rotary Position Embedding Selection
- Heuristic:Ggml org Llama cpp GPU Layer Offloading Verification
- Heuristic:Apache Shardingsphere Shadow Routing Hint First Fallback
- Heuristic:Lucidrains X transformers Flash Attention Configuration
- Heuristic:Promptfoo Promptfoo Retry With Jitter
- Heuristic:Microsoft Onnxruntime Threading Configuration Tips
- Heuristic:DataTalksClub Data engineering zoomcamp Dbt Materialization Strategy
- Heuristic:Obss Sahi Auto Slice Resolution
Environments
- Environment:Microsoft Autogen Studio Server Environment
- Environment:Nightwatchjs Nightwatch Selenium WebDriver 4
- Environment:Hpcaitech ColossalAI ColossalQA RAG Environment
- Environment:Huggingface Alignment handbook Python TRL
- Environment:Ucbepic Docetl Docker Deployment
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Langgenius Dify Python Backend Environment
- Environment:Ucbepic Docetl Frontend Node Environment
- Environment:Huggingface Transformers PyTorch 24 CUDA
- Environment:Guardrails ai Guardrails LLM Provider API Keys