Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:FlowiseAI Flowise Chatbot Deployment
- Workflow:Astronomer Astronomer cosmos TaskGroup dbt integration
- Workflow:Neuml Txtai Workflow Orchestration
- Workflow:Google deepmind Mujoco Simulation benchmarking
- Workflow:Roboflow Rf detr Object Detection Inference
- Workflow:TA Lib Ta lib python Abstract API Usage
- Workflow:ArroyoSystems Arroyo Local Pipeline Execution
- Workflow:Tencent Ncnn Vulkan GPU Accelerated Inference
- Workflow:Speechbrain Speechbrain Speaker Embedding Training
- Workflow:Cypress io Cypress Project Setup and Configuration
Principles
- Principle:Online ml River Drift State Inspection
- Principle:Sgl project Sglang Sampling Parameters Preparation
- Principle:Gretelai Gretel synthetics Training Configuration
- Principle:Huggingface Diffusers Training Environment Setup
- Principle:Anthropics Anthropic sdk python Thinking Request Execution
- Principle:BerriAI Litellm Monitoring Operations
- Principle:Volcengine Verl Reward Model Scoring
- Principle:Scikit learn Scikit learn Density Estimation
- Principle:Langgenius Dify Application Creation
- Principle:Huggingface Transformers Pipeline Forward Pass
Implementations
- Implementation:Hpcaitech ColossalAI DocumentLoader
- Implementation:ClickHouse ClickHouse Poco MessageHeader
- Implementation:DataExpert io Data engineer handbook Statsig Log event
- Implementation:OpenRLHF OpenRLHF GEM Multiturn AgentExecutor
- Implementation:Datajuicer Data juicer ReplaceContentMapper
- Implementation:Evidentlyai Evidently Legacy Python Engine
- Implementation:Langchain ai Langchain UV Build
- Implementation:Datahub project Datahub DatahubJob
- Implementation:Deepspeedai DeepSpeed GEMM Test
- Implementation:Onnx Onnx Cpp2py Export
Heuristics
- Heuristic:Huggingface Datatrove FineWeb Filter Pipeline Order
- Heuristic:Pyro ppl Pyro Numerical Stability Patterns
- Heuristic:Openai CLIP L2 Normalization For Cosine Similarity
- Heuristic:Pola rs Polars Use Spawn Not Fork Multiprocessing
- Heuristic:Huggingface Optimum Device Offload Constraints
- Heuristic:Vespa engine Vespa KStemmer Dictionary Loading
- Heuristic:Marker Inc Korea AutoRAG GPU Memory Cleanup Pattern
- Heuristic:Fastai Fastbook Weight Decay Tuning
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
- Heuristic:Deepspeedai DeepSpeed FP16 Convergence Tips
Environments
- Environment:Run llama Llama index OpenAI API Configuration
- Environment:Tencent Ncnn Build Environment
- Environment:Kserve Kserve GPU Accelerator
- Environment:Lance format Lance Rust Toolchain
- Environment:Kornia Kornia PyTorch Python Environment
- Environment:Promptfoo Promptfoo Node Runtime
- Environment:Huggingface Transformers 3D Parallel Multi GPU
- Environment:Togethercomputer Together python Fine Tuning Data Requirements
- Environment:Vespa engine Vespa Cosign Sigstore Signing
- Environment:Googleapis Python genai Gemini API Key Authentication