Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, from training to deployment, covering 1000+ frameworks and libraries.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Follow the Leeroopedia MCP setup guide to plug Leeroopedia into your favorite coding agent, then let it search docs, build plans, verify code, and diagnose failures on your behalf.
Go end-to-end. Leeroopedia gives your agent the knowledge. Kapso gives it the ability to act on it: research, experiment, and deploy.
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Haifengl Smile Model Serving Pipeline
- Workflow:BerriAI Litellm Proxy Server Deployment
- Workflow:Langchain ai Langchain Tool Calling Structured Output
- Workflow:Teamcapybara Capybara RSpec Integration Setup
- Workflow:Scikit learn contrib Imbalanced learn Ensemble Imbalanced Classification
- Workflow:Protectai Modelscan Programmatic Model Scanning
- Workflow:Openclaw Openclaw Channel Connection
- Workflow:Datajuicer Data juicer LLM Powered Data Generation
- Workflow:Apache Beam Portable Pipeline Submission
- Workflow:Getgauge Taiko Gauge Integration Testing
Principles
- Principle:Langfuse Langfuse Eval Error Handling and Retry
- Principle:Microsoft BIPIA Training Data Tokenization
- Principle:EvolvingLMMs Lab Lmms eval Baseline Comparison
- Principle:Webdriverio Webdriverio Command Wrapping
- Principle:Google deepmind Mujoco Resource Cleanup
- Principle:Microsoft Onnxruntime Distributed Training Configuration
- Principle:TA Lib Ta lib python Binary Wheel Installation
- Principle:Infiniflow Ragflow Chunk Management
- Principle:Huggingface Datatrove Document Level Statistics
- Principle:Volcengine Verl VLM Model Configuration
Implementations
- Implementation:Scikit learn Scikit learn FetchLfw
- Implementation:Turboderp org Exllamav2 ExLlamaV2Filter
- Implementation:Ray project Ray ActorHandle Task Remote
- Implementation:Haotian liu LLaVA Load Pretrained Model
- Implementation:Scikit learn Scikit learn LossFunction
- Implementation:Microsoft Autogen Studio MCP Resources Tab
- Implementation:Microsoft Semantic kernel VectorStoreTextSearch RAG
- Implementation:Volcengine Verl ToolAgentLoop Run
- Implementation:Pyro ppl Pyro TraceTMC ELBO
- Implementation:Huggingface Peft Get Peft Model
Heuristics
- Heuristic:FMInference FlexLLMGen Pin Memory Tradeoffs
- Heuristic:Hpcaitech ColossalAI Warmup Steps Heuristic
- Heuristic:Sgl project Sglang Schedule Conservativeness Tuning
- Heuristic:ClickHouse ClickHouse Test Writing Conventions
- Heuristic:Hiyouga LLaMA Factory Quantized Training Best Practices
- Heuristic:ChenghaoMou Text dedup Suffix Array Merge Strategy
- Heuristic:Alibaba MNN LLM Runtime Tuning
- Heuristic:Ggml org Llama cpp Warning Deprecated Legacy Converters
- Heuristic:Fede1024 Rust rdkafka Sensitive Config Sanitization
- Heuristic:Danijar Dreamerv3 Percentile Return Normalization
Environments
- Environment:Huggingface Trl Python Core Dependencies
- Environment:Farama Foundation Gymnasium MuJoCo Physics Backend
- Environment:Openai Openai agents python Python 3 9 Runtime
- Environment:Mlc ai Mlc llm Python Serving Environment
- Environment:Treeverse LakeFS Go Runtime Environment
- Environment:TobikoData Sqlmesh Python Runtime
- Environment:Mlc ai Mlc llm TVM Runtime Environment
- Environment:Bentoml BentoML Python Runtime
- Environment:Heibaiying BigData Notes Spark 2 4 Environment
- Environment:Dotnet Machinelearning Dotnet SDK And Runtime