Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search best practices and skills for ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Openclaw Openclaw Agent Message Loop
- Workflow:Heibaiying BigData Notes Storm Topology Development
- Workflow:Arize ai Phoenix Prompt Management Pipeline
- Workflow:Apache Druid Streaming Ingestion Management
- Workflow:Pyro ppl Pyro SVI Training
- Workflow:Neuml Txtai Semantic Search Pipeline
- Workflow:MaterializeInc Materialize Release Process
- Workflow:ArroyoSystems Arroyo Local Pipeline Execution
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:Allenai Open instruct SFT Finetuning
Principles
- Principle:Apache Druid Explore State Management
- Principle:Mlflow Mlflow Trace Assessment
- Principle:Duckdb Duckdb Benchmark Build Configuration
- Principle:Duckdb Duckdb Performance Regression Detection
- Principle:Ucbepic Docetl Programmatic Optimization
- Principle:NVIDIA NeMo Aligner KTO Data Preparation
- Principle:OpenGVLab InternVL Vision Encoder LoRA
- Principle:Isaac sim IsaacGymEnvs Factory Scene Initialization
- Principle:Online ml River API Documentation Generation
- Principle:Deepset ai Haystack Optional Dependency Handling
Implementations
- Implementation:Triton inference server Server L0 Response Cache Test
- Implementation:Scikit learn Scikit learn KFold Init
- Implementation:Alibaba MNN Protobuf Arena CC
- Implementation:Ggml org Ggml Zendnn backend
- Implementation:AUTOMATIC1111 Stable diffusion webui API Server
- Implementation:Huggingface Datatrove TypesHelper
- Implementation:Explodinggradients Ragas NVMetrics Module
- Implementation:OpenRLHF OpenRLHF PromptDataset init
- Implementation:Datahub project Datahub MetadataEventFormatter
- Implementation:TobikoData Sqlmesh SelectEnvironment
Heuristics
- Heuristic:ThreeSR Awesome Inference Time Scaling Duplicate Detection By Title
- Heuristic:Ray project Ray Serve Concurrency And Backpressure
- Heuristic:Zai org CogVideo CPU Offload Strategy
- Heuristic:HKUDS AI Trader Linear Retry Backoff
- Heuristic:Datajuicer Data juicer Partition Size Tuning
- Heuristic:Apache Airflow Memory Management Tips
- Heuristic:Romsto Speculative Decoding Seed Fixing For Reproducibility
- Heuristic:Tencent Ncnn Optimize Before Quantize
- Heuristic:Unstructured IO Unstructured Chunk Size Tuning
- Heuristic:Mit han lab Llm awq Skip QK Projection Clipping
Environments
- Environment:Deepspeedai DeepSpeed Multi Accelerator Environment
- Environment:Pytorch Serve DeepSpeed Environment
- Environment:Apache Airflow Development Contributor Environment
- Environment:Infiniflow Ragflow Data Source Credentials
- Environment:Nightwatchjs Nightwatch Selenium WebDriver 4
- Environment:Datahub project Datahub Docker Runtime
- Environment:Openai Openai python Python 3 9 Plus
- Environment:Evidentlyai Evidently Spark Engine Environment
- Environment:Alibaba MNN HuggingFace Ecosystem Environment
- Environment:Speechbrain Speechbrain Multi GPU DDP