Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DistrictDataLabs Yellowbrick Cluster Analysis
- Workflow:Dagster io Dagster RAG Pipeline
- Workflow:Predibase Lorax Single LoRA Inference
- Workflow:Run llama Llama index Embedding Finetuning
- Workflow:Apache Kafka Docker Image Release
- Workflow:Pyro ppl Pyro MCMC Inference
- Workflow:Norrrrrrr lyn WAInjectBench Embedding Classifier Training
- Workflow:Sgl project Sglang Multimodal Vision Language Inference
- Workflow:Recommenders team Recommenders News Recommendation NRMS
- Workflow:Googleapis Python genai Multi Turn Chat
Principles
- Principle:Scikit learn contrib Imbalanced learn Borderline Oversampling
- Principle:Online ml River Dummy Baseline Estimation
- Principle:Zai org CogVideo SAT Training Execution
- Principle:Intel Ipex llm LoRA Adapter Injection
- Principle:Turboderp org Exllamav2 Quantization Sensitivity Measurement
- Principle:Scikit learn contrib Imbalanced learn Sensitivity Specificity Analysis
- Principle:DevExpress Testcafe Configuration Loading
- Principle:Astronomer Astronomer cosmos Dbt Invocation
- Principle:Cleanlab Cleanlab Integrated Label Issue Detection
- Principle:Huggingface Open r1 Model and Tokenizer Loading
Implementations
- Implementation:ArroyoSystems Arroyo State Core
- Implementation:Apache Druid ShowLog
- Implementation:Ucbepic Docetl PipelineSettings
- Implementation:InternLM Lmdeploy Gemm SmemCopy
- Implementation:LMCache LMCache Chunk Statistics Lookup Client
- Implementation:Tensorflow Tfjs MathBackendCPU
- Implementation:OpenHands OpenHands Nested Runtime HTTP Configuration
- Implementation:Risingwavelabs Risingwave StreamChunk
- Implementation:Liu00222 Open Prompt Injection create model
- Implementation:Datajuicer Data juicer Load Formatter
Heuristics
- Heuristic:Ollama Ollama Download Retry Strategy
- Heuristic:Alibaba MNN Backend Selection Guide
- Heuristic:Cleanlab Cleanlab Confident Threshold Heuristic
- Heuristic:Sdv dev SDV Gaussian KDE Incompatibility
- Heuristic:Astronomer Astronomer cosmos Watcher Queue Sizing
- Heuristic:Obss Sahi Class Agnostic vs Per Class NMS
- Heuristic:Microsoft DeepSpeedExamples SuperOffload NUMA Binding
- Heuristic:Huggingface Datasets Flatten Indices Performance
- Heuristic:PrefectHQ Prefect Retry Backoff Strategy
- Heuristic:Diagram of thought Diagram of thought Acyclicity Constraint Enforcement
Environments
- Environment:Vllm project Vllm AWS ECR
- Environment:Tensorflow Serving Python Client Environment
- Environment:Langchain ai Langchain OpenAI API Credentials
- Environment:ArroyoSystems Arroyo Webui Runtime
- Environment:Norrrrrrr lyn WAInjectBench OpenAI API Credentials
- Environment:Risingwavelabs Risingwave Rust Build Environment
- Environment:Alibaba ROLL SGLang Inference Environment
- Environment:Volcengine Verl Megatron Core Environment
- Environment:Kserve Kserve Gateway API
- Environment:Truera Trulens OpenAI Provider Environment