Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mistralai Client python OCR Document Processing
- Workflow:CARLA simulator Carla Building from Source
- Workflow:Iterative Dvc Data Tracking
- Workflow:Facebookresearch Habitat lab Agent Benchmarking
- Workflow:Confident ai Deepeval Synthetic Dataset Generation
- Workflow:Arize ai Phoenix Trace Ingestion Pipeline
- Workflow:Astronomer Astronomer cosmos TaskGroup dbt integration
- Workflow:Guardrails ai Guardrails Streaming Validation
- Workflow:Microsoft DeepSpeedExamples ZeRO Inference
- Workflow:ContextualAI HALOs Offline SFT Alignment Pipeline
Principles
- Principle:Nautechsystems Nautilus trader Data Event Handling
- Principle:Pytorch Serve vLLM Model Configuration
- Principle:Iterative Dvc Data Index Update
- Principle:Cohere ai Cohere python Streaming Chat Request
- Principle:Mistralai Client python GCP Client Initialization
- Principle:Microsoft Onnxruntime Checkpoint Loading
- Principle:Sdv dev SDV Fixed Combinations Constraint
- Principle:Duckdb Duckdb Build Environment Setup
- Principle:FMInference FlexLLMGen Learning Rate Scheduling
- Principle:Liu00222 Open Prompt Injection Prompt Localization
Implementations
- Implementation:Datahub project Datahub Java SDK V2 Examples
- Implementation:SeldonIO Seldon core Seldon Model CRD Explainer
- Implementation:Mage ai Mage ai GitHub Issue Events Schema
- Implementation:Zai org CogVideo Get Video Frames
- Implementation:Axolotl ai cloud Axolotl Generate Config Docs
- Implementation:Predibase Lorax Client Init
- Implementation:SeldonIO Seldon core Seldon Pipeline Load
- Implementation:Neuml Txtai Embeddings Search
- Implementation:Speechbrain Speechbrain Train IWSLT22 W2V mBART
- Implementation:Sail sg LongSpec Benchmark Eval Script
Heuristics
- Heuristic:Huggingface Peft LoRA Default Configuration
- Heuristic:Recommenders team Recommenders SAR Cold Start Items
- Heuristic:Bentoml BentoML Warning Deprecated Server Module
- Heuristic:Romsto Speculative Decoding Ngram Order Selection
- Heuristic:NVIDIA DALI Batch Size Tuning
- Heuristic:ThreeSR Awesome Inference Time Scaling Empty Venue Default Tip
- Heuristic:Lance format Lance IO Buffer And Batch Sizing
- Heuristic:Tencent Ncnn Lightmode Memory Optimization
- Heuristic:Google research Deduplicate text datasets HACKSIZE Overlap Buffer
- Heuristic:Bentoml BentoML Platform Serving Caveats
Environments
- Environment:Dagster io Dagster GRPC Communication
- Environment:Deepspeedai DeepSpeed Multi Accelerator Environment
- Environment:Datahub project Datahub Java 17 Backend Environment
- Environment:ARISE Initiative Robomimic HDF5 Data Dependencies
- Environment:Duckdb Duckdb Code Generation Tools
- Environment:NVIDIA NeMo Curator Video Codec Stack
- Environment:ARISE Initiative Robomimic HuggingFace Hub Dependencies
- Environment:Apache Kafka JVM Runtime Environment
- Environment:Astronomer Astronomer cosmos Cosmos Airflow Configuration
- Environment:Sgl project Sglang CUDA GPU Runtime