Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:PeterL1n BackgroundMattingV2 Video matting inference
- Workflow:Kornia Kornia ONNX Model Pipeline
- Workflow:Lucidrains X transformers Autoregressive Language Modeling
- Workflow:Spotify Luigi Spark Processing Pipeline
- Workflow:NVIDIA NeMo Aligner DPO Training
- Workflow:Mlc ai Web llm Chrome Extension Integration
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:Vespa engine Vespa Document indexing pipeline
- Workflow:Scikit learn contrib Imbalanced learn Imbalanced Model Evaluation
- Workflow:Huggingface Trl GRPO Training
Principles
- Principle:ARISE Initiative Robosuite Environment Wrapper Pattern
- Principle:Pyro ppl Pyro MCMC Numerical Methods
- Principle:Getgauge Taiko Checkbox Interaction
- Principle:Groq Groq python Vector Extraction
- Principle:Webdriverio Webdriverio AsyncIterationPattern
- Principle:NVIDIA DALI Pipeline Validation
- Principle:Apache Dolphinscheduler Workflow Triggering
- Principle:DataTalksClub Data engineering zoomcamp Kestra Infrastructure Setup
- Principle:Sdv dev SDV Sequential Model Fitting
- Principle:Mbzuai oryx Awesome LLM Post training Publication Count Querying
Implementations
- Implementation:Openai Openai node Speech Resource
- Implementation:Lucidrains X transformers UniversalPretrainWrapper
- Implementation:Vllm project Vllm Marlin MoE Generate Kernels
- Implementation:Datajuicer Data juicer WikipediaDownloader
- Implementation:Apache Airflow Helm Chart Values
- Implementation:Mage ai Mage ai Dremio Source
- Implementation:Pyro ppl Pyro TorchDistributionMixin
- Implementation:Pyro ppl Pyro UnconditionMessenger
- Implementation:Infiniflow Ragflow DelimiterFormField Component
- Implementation:Openai Openai python Response Input Image Param
Heuristics
- Heuristic:HKUDS AI Trader Sortino Ratio Capping
- Heuristic:Facebookresearch Audiocraft FSDP Distributed Training Tips
- Heuristic:AnswerDotAI RAGatouille Searcher Configuration By Collection Size
- Heuristic:SeldonIO Seldon core Kafka Partition Throughput Tip
- Heuristic:Shiyu coder Kronos Learning Rate And Optimizer Tuning
- Heuristic:Kornia Kornia Morphology Engine Selection
- Heuristic:Avhz RustQuant Discretization Scheme Selection
- Heuristic:Langgenius Dify SSE Streaming Error Handling
- Heuristic:Kubeflow Pipelines Component URL Commit SHA Pinning
- Heuristic:Trailofbits Fickling Race Condition Prevention
Environments
- Environment:VainF Torch Pruning PyTorch Python Core
- Environment:Sgl project Sglang Multimodal
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:Fastai Fastbook NLP SpaCy Environment
- Environment:SeldonIO Seldon core GPU Inference Environment
- Environment:LLMBook zh LLMBook zh github io PyTorch CUDA GPU Environment
- Environment:Googleapis Python genai Gemini API Key Authentication
- Environment:Roboflow Rf detr Python GPU Environment
- Environment:DataTalksClub Data engineering zoomcamp Kestra Orchestration Environment
- Environment:ArroyoSystems Arroyo Python UDF Runtime