Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Datahub project Datahub Metadata Actions Pipeline
- Workflow:Alibaba ROLL Reward Flow Diffusion Pipeline
- Workflow:Risingwavelabs Risingwave Sink Connector Pipeline
- Workflow:Webdriverio Webdriverio Page Object Pattern
- Workflow:Openai Openai python Audio Processing
- Workflow:NVIDIA NeMo Curator Semantic Deduplication
- Workflow:Spotify Luigi Spark Processing Pipeline
- Workflow:Microsoft BIPIA White Box Defense Finetuning
- Workflow:Tensorflow Serving Model Version Management
- Workflow:Huggingface Alignment handbook SFT DPO Alignment Pipeline
Principles
- Principle:LaurentMazare Tch rs Feature Extraction
- Principle:Pyro ppl Pyro Simulator Based Inference
- Principle:Onnx Onnx Result Validation
- Principle:Teamcapybara Capybara Server And Options Configuration
- Principle:Online ml River Imbalanced Learning
- Principle:Huggingface Datasets Data Download and Preparation
- Principle:Cleanlab Cleanlab CIFAR CNN Architecture
- Principle:DataTalksClub Data engineering zoomcamp Dbt Intermediate Layer
- Principle:Microsoft DeepSpeedExamples Baseline PyTorch Training
- Principle:Treeverse LakeFS Commit
Implementations
- Implementation:Openai Openai node Zod ParseDef
- Implementation:Run llama Llama index AgentOutput Processing
- Implementation:Datahub project Datahub Action Act Interface
- Implementation:Predibase Lorax Base Model
- Implementation:DevExpress Testcafe BrowserProviderPool GetBrowserInfo
- Implementation:Lance format Lance LegacyValueEncoding
- Implementation:Facebookresearch Habitat lab GuiPlacementHelper
- Implementation:Haosulab ManiSkill RoboCasaFixture
- Implementation:Online ml River Metrics SampleAverage
- Implementation:Interpretml Interpret ExportableEBMModel
Heuristics
- Heuristic:Huggingface Transformers Mixed Precision Training Selection
- Heuristic:ThreeSR Awesome Inference Time Scaling Date Parsing Fallback Tip
- Heuristic:Arize ai Phoenix Notebook Event Loop Patching
- Heuristic:Fastai Fastbook Random Forest Defaults
- Heuristic:OpenGVLab InternVL LoRA Alpha Scaling
- Heuristic:Farama Foundation Gymnasium Action Space Normalization Tip
- Heuristic:Huggingface Alignment handbook Global Batch Size Scaling
- Heuristic:Scikit learn Scikit learn Random State Management
- Heuristic:Microsoft Playwright Browser Specific Workarounds
- Heuristic:Astronomer Astronomer cosmos Static Parser Hang Workaround
Environments
- Environment:Apache Hudi Docker Demo Environment
- Environment:Ray project Ray Docker GPU Environment
- Environment:Mistralai Client python Azure Deployment Environment
- Environment:Groq Groq python Python Groq SDK
- Environment:Explodinggradients Ragas Python Runtime Environment
- Environment:OpenHands OpenHands Frontend Build Environment
- Environment:Sgl project Sglang CUDA
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:Cohere ai Cohere python Python SDK Runtime
- Environment:Datahub project Datahub Frontend Build