Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Datatrove Common Crawl Processing
- Workflow:Microsoft Playwright End to end test authoring
- Workflow:Bentoml BentoML Model Store Management
- Workflow:HKUDS AI Trader Agent Decision Loop
- Workflow:SeleniumHQ Selenium Chrome DevTools Protocol Integration
- Workflow:Heibaiying BigData Notes Kafka Producer Consumer Pipeline
- Workflow:Cypress io Cypress CI Pipeline Integration
- Workflow:Openclaw Openclaw Plugin And Skill Extension
- Workflow:Ggml org Llama cpp Text Generation
- Workflow:Dotnet Machinelearning Text Classification
Principles
- Principle:FMInference FlexLLMGen Data Wrangling Setup
- Principle:Scikit learn contrib Imbalanced learn Sphinx Documentation Configuration
- Principle:DistrictDataLabs Yellowbrick Residual Analysis
- Principle:Huggingface Alignment handbook APO Zero Preference Alignment
- Principle:VainF Torch Pruning Taylor Importance
- Principle:Shiyu coder Kronos Qlib Experiment Configuration
- Principle:Spotify Luigi Task Definition
- Principle:Triton inference server Server Command Line Parsing
- Principle:Avhz RustQuant Day Count Conventions
- Principle:Wandb Weave Session Finalization
Implementations
- Implementation:Scikit learn Scikit learn OPTICS
- Implementation:Axolotl ai cloud Axolotl SwanLab Custom Trainer Profiling
- Implementation:Mage ai Mage ai Chargebee Plan Model Subscriptions Schema
- Implementation:Ucbepic Docetl Directive DocSummarization
- Implementation:CARLA simulator Carla Client Load World
- Implementation:Zai org CogVideo RIFE Model
- Implementation:Apache Hudi SchemaChangeUtils IsTypeUpdateAllow
- Implementation:OpenHands OpenHands LinearManager
- Implementation:Axolotl ai cloud Axolotl Load Lora
- Implementation:Protectai Modelscan ModelScan Scan
Heuristics
- Heuristic:Pola rs Polars Streaming For Large Datasets
- Heuristic:ARISE Initiative Robomimic HDF5 Cache Mode Selection
- Heuristic:Puppeteer Puppeteer Chrome Default Launch Arguments
- Heuristic:Openai Openai node Warning Deprecated Assistants API
- Heuristic:Lance format Lance Warning Deprecated Java APIs
- Heuristic:ARISE Initiative Robomimic Data Worker Tuning By Modality
- Heuristic:BerriAI Litellm Batch Size Flush Interval Tuning
- Heuristic:Nautechsystems Nautilus trader Order Rate Limiting Configuration
- Heuristic:Pyro ppl Pyro Guide Initialization Strategy
- Heuristic:OpenBMB UltraFeedback GPU Memory Utilization
Environments
- Environment:Volcengine Verl Python Core Dependencies
- Environment:Dotnet Machinelearning Platform Architecture Support
- Environment:NVIDIA NeMo Curator RAPIDS GPU Stack
- Environment:Apache Druid Integration Test Docker
- Environment:Spotify Luigi Hadoop HDFS Cluster
- Environment:InternLM Lmdeploy Build From Source
- Environment:Huggingface Trl vLLM Generation Environment
- Environment:Cleanlab Cleanlab Image Quality Dependencies
- Environment:Spotify Luigi SQLAlchemy Database
- Environment:Promptfoo Promptfoo Provider API Keys