Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Open compass VLMEvalKit Adding Custom VLM
- Workflow:Ucbepic Docetl YAML Pipeline Execution
- Workflow:Facebookresearch Audiocraft MusicGen Training Pipeline
- Workflow:Apache Airflow Scheduler Operation and Task Execution
- Workflow:Ggml org Ggml Vision Model Inference
- Workflow:Google research Deduplicate text datasets Single file deduplication
- Workflow:DistrictDataLabs Yellowbrick Model Selection and Tuning
- Workflow:DistrictDataLabs Yellowbrick Classification Model Evaluation
- Workflow:FlagOpen FlagEmbedding Embedder Inference
- Workflow:Neuml Txtai Workflow Orchestration
Principles
- Principle:Zai org CogVideo Flow Refinement
- Principle:Neuml Txtai Interactive Console
- Principle:Danijar Dreamerv3 Distributed Learner Training
- Principle:Roboflow Rf detr ONNX Runtime Validation
- Principle:Truera Trulens Agent Evaluation Metrics
- Principle:Lm sys FastChat ShareGPT HTML Cleaning
- Principle:Eventual Inc Daft Data Joining
- Principle:Ray project Ray Runtime Shutdown
- Principle:Facebookresearch Audiocraft Sound Dataset Augmented Loading
- Principle:CarperAI Trlx Reward Model Architecture
Implementations
- Implementation:OpenGVLab InternVL ADE20KDataset
- Implementation:Cohere ai Cohere python BedrockClientV2 Init
- Implementation:BerriAI Litellm Completion Request Types
- Implementation:LaurentMazare Tch rs Mmap Safetensors Load
- Implementation:ARISE Initiative Robosuite PandaRobot
- Implementation:CARLA simulator Carla MapData
- Implementation:Truera Trulens Record Viewer Dependencies
- Implementation:Kserve Kserve LocalModel Manager Deployment
- Implementation:Kornia Kornia Bilateral Filter
- Implementation:LMCache LMCache Basic Check
Heuristics
- Heuristic:PacktPublishing LLM Engineers Handbook RAG Retrieval Parameters
- Heuristic:Datahub project Datahub Warning Deprecated Spark Lineage Legacy
- Heuristic:ContextualAI HALOs Online Round Budgeting
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix CuDNN Benchmark Scale Width
- Heuristic:Nightwatchjs Nightwatch Warning Deprecated API Members
- Heuristic:ARISE Initiative Robomimic HDF5 Cache Mode Selection
- Heuristic:Sdv dev SDV HMA Schema Simplification
- Heuristic:Anthropics Anthropic sdk python Adaptive Thinking Over Enabled
- Heuristic:Fede1024 Rust rdkafka Transaction Error Recovery
- Heuristic:Trailofbits Fickling Force Flag Bypass
Environments
- Environment:MaterializeInc Materialize Docker Compose Runtime
- Environment:Googleapis Python genai Gemini API Key Authentication
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Datahub project Datahub Python 3 10 Ingestion Environment
- Environment:FMInference FlexLLMGen NVMe Disk
- Environment:OpenRLHF OpenRLHF Flash Attention Environment
- Environment:Snorkel team Snorkel PySpark
- Environment:Intel Ipex llm NPU Environment
- Environment:Alibaba ROLL Ascend NPU Environment
- Environment:Cypress io Cypress Linux Display Server