Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mit han lab Llm awq HuggingFace Model Export
- Workflow:NVIDIA NeMo Curator Video Curation Pipeline
- Workflow:ArroyoSystems Arroyo SQL Pipeline Lifecycle
- Workflow:Interpretml Interpret Model Explanation And Visualization
- Workflow:Openai Openai agents python Basic Agent Execution
- Workflow:Isaac sim IsaacGymEnvs Factory Assembly Training
- Workflow:Online ml River Online Clustering
- Workflow:Junyanz Pytorch CycleGAN and pix2pix Pretrained Inference
- Workflow:Apache Druid Batch Data Ingestion
- Workflow:Hpcaitech ColossalAI LLaMA Continual Pretraining
Principles
- Principle:FlagOpen FlagEmbedding RetroMAE Pretraining
- Principle:Norrrrrrr lyn WAInjectBench Text Feature Extraction
- Principle:FlagOpen FlagEmbedding Reinforced Domain Adaptation
- Principle:Truera Trulens Feedback Provider Configuration
- Principle:Mlfoundations Open flamingo Classification Evaluation
- Principle:Protectai Llm guard REST API Scanning Endpoints
- Principle:Apache Dolphinscheduler Workflow DAG Definition
- Principle:Ggml org Llama cpp User Input Handling
- Principle:DataTalksClub Data engineering zoomcamp Pipeline Cleanup
- Principle:Huggingface Datasets Feature Type Definition
Implementations
- Implementation:Risingwavelabs Risingwave Docker Compose Deployment
- Implementation:Astronomer Astronomer cosmos DbtRunner Wrapper
- Implementation:Kserve Kserve AnchorTabular Explainer
- Implementation:Deepspeedai DeepSpeed DeepSpeedEngine Backward Step
- Implementation:Recommenders team Recommenders Amazon Reviews
- Implementation:Langchain ai Langchain SandboxTests
- Implementation:NVIDIA DALI EfficientNet Model
- Implementation:Apache Druid Sampler Streaming Schema
- Implementation:Mlc ai Mlc llm Qwen3 Model
- Implementation:Astronomer Astronomer cosmos TrinoCertificateProfileMapping
Heuristics
- Heuristic:Langgenius Dify API Token Single Flight Caching
- Heuristic:ARISE Initiative Robosuite Hard Reset Vs Soft Reset
- Heuristic:Guardrails ai Guardrails Guard History Memory Management
- Heuristic:Fastai Fastbook Embedding Size Rule
- Heuristic:Danijar Dreamerv3 Replay Context Carry Init
- Heuristic:Run llama Llama index Batch Eval Retry Strategy
- Heuristic:Astronomer Astronomer cosmos Dbt Invocation Mode Selection
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:Kserve Kserve Autoscaler Concurrency Target
- Heuristic:Apache Spark Serialization Optimization
Environments
- Environment:Sgl project Sglang Multi Platform Accelerators
- Environment:Pola rs Polars Cloud Storage Environment
- Environment:Infiniflow Ragflow Python Runtime
- Environment:Unstructured IO Unstructured OpenAI API
- Environment:LLMBook zh LLMBook zh github io VLLM Inference Environment
- Environment:Marker Inc Korea AutoRAG API Keys Configuration
- Environment:Pytorch Serve DeepSpeed Environment
- Environment:Haifengl Smile Java 25 Runtime
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:Huggingface Peft Optional Quantization Backends