Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mlc ai Web llm Web Worker Deployment
- Workflow:Interpretml Interpret Blackbox Model Explanation
- Workflow:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Data Ingestion
- Workflow:EvolvingLMMs Lab Lmms eval Custom Task Creation
- Workflow:Apache Beam Local Pipeline Execution
- Workflow:Truera Trulens LangGraph Agent Evaluation
- Workflow:ChenghaoMou Text dedup SimHash Deduplication
- Workflow:Hpcaitech ColossalAI Distributed GRPO Training
- Workflow:Mage ai Mage ai API Source Extraction
- Workflow:Sktime Pytorch forecasting TFT Demand Forecasting
Principles
- Principle:Iterative Dvc SCM Introspection
- Principle:ARISE Initiative Robomimic Checkpointing and Model Saving
- Principle:Microsoft Playwright Test Configuration
- Principle:Microsoft Onnxruntime Training Monitoring and Debugging
- Principle:Huggingface Datasets WebDataset Building
- Principle:Langfuse Langfuse OTel S3 Upload and Queue Dispatch
- Principle:Ray project Ray Ray Runtime Initialization
- Principle:Infiniflow Ragflow Application Embedding
- Principle:LaurentMazare Tch rs NumPy Format IO
- Principle:Getgauge Taiko Request Handler Functions
Implementations
- Implementation:Pytorch Serve XGBoost Iris Handler
- Implementation:Apache Paimon VectorSearch Construction
- Implementation:Microsoft Onnxruntime CgManifest
- Implementation:Langgenius Dify FetchPromptTemplate
- Implementation:FMInference FlexLLMGen AutoTokenizer Usage
- Implementation:Ollama Ollama Llama Base64
- Implementation:OpenHands OpenHands GithubManager Receive Message
- Implementation:AUTOMATIC1111 Stable diffusion webui StableDiffusionProcessingImg2Img
- Implementation:Astronomer Astronomer cosmos DbtDocsCloudOperator Init
- Implementation:Open compass VLMEvalKit Monkey
Heuristics
- Heuristic:Mistralai Client python Tool Docstring Requirement
- Heuristic:Hpcaitech ColossalAI CUDA Device Max Connections Tip
- Heuristic:Unstructured IO Unstructured Strategy Fallback Chain
- Heuristic:Apache Kafka Coordinator Loading Commit Interval
- Heuristic:Lance format Lance BM25 FTS Configuration
- Heuristic:HKUDS AI Trader A Share Lot Size Rule
- Heuristic:Zai org CogVideo Scheduler and Guidance Selection
- Heuristic:Apache Hudi Record Level Index Optimization
- Heuristic:CARLA simulator Carla Client Server Version Match
- Heuristic:Anthropics Anthropic sdk python Retry Backoff Strategy
Environments
- Environment:Mlc ai Mlc llm CUDA GPU Environment
- Environment:SeldonIO Seldon core Kubernetes Cluster Environment
- Environment:Huggingface Datatrove IO Dependencies
- Environment:ARISE Initiative Robomimic HDF5 Data Dependencies
- Environment:Dotnet Machinelearning OneDal Acceleration
- Environment:Truera Trulens LangChain LangGraph Environment
- Environment:Vllm project Vllm CPU Runtime
- Environment:Spotify Luigi Apache Spark
- Environment:Huggingface Trl Python Core Dependencies
- Environment:TobikoData Sqlmesh GitHub CICD Runner