Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1,000+ frameworks and libraries, from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DistrictDataLabs Yellowbrick Model Selection and Tuning
- Workflow:Huggingface Diffusers ControlNet Guided Generation
- Workflow:Google research Deduplicate text datasets Wiki40B TFDS deduplication
- Workflow:Scikit learn Scikit learn Ensemble Model Building
- Workflow:Recommenders team Recommenders Algorithm Benchmarking
- Workflow:Mistralai Client python OCR Document Processing
- Workflow:EvolvingLMMs Lab Lmms eval Server Mode Evaluation
- Workflow:Haotian liu LLaVA Benchmark Evaluation
- Workflow:Apache Shardingsphere Metadata DDL Refresh
- Workflow:Langchain ai Langgraph Building a Stateful Graph
Principles
- Principle:Apache Shardingsphere In Memory Rule Rebuild
- Principle:Huggingface Transformers Fully Sharded Data Parallelism
- Principle:Helicone Helicone Asynchronous Log Queuing
- Principle:OpenRLHF OpenRLHF Process Reward Model Training
- Principle:DataExpert io Data engineer handbook Streaming ETL Pipeline
- Principle:CrewAIInc CrewAI Project Scaffolding
- Principle:Iamhankai Forest of Thought Chain of Thought Reasoning
- Principle:Mlfoundations Open flamingo Benchmark Dataset Loading
- Principle:Triton inference server Server Config Optimization
- Principle:ChenghaoMou Text dedup Benchmark Evaluation
Implementations
- Implementation:SeleniumHQ Selenium ModuleGenerator
- Implementation:Zai org CogVideo CogVLM2 Predict
- Implementation:Run llama Llama index Query Engine Query
- Implementation:Risingwavelabs Risingwave Iceberg External Query
- Implementation:Triton inference server Server L0 Trt Dynamic Shape Test
- Implementation:Google deepmind Mujoco MJX Constraint
- Implementation:Huggingface Transformers LoraConfig
- Implementation:Obss Sahi Evaluate
- Implementation:Microsoft LoRA Legacy Finetune Trainer Seq2Seq
- Implementation:Openai Openai python Runtime Type Inspection
Heuristics
- Heuristic:Predibase Lorax GPU Sampling Optimization
- Heuristic:InternLM Lmdeploy OOM Troubleshooting
- Heuristic:Microsoft LoRA LoRA Rank Selection
- Heuristic:TA Lib Ta lib python NaN Propagation Behavior
- Heuristic:Triton inference server Server Server Default Configuration
- Heuristic:MaterializeInc Materialize CI Retry Strategies
- Heuristic:Zai org CogVideo Training Hyperparameter Defaults
- Heuristic:Dotnet Machinelearning Text File Sampling Strategy
- Heuristic:Apache Spark Serialization Optimization
- Heuristic:Pytorch Serve Ampere Tensor Core Optimization
Environments
- Environment:Openai Whisper FFmpeg
- Environment:Microsoft BIPIA OpenAI API Environment
- Environment:Microsoft Onnxruntime CUDA GPU Environment
- Environment:AUTOMATIC1111 Stable diffusion webui GPU Compute Backend
- Environment:Open compass VLMEvalKit GPU CUDA Environment
- Environment:Guardrails ai Guardrails Python 3 10 Runtime
- Environment:Mistralai Client python Python SDK Environment
- Environment:Pyro ppl Pyro Visualization Tools
- Environment:Spotify Luigi SQLAlchemy Database
- Environment:Openai Openai agents python Python 3 9 Runtime