Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lucidrains X transformers DPO Preference Alignment
- Workflow:Fede1024 Rust rdkafka Transactional Produce Consume
- Workflow:Huggingface Alignment handbook SFT DPO Alignment Pipeline
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:NVIDIA NeMo Curator Fuzzy Deduplication
- Workflow:Langgenius Dify RAG Pipeline Development
- Workflow:Openclaw Openclaw Channel Connection
- Workflow:Deepset ai Haystack Document Preprocessing Pipeline
- Workflow:Togethercomputer Together python Embeddings And Reranking
- Workflow:CrewAIInc CrewAI Hierarchical Crew Execution
Principles
- Principle:Microsoft LoRA NLU LoRA Injection
- Principle:Vespa engine Vespa CMake Configuration
- Principle:Helicone Helicone ClickHouse Cost SQL Generation
- Principle:Pyro ppl Pyro Information Form Gaussian
- Principle:EvolvingLMMs Lab Lmms eval Task Utility Functions
- Principle:Microsoft Onnxruntime Checkpoint Saving
- Principle:Ggml org Llama cpp StreamingJSONParsing
- Principle:Farama Foundation Gymnasium Vectorized Environment Creation
- Principle:Mlc ai Web llm Embedding Input Formatting
- Principle:Microsoft Semantic kernel Vector Store Data Model
Implementations
- Implementation:Treeverse LakeFS ImportStatus
- Implementation:Bentoml BentoML Models Export Import
- Implementation:Open compass VLMEvalKit MMLongBench
- Implementation:Treeverse LakeFS Authorization API Spec
- Implementation:Ollama Ollama Inference Handler
- Implementation:BerriAI Litellm Integration Handlers
- Implementation:Langfuse Langfuse ProcessEventBatch
- Implementation:Webdriverio Webdriverio Error Handling Pattern
- Implementation:Huggingface Diffusers Scheduler From Config
- Implementation:ARISE Initiative Robosuite Renderer Base
Heuristics
- Heuristic:Romsto Speculative Decoding Shared Tokenizer Requirement
- Heuristic:Haosulab ManiSkill Physics Solver Tuning
- Heuristic:Pola rs Polars Streaming For Large Datasets
- Heuristic:Mbzuai oryx Awesome LLM Post training Checkpoint Every 3 Papers
- Heuristic:Fastai Fastbook Discriminative Learning Rates
- Heuristic:Zai org CogVideo Memory Optimization Strategies
- Heuristic:Allenai Open instruct NCCL CUMEM Disable
- Heuristic:EvolvingLMMs Lab Lmms eval Distributed Padding Strategy
- Heuristic:Openai Openai agents python Tool Choice Reset Prevents Loops
- Heuristic:Apache Beam Warning Deprecated Twister2 Runner
Environments
- Environment:Vllm project Vllm Intel XPU
- Environment:Facebookresearch Audiocraft AudioCraft Environment Variables
- Environment:VainF Torch Pruning CUDA GPU Benchmarking
- Environment:NVIDIA DALI CMake Build Environment
- Environment:Intel Ipex llm Pipeline Parallel Environment
- Environment:Volcengine Verl Megatron Core Environment
- Environment:TobikoData Sqlmesh BigQuery Connection
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:Allenai Open instruct CUDA GPU Training
- Environment:Openai Openai python Azure OpenAI