Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Openai Openai agents python Guardrails Secured Agent
- Workflow:Datajuicer Data juicer Dataset Quality Analysis
- Workflow:Wandb Weave Tracing Setup
- Workflow:Cleanlab Cleanlab Multiannotator Consensus
- Workflow:Openai Evals Implementing a custom completion function
- Workflow:Romsto Speculative Decoding Speculative Decoding Inference
- Workflow:Neuml Txtai Model Training
- Workflow:Ollama Ollama Custom Model Creation
- Workflow:ChenghaoMou Text dedup Bloom Filter Deduplication
- Workflow:Google research Deduplicate text datasets Wiki40B TFDS deduplication
Principles
- Principle:Fastai Fastbook Activation Functions
- Principle:Pyro ppl Pyro Combinatorial Distributions
- Principle:Sdv dev SDV Metadata Detection
- Principle:DataTalksClub Data engineering zoomcamp Pipeline Containerization
- Principle:Haosulab ManiSkill Registration Pattern
- Principle:Bentoml BentoML Distributed Deployment Configuration
- Principle:Spotify Luigi Metrics Collection
- Principle:Infiniflow Ragflow Retrieval Configuration
- Principle:Bigscience workshop Petals Server Configuration
- Principle:Alibaba MNN Source Model Preparation
Implementations
- Implementation:Openai Openai python Images Response Model
- Implementation:NVIDIA DALI EfficientNet Backbone
- Implementation:Deepspeedai DeepSpeed DeepSpeedEngine Checkpoint
- Implementation:ContextualAI HALOs Alpaca Eval CLI
- Implementation:Risingwavelabs Risingwave PgOutputMessageDecoder
- Implementation:Ollama Ollama Llama Vocab
- Implementation:LMCache LMCache Audit Connector
- Implementation:Mlc ai Mlc llm Engine State
- Implementation:Ollama Ollama Imagegen Transfer
- Implementation:Pytorch Serve Management API
Heuristics
- Heuristic:Sgl project Sglang Memory Fraction Tuning
- Heuristic:Teamcapybara Capybara Animation Disabling For Tests
- Heuristic:Avdvg InjectGuard Embedding Normalization Cosine Equivalence
- Heuristic:Run llama Llama index Evaluator LLM Selection
- Heuristic:Risingwavelabs Risingwave Memory Cache Eviction Policy
- Heuristic:Cohere ai Cohere python HTTP Retry Backoff Strategy
- Heuristic:Hpcaitech ColossalAI Warning Deprecated Ray Detached PPO
- Heuristic:Predibase Lorax GPU Sampling Optimization
- Heuristic:Huggingface Diffusers LoRA Safe Fusing
- Heuristic:Norrrrrrr lyn WAInjectBench Zero Vector Fallback Failed Embeddings
Environments
- Environment:AUTOMATIC1111 Stable diffusion webui Xformers Attention
- Environment:Vllm project Vllm ROCm
- Environment:OWASP Www project top 10 for large language model applications Pydantic Invoice Agent Runtime
- Environment:Unstructured IO Unstructured Profiling Tools
- Environment:Snorkel team Snorkel PyTorch
- Environment:SqueezeAILab ETS Multi GPU Sglang Runtime
- Environment:CarperAI Trlx Python Accelerate
- Environment:Dagster io Dagster PostgreSQL Storage
- Environment:Apache Kafka JVM Runtime Environment
- Environment:Mbzuai oryx Awesome LLM Post training Python Requests