Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lm sys FastChat MT Bench Evaluation
- Workflow:Eventual Inc Daft Data Lakehouse ETL
- Workflow:Cleanlab Cleanlab CleanLearning Robust Training
- Workflow:Pyro ppl Pyro Discrete Enumeration
- Workflow:Cleanlab Cleanlab Datalab Dataset Audit
- Workflow:Haotian liu LLaVA Two Stage Pretraining and Finetuning
- Workflow:Kserve Kserve Canary Rollout Deployment
- Workflow:Vespa engine Vespa Linguistics text processing pipeline
- Workflow:Huggingface Open r1 GRPO Reasoning Training
- Workflow:Shiyu coder Kronos Qlib Finetuning
Principles
- Principle:Kserve Kserve Batch Inference
- Principle:Turboderp org Exllamav2 Multi GPU Compatibility
- Principle:PacktPublishing LLM Engineers Handbook LoRA Adapter Injection
- Principle:Langchain ai Langchain Public API Surface
- Principle:Alibaba MNN Continuous Integration Testing
- Principle:Guardrails ai Guardrails Observability
- Principle:Pola rs Polars SQL Data Registration
- Principle:Allenai Open instruct Padding Free Training
- Principle:Arize ai Phoenix Span Annotation
- Principle:Haifengl Smile Nearest Neighbor Query
Implementations
- Implementation:Mlc ai Mlc llm Tokenizer Py
- Implementation:Princeton nlp Tree of thought llm Get Task
- Implementation:Online ml River Cluster KMeans
- Implementation:Interpretml Interpret Transpose
- Implementation:Alibaba MNN Torch Model Export
- Implementation:Scikit learn Scikit learn Make Column Selector
- Implementation:Ollama Ollama Llama Arch Header
- Implementation:Haosulab ManiSkill BackendResolution
- Implementation:Avhz RustQuant Curve get rate
- Implementation:Cohere ai Cohere python AssistantMessageResponse Model
Heuristics
- Heuristic:Huggingface Open r1 vLLM GPU Allocation
- Heuristic:Cypress io Cypress Global Install Warning
- Heuristic:ChenghaoMou Text dedup Suffix Array Merge Strategy
- Heuristic:Facebookresearch Habitat lab Mini Batch Environment Divisibility
- Heuristic:Huggingface Peft LoRA Default Configuration
- Heuristic:Hiyouga LLaMA Factory CUDA Memory Optimization
- Heuristic:Openai CLIP Class Name Curation
- Heuristic:Tensorflow Tfjs Memory Management With Tidy
- Heuristic:Eric mitchell Direct preference optimization TF32 Matmul Precision
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
Environments
- Environment:Ggml org Llama cpp CUDA GPU Environment
- Environment:Shiyu coder Kronos DDP Multi GPU Environment
- Environment:Spcl Graph of thoughts Python 3 8 Runtime
- Environment:Open compass VLMEvalKit Data Storage Environment
- Environment:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Python Environment
- Environment:Bigscience workshop Petals CUDA Server
- Environment:FMInference FlexLLMGen CUDA GPU
- Environment:Wandb Weave LLM Integration Dependencies
- Environment:DataTalksClub Data engineering zoomcamp PySpark Batch Environment
- Environment:TobikoData Sqlmesh GitHub CICD Runner