Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Hpcaitech ColossalAI Supervised Finetuning
- Workflow:Deepseek ai Janus Autoregressive Image Generation
- Workflow:Pola rs Polars Streaming Large Dataset Processing
- Workflow:Cleanlab Cleanlab Multiannotator Consensus
- Workflow:OWASP Www project top 10 for large language model applications GenAI Red Team Testing
- Workflow:Bentoml BentoML Model Store Management
- Workflow:Apache Airflow Provider Distribution Development
- Workflow:NVIDIA NeMo Aligner Supervised Fine Tuning
- Workflow:Norrrrrrr lyn WAInjectBench Embedding Classifier Training
- Workflow:Apache Hudi Flink Batch Incremental Read
Principles
- Principle:Vllm project Vllm LoRA Engine Configuration
- Principle:DataExpert io Data engineer handbook Experiment User Assignment
- Principle:Infiniflow Ragflow Document Upload
- Principle:Spotify Luigi NoSQL Data Targets
- Principle:InternLM Lmdeploy W8A8 Quantized Inference
- Principle:ThreeSR Awesome Inference Time Scaling Entry Format Verification
- Principle:Neuml Txtai YAML Application Configuration
- Principle:Puppeteer Puppeteer Browser Cache Management
- Principle:Guardrails ai Guardrails Validator Installation
- Principle:Webdriverio Webdriverio TypeScript Type Safety
Implementations
- Implementation:Google deepmind Dm control Composer Environment For Locomotion
- Implementation:Unstructured IO Unstructured Check Diff Expected Output
- Implementation:Neuml Txtai ImageHash
- Implementation:CrewAIInc CrewAI NL2SQL Tool
- Implementation:SeleniumHQ Selenium Rectangle
- Implementation:Avhz RustQuant HoLee
- Implementation:Huggingface Datasets Split Dataset By Node
- Implementation:Huggingface Datasets Dataset From Pandas
- Implementation:Langgenius Dify Var Utils
- Implementation:Apache Hudi HoodieTableSource GetScanRuntimeProvider
Heuristics
- Heuristic:Iamhankai Forest of Thought UCB Exploration Constant
- Heuristic:Mage ai Mage ai Parallel Sink Concurrency Limit
- Heuristic:ARISE Initiative Robomimic Checkpoint Selection Strategy
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:Eric mitchell Direct preference optimization RMSprop Over Adam
- Heuristic:Apache Airflow Database Lock Handling
- Heuristic:Mbzuai oryx Awesome LLM Post training Paper Deduplication Via Dict
- Heuristic:ClickHouse ClickHouse ThinLTO Build Tradeoffs
- Heuristic:OWASP Www project top 10 for large language model applications Deliberately Insecure Code Isolation
- Heuristic:Spcl Graph of thoughts Budget Gated Benchmark Execution
Environments
- Environment:Facebookresearch Habitat lab CUDA GPU Training Environment
- Environment:LMCache LMCache Python Runtime
- Environment:Openai Evals OpenAI API Configuration
- Environment:Intel Ipex llm Linux XPU Environment
- Environment:Iterative Dvc Python Runtime
- Environment:Pola rs Polars Rust Build Environment
- Environment:Apache Airflow Kubernetes Helm Environment
- Environment:Vllm project Vllm CUDA Runtime
- Environment:Getgauge Taiko Node Runtime
- Environment:Apache Hudi Flink Runtime Environment