Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Airflow Kubernetes Deployment via Helm
- Workflow:Mlc ai Mlc llm Python Engine Inference
- Workflow:Astronomer Astronomer cosmos Watcher execution mode
- Workflow:MarketSquare Robotframework browser Plugin Development
- Workflow:Openai Whisper Word Level Timestamps
- Workflow:Huggingface Transformers Pipeline Inference
- Workflow:Bentoml BentoML Model Store Management
- Workflow:Arize ai Phoenix Dataset and Experiment Lifecycle
- Workflow:Mage ai Mage ai Building a New Destination Connector
- Workflow:Confident ai Deepeval LLM Tracing and Observability
Principles
- Principle:Speechbrain Speechbrain SepFormer Model Configuration
- Principle:NVIDIA NeMo Curator Pairwise Similarity Computation
- Principle:ChenghaoMou Text dedup MinHash Fingerprinting
- Principle:Eventual Inc Daft Data Aggregation
- Principle:Google deepmind Dm control Environment Wrapping
- Principle:NVIDIA DALI TensorFlow Training Integration
- Principle:Huggingface Diffusers Pipeline Level Quantization
- Principle:PeterL1n BackgroundMattingV2 Video output writing
- Principle:Openai Whisper Language Detection
- Principle:Huggingface Optimum Library Logging Configuration
Implementations
- Implementation:Mlc ai Mlc llm Batch Jumpforward
- Implementation:Mit han lab Llm awq NVILA Benchmark
- Implementation:LMCache LMCache PDBackend Batched Submit Put Task
- Implementation:Apache Druid CoordinatorDynamicConfigDialog
- Implementation:Run llama Llama index QueryPlanTool
- Implementation:Datahub project Datahub DatahubSparkListener Init
- Implementation:Hpcaitech ColossalAI Prepare Dataset Preference
- Implementation:Truera Trulens Record Viewer Dependencies
- Implementation:Microsoft Onnxruntime PlotMetadata
- Implementation:Elevenlabs Elevenlabs python SessionStartedPayloadConfig
Heuristics
- Heuristic:Run llama Llama index Batch Eval Retry Strategy
- Heuristic:PrefectHQ Prefect SQLite Performance Tuning
- Heuristic:Snorkel team Snorkel Minimum Three LFs
- Heuristic:Apache Beam Warning Deprecated Twister2 Runner
- Heuristic:Interpretml Interpret Memory Budget Heuristic
- Heuristic:Openai Openai python Streaming Resource Management
- Heuristic:Predibase Lorax Flash Attention Backend Selection
- Heuristic:NVIDIA DALI Distributed Sharding Strategy
- Heuristic:Promptfoo Promptfoo Warning Deprecated Cache Migration
- Heuristic:Testtimescaling Testtimescaling github io Hardcoded IDs vs Registry
Environments
- Environment:Heibaiying BigData Notes Flink 1 9 Environment
- Environment:Alibaba MNN Python Export Environment
- Environment:Intel Ipex llm XPU Finetuning Environment
- Environment:DataTalksClub Data engineering zoomcamp Dbt DuckDB Environment
- Environment:Nautechsystems Nautilus trader Asyncio Uvloop Event Loop
- Environment:Allenai Open instruct Docker Container
- Environment:Elevenlabs Elevenlabs python Python Websockets
- Environment:Helicone Helicone Node 20 TypeScript Runtime
- Environment:Ucbepic Docetl Frontend Node Environment
- Environment:Vllm project Vllm AWS ECR