Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Speechbrain Speechbrain Speaker Embedding Training
- Workflow:LMCache LMCache Disaggregated Prefill
- Workflow:ArroyoSystems Arroyo Checkpoint Recovery
- Workflow:Haifengl Smile Model Serving Pipeline
- Workflow:Wandb Weave SDK Release
- Workflow:Cohere ai Cohere python AWS Bedrock Deployment
- Workflow:Rapidsai Cuml Random Forest Training And Inference
- Workflow:Snorkel team Snorkel Data Augmentation
- Workflow:Treeverse LakeFS External Data Import
- Workflow:FMInference FlexLLMGen Text Completion API
Principles
- Principle:Apache Shardingsphere Shadow Algorithm Evaluation
- Principle:Langfuse Langfuse Observation Evaluation Scheduling
- Principle:Webdriverio Webdriverio BrowserStack Funnel Instrumentation
- Principle:Confident ai Deepeval Dataset Publishing
- Principle:Dotnet Machinelearning Binary Classification Training
- Principle:NVIDIA NeMo Aligner DPO Preference Data Preparation
- Principle:OpenBMB UltraFeedback Environment Setup
- Principle:Gretelai Gretel synthetics Synthetic Data Quality Evaluation
- Principle:Duckdb Duckdb Integer Bit Packing
- Principle:Sdv dev SDV Column Distribution Visualization
Implementations
- Implementation:Hpcaitech ColossalAI LoRA Finetune Script
- Implementation:Mlc ai Mlc llm Threaded Engine
- Implementation:Apache Druid Tuning Config Form
- Implementation:Evidentlyai Evidently Project Add And Configure
- Implementation:CARLA simulator Carla World API Spec
- Implementation:Eventual Inc Daft Pyproject Configuration
- Implementation:Openai Openai python Response Input File Param
- Implementation:Tensorflow Serving Session Bundle Util
- Implementation:Lance format Lance UdfRegistration
- Implementation:Apache Kafka Kafka Server Start Script
Heuristics
- Heuristic:Gretelai Gretel synthetics Parallel Generation CUDA Disable
- Heuristic:Deepset ai Haystack Pipeline Max Runs Safety Limit
- Heuristic:PacktPublishing LLM Engineers Handbook Token Window Safety Margin
- Heuristic:Google research Deduplicate text datasets Variable Width Pointer Optimization
- Heuristic:Tencent Ncnn FP16 Precision Selection
- Heuristic:Spotify Luigi Marker Table Idempotency
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:Apache Dolphinscheduler Datasource Cache Expiry
- Heuristic:Turboderp org Exllamav2 Dynamic Generator Tuning
- Heuristic:Fede1024 Rust rdkafka Queue Buffering Priority
Environments
- Environment:Online ml River Build Toolchain
- Environment:OpenHands OpenHands Frontend Build Environment
- Environment:Liu00222 Open Prompt Injection CUDA Environment
- Environment:Deepset ai Haystack Python Runtime Environment
- Environment:Apache Spark Python Environment
- Environment:Arize ai Phoenix Frontend Node 22
- Environment:Sgl project Sglang Grafana
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:Promptfoo Promptfoo Node Runtime
- Environment:Mlflow Mlflow MLflow Server Environment