Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Explodinggradients Ragas Test Data Generation
- Workflow:Spotify Luigi Central Scheduler Deployment
- Workflow:Guardrails ai Guardrails Server Deployment
- Workflow:PrefectHQ Prefect API Sourced ETL
- Workflow:Open compass VLMEvalKit Adding Custom Benchmark
- Workflow:Huggingface Peft QLoRA SFT Finetuning
- Workflow:Mlflow Mlflow Experiment Tracking
- Workflow:Sktime Pytorch forecasting NBeats Univariate Forecasting
- Workflow:Snorkel team Snorkel Weak Supervision Pipeline
- Workflow:Googleapis Python genai Function Calling and Tools
Principles
- Principle:Guardrails ai Guardrails Observability
- Principle:Iterative Dvc Target Resolution
- Principle:Predibase Lorax Stateless Conversation Management
- Principle:Webdriverio Webdriverio Async Iteration
- Principle:Apache Flink Async Sink Configuration
- Principle:Facebookresearch Audiocraft Masked Parallel Token Generation
- Principle:Microsoft Playwright Identify Interception Targets
- Principle:Ucbepic Docetl Pipeline Execution
- Principle:Lm sys FastChat Environment Setup
- Principle:Sdv dev SDV Custom Constraint Definition
Implementations
- Implementation:Hiyouga LLaMA Factory V1 NPU RMSNorm
- Implementation:Datahub project Datahub Docker Health Check Pattern
- Implementation:Huggingface Datasets Dataset To Parquet
- Implementation:Deepspeedai DeepSpeed UlyssesSPDataLoaderAdapter Init
- Implementation:Lance format Lance Commit Compaction
- Implementation:Infiniflow Ragflow Custom Exceptions
- Implementation:Mlc ai Mlc llm DeepSeek Templates
- Implementation:Intel Ipex llm Axolotl Finetuning
- Implementation:Apache Druid ModulePane
- Implementation:Norrrrrrr lyn WAInjectBench process file
Heuristics
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:Dotnet Machinelearning Tokenizer Caching Strategy
- Heuristic:Huggingface Optimum Device Offload Constraints
- Heuristic:DataTalksClub Data engineering zoomcamp Kafka Consumer Poll Timeout
- Heuristic:Webdriverio Webdriverio Stale Element Auto Refetch
- Heuristic:Triton inference server Server Concurrency Throughput Rule
- Heuristic:NVIDIA DALI Last Batch Policy Selection
- Heuristic:Avdvg InjectGuard Module Level Initialization
- Heuristic:Triton inference server Server Model Instance Scaling
- Heuristic:InternLM Lmdeploy Max Batch Size Selection
Environments
- Environment:Microsoft Onnxruntime CUDA GPU Environment
- Environment:Protectai Llm guard ONNX Runtime Acceleration
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:Lm sys FastChat API Keys And Credentials
- Environment:Haifengl Smile Java 25 Runtime
- Environment:Sail sg LongSpec Inference Environment
- Environment:NVIDIA NeMo Aligner PyTriton Serving Environment
- Environment:Promptfoo Promptfoo Python Runtime
- Environment:Apache Kafka Committer Tools Environment
- Environment:PacktPublishing LLM Engineers Handbook AWS SageMaker GPU Environment