Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Shardingsphere Metadata DDL Refresh
- Workflow:Isaac sim IsaacGymEnvs Domain Randomization Training
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:VainF Torch Pruning Object Detection Pruning
- Workflow:Arize ai Phoenix Span Annotation Pipeline
- Workflow:SeleniumHQ Selenium Selenium Grid Deployment
- Workflow:DataTalksClub Data engineering zoomcamp Kafka Stream Processing
- Workflow:Dotnet Machinelearning ONNX Model Scoring
- Workflow:Marker Inc Korea AutoRAG Pipeline Deployment
- Workflow:Dotnet Machinelearning Text Classification
Principles
- Principle:Ucbepic Docetl Deterministic Code Operations
- Principle:Tencent Ncnn Non Maximum Suppression
- Principle:NVIDIA DALI TensorFlow Dataset Integration
- Principle:Groq Groq python Streaming Request Execution
- Principle:Evidentlyai Evidently Report Execution
- Principle:Tensorflow Serving Bundle Factory Configuration
- Principle:Huggingface Open r1 Synthetic Data Generation
- Principle:Triton inference server Server Copyright Management
- Principle:NVIDIA NeMo Aligner Supervised Training Loop
- Principle:Protectai Llm guard Topic Filtering
Implementations
- Implementation:Pyro ppl Pyro TraceTMC ELBO
- Implementation:Gretelai Gretel synthetics DataFrameBatch Batches To Df
- Implementation:Turboderp org Exllamav2 PromptFormat Interface
- Implementation:Puppeteer Puppeteer WaitTask
- Implementation:Huggingface Peft FourierFTConfig
- Implementation:Ollama Ollama Llama Batch
- Implementation:Mage ai Mage ai Google Ads Sync
- Implementation:Speechbrain Speechbrain Prepare Switchboard Tokenizer
- Implementation:CARLA simulator Carla Show Topology Tool
- Implementation:Microsoft LoRA LoRA Layers
Heuristics
- Heuristic:Huggingface Alignment handbook Liger Kernel Memory
- Heuristic:AUTOMATIC1111 Stable diffusion webui NaN Detection And Precision Fixes
- Heuristic:Microsoft Playwright Test Stability Practices
- Heuristic:Axolotl ai cloud Axolotl Sample Packing Best Practices
- Heuristic:Openai Openai node Warning Deprecated Beta Realtime
- Heuristic:PrefectHQ Prefect HTTP Connection Pool Tuning
- Heuristic:Astronomer Astronomer cosmos Deprecation Migration Paths
- Heuristic:Unslothai Unsloth Gradient Accumulation Accuracy
- Heuristic:DataTalksClub Data engineering zoomcamp GCS Upload Timeout Workaround
- Heuristic:Helicone Helicone Anthropic Cache Double Count Prevention
Environments
- Environment:Langfuse Langfuse Redis 7 Queue Cache
- Environment:Pola rs Polars Python Runtime Environment
- Environment:MarketSquare Robotframework browser Docker Container
- Environment:DataExpert io Data engineer handbook Python Development Environment
- Environment:Alibaba MNN GPU Metal Environment
- Environment:OpenRLHF OpenRLHF Ray Distributed Environment
- Environment:Microsoft DeepSpeedExamples ZeRO Inference Runtime
- Environment:Dotnet Machinelearning ONNX Runtime Environment
- Environment:Unstructured IO Unstructured OpenAI API
- Environment:Evidentlyai Evidently Spark Engine Environment