Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Speechbrain Speechbrain Speech Enhancement Training
- Workflow:Apache Beam Dataflow Streaming Execution
- Workflow:Iterative Dvc Experiment Tracking
- Workflow:ThreeSR Awesome Inference Time Scaling Automated Paper Addition
- Workflow:Farama Foundation Gymnasium Vectorized Environment Training
- Workflow:Lm sys FastChat Distributed Model Serving
- Workflow:Triton inference server Server Quickstart Model Deployment
- Workflow:Apache Druid SQL Query Execution
- Workflow:Openai Openai agents python Multi Agent Handoff
- Workflow:Sgl project Sglang Multimodal Vision Language Inference
Principles
- Principle:AUTOMATIC1111 Stable diffusion webui Diagnostics
- Principle:Langchain ai Langgraph State Inspection
- Principle:AUTOMATIC1111 Stable diffusion webui Launch Sequence
- Principle:DataTalksClub Data engineering zoomcamp Pipeline Cleanup
- Principle:Apache Beam Windmill Connection Setup
- Principle:Huggingface Datatrove Exact Deduplication
- Principle:Huggingface Transformers Model Loading For Training
- Principle:Dagster io Dagster ML Model Lifecycle
- Principle:Dagster io Dagster Sensor Driven Pipelines
- Principle:Ray project Ray Application Deployment
Implementations
- Implementation:Kubeflow Pipelines Profile Controller Sync
- Implementation:OpenGVLab InternVL InternViT 6B Model
- Implementation:ChenghaoMou Text dedup MinHash Check False Positives
- Implementation:NVIDIA NeMo Curator WER Metric Stage
- Implementation:Openai Openai node Migration Config
- Implementation:ChenghaoMou Text dedup SimHash Union Find Cluster
- Implementation:NVIDIA DALI DALIDataset
- Implementation:Apache Kafka KafkaAdminClient CreateTopics
- Implementation:Google deepmind Dm control Reference Pose Tracking
- Implementation:Ucbepic Docetl FilterOperation Execute
Heuristics
- Heuristic:CrewAIInc CrewAI Rate Limiting Strategy
- Heuristic:Spotify Luigi Streaming MapReduce Processing
- Heuristic:Anthropics Anthropic sdk python Warning Deprecated LegacyAPIResponse
- Heuristic:NVIDIA DALI Distributed Sharding Strategy
- Heuristic:Predibase Lorax LoRA Kernel Selection By Rank
- Heuristic:Triton inference server Server Concurrency Throughput Rule
- Heuristic:Roboflow Rf detr EMA Best Checkpoint Strategy
- Heuristic:Avhz RustQuant Interpolation Method Selection
- Heuristic:Guardrails ai Guardrails Guard History Memory Management
- Heuristic:ChenghaoMou Text dedup Fingerprint Batch Size One
Environments
- Environment:Gretelai Gretel synthetics PyTorch CUDA Environment
- Environment:Vllm project Vllm AArch64 CPU
- Environment:Infiniflow Ragflow Frontend Node Environment
- Environment:Duckdb Duckdb CMake Build Toolchain
- Environment:FlowiseAI Flowise Queue Mode Environment
- Environment:Gretelai Gretel synthetics TensorFlow GPU Environment
- Environment:Pytorch Serve Distributed Training Environment
- Environment:TA Lib Ta lib python TA Lib C Library
- Environment:Marker Inc Korea AutoRAG API Keys And Credentials
- Environment:Apache Paimon Optional Extensions