Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Dolphinscheduler RPC Service Communication
- Workflow:VainF Torch Pruning LLM Structural Pruning
- Workflow:Spotify Luigi Hadoop MapReduce Pipeline
- Workflow:Treeverse LakeFS Write Audit Publish With Hooks
- Workflow:OWASP Www project top 10 for large language model applications Vulnerability Translation
- Workflow:Trailofbits Fickling PyTorch Payload Injection
- Workflow:NVIDIA NeMo Curator Semantic Deduplication
- Workflow:Iterative Dvc Pipeline Reproduction
- Workflow:Ggml org Llama cpp HF to GGUF Model Conversion
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
Principles
- Principle:Kserve Kserve Pipeline Validation
- Principle:Langfuse Langfuse Export Completion Notification
- Principle:Deepseek ai Janus Image Post Processing Flow
- Principle:Anthropics Anthropic sdk python Message Request Construction
- Principle:Apache Paimon Lazy Blob Loading
- Principle:FlowiseAI Flowise Evaluation Rerun
- Principle:Googleapis Python genai Training Dataset Preparation
- Principle:Wandb Weave Prompt Retrieval
- Principle:Huggingface Trl PEFT LoRA Configuration Reward
- Principle:OpenGVLab InternVL Model Configuration
Implementations
- Implementation:Promptfoo Promptfoo RedteamGraderBase getResult
- Implementation:Kubeflow Kubeflow Git Tag Container Push
- Implementation:Puppeteer Puppeteer Common Types
- Implementation:Recommenders team Recommenders Notebook Utils
- Implementation:Openclaw Openclaw LoadWorkspaceBootstrapFiles
- Implementation:Nautechsystems Nautilus trader Data Event Handlers
- Implementation:Treeverse LakeFS Java SDK Model PrepareGCUncommittedResponse
- Implementation:Online ml River Forest ARFClassifier
- Implementation:NVIDIA TransformerEngine NVFP4 Storage
- Implementation:Datajuicer Data juicer RandomSelector
Heuristics
- Heuristic:Marker Inc Korea AutoRAG Warning Deprecated Legacy QA Creation
- Heuristic:Ggml org Llama cpp Warning Deprecated Legacy Converters
- Heuristic:Hpcaitech ColossalAI Warning Deprecated Ray Detached PPO
- Heuristic:Langgenius Dify Credential Sanitization In API Responses
- Heuristic:Apache Shardingsphere DDL Refresher Superclass Fallback
- Heuristic:Online ml River Hoeffding Tree Grace Period Tuning
- Heuristic:Tensorflow Serving Model Warmup Strategy
- Heuristic:Fastai Fastbook Mixup Data Augmentation
- Heuristic:BerriAI Litellm Batch Size Flush Interval Tuning
- Heuristic:Microsoft Semantic kernel Telemetry Log Level Configuration
Environments
- Environment:Apache Spark Release Build Environment
- Environment:FMInference FlexLLMGen NVMe Disk
- Environment:Sgl project Sglang ROCm
- Environment:Huggingface Peft Optional Quantization Backends
- Environment:NVIDIA NeMo Aligner NeMo Framework GPU Environment
- Environment:Treeverse LakeFS Web UI Environment
- Environment:Apache Paimon Optional Extensions
- Environment:Ucbepic Docetl LLM API Keys
- Environment:Dagster io Dagster DAGSTER HOME Configuration
- Environment:Huggingface Transformers Python 310 Runtime