Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Hudi Flink MOR Compaction
- Workflow:Webdriverio Webdriverio WDIO Testrunner Setup
- Workflow:Snorkel team Snorkel Data Augmentation
- Workflow:Run llama Llama index OpenAI LLM Finetuning
- Workflow:Interpretml Interpret EBM Training And Prediction
- Workflow:Groq Groq python Chat Completion
- Workflow:VainF Torch Pruning Object Detection Pruning
- Workflow:Huggingface Optimum Accelerated Inference Pipeline
- Workflow:Trailofbits Fickling PyTorch Format Identification
- Workflow:Langchain ai Langchain Streaming Responses
Principles
- Principle:Apache Airflow Structlog Logging
- Principle:NVIDIA TransformerEngine FSDP Integration
- Principle:Microsoft Agent framework Declarative Tool Binding
- Principle:LLMBook zh LLMBook zh github io Preference Data Preparation
- Principle:Guardrails ai Guardrails Remote Guard Connection
- Principle:DataExpert io Data engineer handbook Kafka Source Table Definition
- Principle:Huggingface Transformers Repository Consistency Checking
- Principle:ClickHouse ClickHouse Code Review Process
- Principle:Google deepmind Mujoco Rendering Resource Cleanup
- Principle:Cleanlab Cleanlab Token Issue Display
Implementations
- Implementation:Liu00222 Open Prompt Injection create attacker
- Implementation:Arize ai Phoenix Google Adapter
- Implementation:ArroyoSystems Arroyo Physical Planner
- Implementation:Microsoft DeepSpeedExamples LossTracker AverageMeter Accuracy
- Implementation:CarperAI Trlx Metric Function Interface
- Implementation:Microsoft Playwright UtilityScriptSerializers
- Implementation:Apache Paimon AppendTableSplitGenerator
- Implementation:Vllm project Vllm Prometheus Metrics Endpoint
- Implementation:Apache Shardingsphere FederationMetaDataRefreshEngine Refresh
- Implementation:Microsoft Playwright PlaywrightServer
Heuristics
- Heuristic:Datahub project Datahub Git Worktree Gradle Fix
- Heuristic:Huggingface Open r1 vLLM GPU Allocation
- Heuristic:Openai Evals Chat Format Recommendation
- Heuristic:Openai CLIP JIT Vs Non JIT Loading
- Heuristic:DataExpert io Data engineer handbook Watermark Late Arrival Tolerance
- Heuristic:Speechbrain Speechbrain Gradient Clipping Strategy
- Heuristic:Alibaba MNN NC4HW4 Data Layout
- Heuristic:FMInference FlexLLMGen Weight Compression 4bit
- Heuristic:Microsoft Agent framework Silent Workflow Failures
- Heuristic:LMCache LMCache Memory Fragmentation Budget
Environments
- Environment:Vespa engine Vespa POSIX Mmap Log Control
- Environment:NVIDIA DALI PyTorch Environment
- Environment:Apache Dolphinscheduler Netty Runtime
- Environment:Openai Openai python Azure OpenAI
- Environment:Haosulab ManiSkill Python SAPIEN Core
- Environment:Sktime Pytorch forecasting Cpflows MQF2 Dependencies
- Environment:Pola rs Polars Cloud Storage Environment
- Environment:Alibaba MNN GPU CUDA Environment
- Environment:CarperAI Trlx DeepSpeed Multi GPU
- Environment:Huggingface Diffusers Attention Backends