Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Dolphinscheduler Datasource Plugin Development
- Workflow:Google research Deduplicate text datasets Cross dataset deduplication
- Workflow:Avhz RustQuant Yield Curve Construction
- Workflow:OpenRLHF OpenRLHF Iterative DPO
- Workflow:Apache Hudi Flink Table Clustering
- Workflow:Ollama Ollama Model Registry Operations
- Workflow:Spotify Luigi Database Ingestion Pipeline
- Workflow:Datahub project Datahub Metadata Ingestion Pipeline
- Workflow:Groq Groq python Chat Completion
- Workflow:Evidentlyai Evidently ML Model Quality Report
Principles
- Principle:Openai Openai agents python Streamed Run Invocation
- Principle:Webdriverio Webdriverio Async Iteration
- Principle:Apache Airflow Helm Values Configuration
- Principle:Tensorflow Serving TFRT Model Management
- Principle:NVIDIA DALI Image Resize
- Principle:Langchain ai Langgraph UI Component Management
- Principle:Huggingface Datatrove URL Filtering
- Principle:FlagOpen FlagEmbedding Evaluation Model Loading
- Principle:Avhz RustQuant Automatic Differentiation
- Principle:Pyro ppl Pyro Deterministic Computation
Implementations
- Implementation:Evidentlyai Evidently Legacy Target Drift Preset
- Implementation:ArroyoSystems Arroyo Kinesis Source
- Implementation:Sgl project Sglang CPU Common Header
- Implementation:Apache Airflow RBAC Security Config
- Implementation:ArroyoSystems Arroyo Filesystem Source
- Implementation:LaurentMazare Tch rs Torch Api Generated Cpp
- Implementation:DevExpress Testcafe Runner Fluent API
- Implementation:Recommenders team Recommenders Load Pandas Df
- Implementation:Open compass VLMEvalKit Infer Data Job
- Implementation:NVIDIA TransformerEngine Ops Activation
Heuristics
- Heuristic:LLMBook zh LLMBook zh github io DPO Beta Hyperparameter
- Heuristic:LMCache LMCache Chunk Size And Default Config
- Heuristic:Axolotl ai cloud Axolotl FSDP Configuration Guide
- Heuristic:Allenai Open instruct Disable Dropout In RL
- Heuristic:Microsoft DeepSpeedExamples SuperOffload NUMA Binding
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:ClickHouse ClickHouse ThinLTO Build Tradeoffs
- Heuristic:Deepspeedai DeepSpeed Vocabulary Tensor Core Alignment
- Heuristic:Alibaba ROLL GPU Memory Offload Strategy
- Heuristic:CARLA simulator Carla Traffic Manager Sync Mode
Environments
- Environment:Openai Whisper PyTorch CUDA
- Environment:Apache Kafka Docker Build Environment
- Environment:Wandb Weave Trace Server Infrastructure
- Environment:Iamhankai Forest of Thought Python CUDA Runtime
- Environment:Dotnet Machinelearning Native Build Toolchain
- Environment:Getgauge Taiko Node Runtime
- Environment:Microsoft Playwright Platform Support Environment
- Environment:Huggingface Optimum GPTQ Quantization Environment
- Environment:Langgenius Dify Vector Database Environment
- Environment:HKUDS AI Trader API Credentials