Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Infiniflow Ragflow Knowledge Base Document Ingestion
- Workflow:Haosulab ManiSkill Imitation Learning Pipeline
- Workflow:Alibaba ROLL Agentic RL Training Pipeline
- Workflow:Arize ai Phoenix LLM Evaluation Pipeline
- Workflow:Pyro ppl Pyro MCMC Inference
- Workflow:Googleapis Python genai Context Caching
- Workflow:Alibaba ROLL RLVR Training Pipeline
- Workflow:DataExpert io Data engineer handbook PySpark Job Testing
- Workflow:Mit han lab Llm awq HuggingFace Model Export
- Workflow:Isaac sim IsaacGymEnvs Policy Inference and Evaluation
Principles
- Principle:Sdv dev SDV Fixed Combinations Constraint
- Principle:Spcl Graph of thoughts Thought State Management
- Principle:LaurentMazare Tch rs ImageNet Preprocessing
- Principle:Mbzuai oryx Awesome LLM Post training Collection Parameter Configuration
- Principle:Danijar Dreamerv3 Distributed Actor Inference
- Principle:Openai Openai python Training Data Preparation
- Principle:Infiniflow Ragflow Citation Insertion
- Principle:AnswerDotAI RAGatouille Model Training
- Principle:Webdriverio Webdriverio Async Iteration
- Principle:Cypress io Cypress System Test Validation
Implementations
- Implementation:AUTOMATIC1111 Stable diffusion webui Canvas Zoom And Pan
- Implementation:Googleapis Python genai Documents
- Implementation:Ucbepic Docetl FastShouldOptimize
- Implementation:Togethercomputer Together python Image Prompt Format
- Implementation:Huggingface Datasets Dataset To Json
- Implementation:Microsoft DeepSpeedExamples Load Model SuperOffload
- Implementation:Apache Paimon RenamingSnapshotCommit
- Implementation:Hpcaitech ColossalAI Model Utils
- Implementation:ArroyoSystems Arroyo Recovering State
- Implementation:Langgenius Dify UseEducation
Heuristics
- Heuristic:Langfuse Langfuse ClickHouse FINAL Skip Optimization
- Heuristic:Kornia Kornia Lazy Loading Optional Deps
- Heuristic:FlagOpen FlagEmbedding Same Dataset Batching Tip
- Heuristic:AnswerDotAI RAGatouille Searcher Configuration By Collection Size
- Heuristic:Open compass VLMEvalKit Judge Model Selection By Dataset
- Heuristic:Huggingface Diffusers Guidance Scale Defaults
- Heuristic:Kubeflow Kubeflow Kustomize Build Pipe Apply Pattern
- Heuristic:NVIDIA DALI Warning Deprecated C API V1 Functions
- Heuristic:Princeton nlp SimPO Hyperparameter Tuning
- Heuristic:Vespa engine Vespa Warning Deprecated Cloud API Constructors
Environments
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:Mbzuai oryx Awesome LLM Post training Python Requests
- Environment:Datajuicer Data juicer Ray Cluster Environment
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Kserve Kserve Leader Worker Set
- Environment:Rapidsai Cuml CUDA GPU
- Environment:Dotnet Machinelearning Native Build Toolchain
- Environment:Fede1024 Rust rdkafka Kafka Broker Runtime
- Environment:Dagster io Dagster Python 3 10 Runtime
- Environment:Trailofbits Fickling Python Runtime