Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
- Workflow:Datajuicer Data juicer Custom Operator Development
- Workflow:Huggingface Transformers Model Training With Trainer
- Workflow:Ggml org Llama cpp OpenAI Compatible Server
- Workflow:Guardrails ai Guardrails Streaming Validation
- Workflow:Risingwavelabs Risingwave Iceberg Lakehouse Ingestion
- Workflow:Pola rs Polars DataFrame Aggregation and Grouping
- Workflow:Pyro ppl Pyro Bayesian Regression
- Workflow:Avhz RustQuant Analytic Option Pricing
- Workflow:Shiyu coder Kronos Batch Prediction
Principles
- Principle:Iterative Dvc Database Import
- Principle:Kubeflow Kubeflow Train Model
- Principle:Huggingface Datasets PDF Feature Handling
- Principle:ClickHouse ClickHouse Banned Function Enforcement
- Principle:SqueezeAILab ETS Reward Model Serving
- Principle:AUTOMATIC1111 Stable diffusion webui Component Reuse
- Principle:Romsto Speculative Decoding Rejection Sampling Adjustment
- Principle:OpenGVLab InternVL VQA Accuracy Scoring
- Principle:Recommenders team Recommenders Negative Sampling For Implicit Feedback
- Principle:Apache Paimon Lance Table Configuration
Implementations
- Implementation:Facebookresearch Audiocraft UnetTransformer
- Implementation:Pytorch Serve Scriptable Tokenizer Handler
- Implementation:Pyro ppl Pyro Contract Tensor Tree
- Implementation:NVIDIA NeMo Curator JusText Extractor
- Implementation:NVIDIA NeMo Curator ModelStage
- Implementation:FlagOpen FlagEmbedding Embedding Similarity Scoring
- Implementation:Openai Whisper Median Filter
- Implementation:ThreeSR Awesome Inference Time Scaling Config Function
- Implementation:Apache Flink Mapreduce HadoopOutputFormatBase
- Implementation:Facebookresearch Habitat lab HumanoidBaseController
Heuristics
- Heuristic:Astronomer Astronomer cosmos Static Parser Hang Workaround
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Batch Size One Default
- Heuristic:Hpcaitech ColossalAI Empty Cache Between Phases
- Heuristic:Deepseek ai Janus Bfloat16 Operation Workarounds
- Heuristic:ARISE Initiative Robomimic BatchNorm To GroupNorm For EMA
- Heuristic:DataTalksClub Data engineering zoomcamp GCS Upload Timeout Workaround
- Heuristic:Eventual Inc Daft Runner Selection Guide
- Heuristic:Diagram of thought Diagram of thought Strict Vs Flexible Critic Rigor
- Heuristic:Apache Dolphinscheduler JDBC Security Blocklist
- Heuristic:Infiniflow Ragflow Citation Threshold Decay
Environments
- Environment:Evidentlyai Evidently Python Core Environment
- Environment:Fastai Fastbook Python FastAI Environment
- Environment:Apache Spark Python Environment
- Environment:Cypress io Cypress Browser Requirements
- Environment:Datahub project Datahub Java 17 Backend Environment
- Environment:Danijar Dreamerv3 JAX CUDA
- Environment:Teamcapybara Capybara Selenium WebDriver Environment
- Environment:Apache Airflow Kubernetes Helm Environment
- Environment:Iterative Dvc DVC Environment Variables
- Environment:EvolvingLMMs Lab Lmms eval GPU Compute Environment