Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lance format Lance Table Optimization
- Workflow:Googleapis Python genai Text Content Generation
- Workflow:Pyro ppl Pyro MCMC Inference
- Workflow:OpenBMB UltraFeedback Dataset Construction
- Workflow:Apache Paimon Vector Similarity Search
- Workflow:Snorkel team Snorkel Data Augmentation
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Iterative Dvc Remote Data Sync
- Workflow:Apache Spark Release Process
- Workflow:Openai Openai agents python Basic Agent Execution
Principles
- Principle:Mistralai Client python Response Processing
- Principle:Alibaba MNN Diffusion Engine Compilation
- Principle:MaterializeInc Materialize Cluster and Index Configuration
- Principle:Apache Spark Dependency Pinning
- Principle:Huggingface Optimum Quantized Weight Packing
- Principle:Getgauge Taiko Request Handler Functions
- Principle:ARISE Initiative Robosuite HDF5 Dataset Aggregation
- Principle:Confident ai Deepeval Evaluation Result Analysis
- Principle:Dagster io Dagster BI Tool Integration
- Principle:MaterializeInc Materialize Upgrade Validation
Implementations
- Implementation:Huggingface Trl GRPOTrainer Train Loop
- Implementation:Explodinggradients Ragas OpikTracer Class
- Implementation:Predibase Lorax Rotary Embedding
- Implementation:Vllm project Vllm Scaled MM Epilogues C2X
- Implementation:Risingwavelabs Risingwave OpensearchRestHighLevelClientAdapter
- Implementation:Open compass VLMEvalKit RefCOCODataset
- Implementation:Helicone Helicone ToFilterNode
- Implementation:Kubeflow Pipelines MLMD Service Protobuf
- Implementation:Apache Shardingsphere StandaloneContextManagerBuilder Build
- Implementation:Treeverse LakeFS Java SDK MetadataApi
Heuristics
- Heuristic:Hpcaitech ColossalAI CUDA Device Max Connections Tip
- Heuristic:Huggingface Datasets Warning Deprecated Pandas Builder
- Heuristic:Pyro ppl Pyro Numerical Stability Patterns
- Heuristic:Cypress io Cypress V8 Snapshot Memory
- Heuristic:Princeton nlp Tree of thought llm Global State Token Counting
- Heuristic:Duckdb Duckdb Version Sync Across Files
- Heuristic:Evidentlyai Evidently Statistical Test Auto Selection
- Heuristic:Intel Ipex llm DeepSpeed Tensor Parallel Tips
- Heuristic:Princeton nlp SimPO Left Truncation Strategy
- Heuristic:Huggingface Datasets Num Proc Guidelines
Environments
- Environment:Vespa engine Vespa CMake Cpp23 Build Environment
- Environment:Duckdb Duckdb Extension Distribution Env
- Environment:Protectai Modelscan H5py Optional
- Environment:Roboflow Rf detr ONNX Export Environment
- Environment:Sktime Pytorch forecasting Matplotlib Plotting Dependencies
- Environment:Facebookresearch Audiocraft AudioCraft Environment Variables
- Environment:Huggingface Open r1 vLLM Server
- Environment:Pola rs Polars GPU Execution Environment
- Environment:Lance format Lance Rust Toolchain
- Environment:Lucidrains X transformers Python Environment