Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft Onnxruntime Distributed Model Training
- Workflow:Bitsandbytes foundation Bitsandbytes 4bit QLoRA Inference
- Workflow:PeterL1n BackgroundMattingV2 Image matting inference
- Workflow:ThreeSR Awesome Inference Time Scaling Automated Paper Addition
- Workflow:Datahub project Datahub CLI Metadata Ingestion
- Workflow:Nightwatchjs Nightwatch Page Object Pattern
- Workflow:Explodinggradients Ragas Metric Prompt Optimization
- Workflow:Kserve Kserve Deploying InferenceService
- Workflow:ArroyoSystems Arroyo Local Pipeline Execution
- Workflow:Iamhankai Forest of Thought CGDM Post Processing
Principles
- Principle:Pola rs Polars Expression Pipeline Building
- Principle:Treeverse LakeFS S3 Commit Management
- Principle:Kubeflow Kubeflow Register Model
- Principle:Microsoft Onnxruntime TensorBoard Monitoring
- Principle:MaterializeInc Materialize Release Commit and Tagging
- Principle:Puppeteer Puppeteer JavaScript Object Handling
- Principle:Fastai Fastbook Backpropagation
- Principle:Nightwatchjs Nightwatch Page Commands
- Principle:DataTalksClub Data engineering zoomcamp Loading Path Selection
- Principle:Apache Airflow Trigger Handling
Implementations
- Implementation:Eventual Inc Daft Regexp Extract
- Implementation:NVIDIA DALI EfficientNet Backbone
- Implementation:Bitsandbytes foundation Bitsandbytes HPU Dequantize 4bit
- Implementation:Ucbepic Docetl Directive Registry
- Implementation:Bentoml BentoML CLI Env Manager
- Implementation:Norrrrrrr lyn WAInjectBench torch save Checkpoint
- Implementation:Sktime Pytorch forecasting TimeSeriesDataSet Init
- Implementation:Kserve Kserve InferenceService Full CRD
- Implementation:Pyro ppl Pyro ImproperUniform
- Implementation:Apache Beam PortableRunner Run
Heuristics
- Heuristic:Treeverse LakeFS Retry Backoff Configuration
- Heuristic:Princeton nlp SimPO Hyperparameter Tuning
- Heuristic:Farama Foundation Gymnasium Render Mode Selection Guide
- Heuristic:Microsoft DeepSpeedExamples RLHF Stability Constraints
- Heuristic:Mlc ai Mlc llm OpenCL Memory Floor Workaround
- Heuristic:Apache Airflow Variable Access Pattern
- Heuristic:Microsoft Semantic kernel Telemetry Log Level Configuration
- Heuristic:Microsoft DeepSpeedExamples Gradient Checkpointing Tradeoff
- Heuristic:Kubeflow Kubeflow Kustomize Build Pipe Apply Pattern
- Heuristic:Webdriverio Webdriverio Stale Element Auto Refetch
Environments
- Environment:Openai Openai node Node 20 Runtime
- Environment:Lakeraai Pint benchmark Python 310 With Transformers
- Environment:TobikoData Sqlmesh BigQuery Connection
- Environment:Apache Druid Integration Test Docker
- Environment:Puppeteer Puppeteer Configuration Environment Variables
- Environment:DevExpress Testcafe Node Runtime
- Environment:Huggingface Open r1 vLLM Server
- Environment:Kserve Kserve Gateway API
- Environment:Deepset ai Haystack GPU Device Environment
- Environment:Vllm project Vllm GitHub