Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Heibaiying BigData Notes HBase Java CRUD Operations
- Workflow:Treeverse LakeFS Write Audit Publish With Hooks
- Workflow:Neuml Txtai Agent Execution
- Workflow:Guardrails ai Guardrails LLM Output Validation
- Workflow:Explodinggradients Ragas Test Data Generation
- Workflow:Ggml org Ggml Vision Model Inference
- Workflow:Eventual Inc Daft Distributed UDF Processing
- Workflow:Deepset ai Haystack Document Indexing Pipeline
- Workflow:Cypress io Cypress Component Test Execution
- Workflow:Apache Shardingsphere Metadata DDL Refresh
Principles
- Principle:Ollama Ollama Structured Output
- Principle:SeleniumHQ Selenium Pull Request Submission
- Principle:Speechbrain Speechbrain Beam Search Decoding
- Principle:Langgenius Dify Application Publishing
- Principle:Scikit learn contrib Imbalanced learn Easy Ensemble
- Principle:CarperAI Trlx Offline RL Training
- Principle:Sktime Pytorch forecasting Reversible Instance Normalization
- Principle:Fastai Fastbook Learner Abstraction
- Principle:MaterializeInc Materialize Pipeline Bootstrap
- Principle:PacktPublishing LLM Engineers Handbook Quantized Model Loading
Implementations
- Implementation:Iterative Dvc Pyproject Config
- Implementation:ARISE Initiative Robosuite TrajUtils
- Implementation:Iterative Dvc Testing Workspace Tests
- Implementation:Puppeteer Puppeteer Cdp ElementHandle
- Implementation:Treeverse LakeFS Java SDK ApiClient
- Implementation:Openai Openai node AzureOpenAI
- Implementation:Infiniflow Ragflow Logic Hooks
- Implementation:Microsoft LoRA Legacy Seq2Seq Utils
- Implementation:Openai Openai python Response Image Gen Call Completed
- Implementation:Protectai Llm guard Output Bias
Heuristics
- Heuristic:Online ml River ARF Drift Detection Sensitivity
- Heuristic:Openai Openai python Timeout Connection Defaults
- Heuristic:Heibaiying BigData Notes Kafka Consumer Offset Strategy Tip
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix High Res Crop Training
- Heuristic:Turboderp org Exllamav2 Paged Cache Configuration
- Heuristic:NVIDIA NeMo Aligner Adam State Offloading Tip
- Heuristic:Romsto Speculative Decoding Seed Fixing For Reproducibility
- Heuristic:Microsoft LoRA Label Smoothing NLG
- Heuristic:Alibaba MNN GPU Tuning Modes
- Heuristic:Microsoft BIPIA OpenAI Rate Limit Retry
Environments
- Environment:PacktPublishing LLM Engineers Handbook AWS SageMaker GPU Environment
- Environment:Sgl project Sglang CUDA
- Environment:Huggingface Peft Optional Quantization Backends
- Environment:Huggingface Datasets Python PyArrow Core
- Environment:Fede1024 Rust rdkafka Kafka Broker Runtime
- Environment:Datahub project Datahub Docker Quickstart Environment
- Environment:Apache Shardingsphere Etcd Cluster Coordination
- Environment:Iterative Dvc Git SCM Environment
- Environment:Mbzuai oryx Awesome LLM Post training Git CLI
- Environment:Guardrails ai Guardrails OpenTelemetry Tracing