Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit LLM Int8 Inference
- Workflow:TA Lib Ta lib python Installation And Setup
- Workflow:Mlfoundations Open flamingo Distributed Training
- Workflow:ARISE Initiative Robomimic Training Policy From Demonstrations
- Workflow:PeterL1n BackgroundMattingV2 Training pipeline
- Workflow:Neuml Txtai Pipeline Workflow Chaining
- Workflow:Guardrails ai Guardrails Server Deployment
- Workflow:Neuml Txtai Model Training
- Workflow:Cypress io Cypress Local Development Environment
- Workflow:Apache Flink File Source Pipeline
Principles
- Principle:Truera Trulens Session Initialization
- Principle:Gretelai Gretel synthetics LSTM Model Training
- Principle:OpenGVLab InternVL Optimizer Construction
- Principle:Treeverse LakeFS S3 Commit Management
- Principle:Neuml Txtai Embeddings Configuration
- Principle:FMInference FlexLLMGen HELM Batch Construction
- Principle:Datajuicer Data juicer Operator Package Registration
- Principle:Huggingface Datatrove Data Reading Framework
- Principle:PacktPublishing LLM Engineers Handbook LLM As Judge Evaluation
- Principle:SeleniumHQ Selenium WebDriver Session Creation
Implementations
- Implementation:Vespa engine Vespa SimpleTransformer AccentDrop
- Implementation:Apache Druid RegexpFilterControl
- Implementation:Alibaba MNN Protobuf Repeated Field H
- Implementation:Astronomer Astronomer cosmos DbtVirtualenvBaseOperator
- Implementation:Iterative Dvc Collect Plot Definitions
- Implementation:Online ml River Compose Select
- Implementation:Neuml Txtai HFTrainer Model
- Implementation:Facebookresearch Habitat lab Agent ABC
- Implementation:Kubeflow Pipelines XGBoost Train On Parquet Op
- Implementation:Rapidsai Cuml Lasso
Heuristics
- Heuristic:Kserve Kserve NCCL RoCE Auto Detection
- Heuristic:Deepseek ai Janus CFG Weight Tuning
- Heuristic:Huggingface Peft LoRA Initialization Strategy Selection
- Heuristic:Trailofbits Fickling Allowlist Maintenance
- Heuristic:Unstructured IO Unstructured Hi Res Model Configuration
- Heuristic:FlowiseAI Flowise Tool Ordering Convention
- Heuristic:OpenRLHF OpenRLHF Off Policy IS Correction Tip
- Heuristic:Lm sys FastChat Flash Attention GPU Requirements
- Heuristic:DataTalksClub Data engineering zoomcamp GCS Upload Timeout Workaround
- Heuristic:Bentoml BentoML Warning Deprecated Server Module
Environments
- Environment:Onnx Onnx Python Runtime Environment
- Environment:Fastai Fastbook CUDA GPU Environment
- Environment:Google deepmind Mujoco MJX Warp CUDA Environment
- Environment:Google research Deduplicate text datasets Rust Cargo Build Environment
- Environment:Helicone Helicone Node 20 TypeScript Runtime
- Environment:Allenai Open instruct Docker Container
- Environment:Pytorch Serve DeepSpeed Environment
- Environment:Huggingface Trl vLLM Generation Environment
- Environment:Intel Ipex llm Build Environment
- Environment:Mit han lab Llm awq CUDA Build Environment