Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mistralai Client python Text Embeddings
- Workflow:Duckdb Duckdb Benchmark Execution
- Workflow:Datahub project Datahub CLI Metadata Ingestion
- Workflow:FMInference FlexLLMGen Data Wrangling Batch Inference
- Workflow:Helicone Helicone Cost Calculation Pipeline
- Workflow:Sdv dev SDV Single table synthesis
- Workflow:Ucbepic Docetl YAML Pipeline Execution
- Workflow:Vespa engine Vespa Logging framework initialization
- Workflow:Rapidsai Cuml Random Forest Training And Inference
- Workflow:Neuml Txtai Agent Execution
Principles
- Principle:Openclaw Openclaw Deployment Health Verification
- Principle:Helicone Helicone Provider Key Management
- Principle:SeleniumHQ Selenium Test Execution Strategy
- Principle:Pola rs Polars Data Output Validation
- Principle:Openai Openai agents python Approval Processing
- Principle:Deepspeedai DeepSpeed Op Builder System
- Principle:Mlflow Mlflow Local Model Serving
- Principle:Zai org CogVideo Temporal Autoencoding
- Principle:Apache Paimon Lance Table Configuration
- Principle:Romsto Speculative Decoding Rejection Sampling Adjustment
Implementations
- Implementation:Microsoft Onnxruntime CUDA BatchNormGrad
- Implementation:DataExpert io Data engineer handbook Statsig Initialize
- Implementation:Infiniflow Ragflow Common Constants
- Implementation:SeleniumHQ Selenium Bazel Build And Go Wrapper
- Implementation:Triton inference server Server GenQaSequenceModels
- Implementation:Puppeteer Puppeteer ESLint Config
- Implementation:Sktime Pytorch forecasting NaNLabelEncoder
- Implementation:Google deepmind Mujoco Render GL2
- Implementation:Mlc ai Mlc llm Top P Pivot
- Implementation:Huggingface Diffusers ControlNetModel Forward
Heuristics
- Heuristic:Scikit learn Scikit learn Data Leakage Prevention
- Heuristic:EvolvingLMMs Lab Lmms eval Limit Flag Testing Only
- Heuristic:OpenBMB UltraFeedback GPU Memory Utilization
- Heuristic:NVIDIA NeMo Aligner Higher Stability Log Probs
- Heuristic:Unstructured IO Unstructured Chunk Size Tuning
- Heuristic:InternLM Lmdeploy KV Quantization Tradeoffs
- Heuristic:Treeverse LakeFS Batch Delay Tuning
- Heuristic:ARISE Initiative Robosuite XML Reset Method Tradeoff
- Heuristic:LaurentMazare Tch rs Device Fallback Pattern
- Heuristic:Huggingface Diffusers Guidance Scale Defaults
Environments
- Environment:Open compass VLMEvalKit GPU CUDA Environment
- Environment:Snorkel team Snorkel SpaCy NLP
- Environment:Junyanz Pytorch CycleGAN and pix2pix DDP Multi GPU
- Environment:Roboflow Rf detr ONNX Export Environment
- Environment:Pytorch Serve vLLM Engine Environment
- Environment:Google deepmind Dm control OSMesa Software Rendering
- Environment:Kubeflow Kubeflow Kubectl Kustomize CLI Environment
- Environment:Alibaba ROLL Python Runtime Environment
- Environment:Intel Ipex llm Build Environment
- Environment:Microsoft Autogen Studio Server Environment