Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1,000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Treeverse LakeFS Garbage Collection
- Workflow:Langgenius Dify RAG Pipeline Development
- Workflow:Farama Foundation Gymnasium RL Agent Training Loop
- Workflow:Teamcapybara Capybara Custom Selector Definition
- Workflow:Elevenlabs Elevenlabs python Voice Cloning
- Workflow:Snorkel team Snorkel Weak Supervision Pipeline
- Workflow:Langfuse Langfuse Trace ingestion pipeline
- Workflow:Apache Paimon Distributed Processing With Ray
- Workflow:CarperAI Trlx RLHF Dialogue Alignment
- Workflow:Apache Paimon Blob Storage With Descriptors
Principles
- Principle:Interpretml Interpret Optimal Transport Selection
- Principle:Webdriverio Webdriverio TestFrameworkIntegration
- Principle:Ggml org Ggml OpenCL GPU Computation
- Principle:Mit han lab Llm awq Interactive Multimodal Demo
- Principle:Ollama Ollama CLI Format Utility
- Principle:Lance format Lance Compaction Planning
- Principle:PacktPublishing LLM Engineers Handbook Quantized Model Loading
- Principle:Deepspeedai DeepSpeed Evoformer Attention Kernels
- Principle:Predibase Lorax Inference Result Evaluation
- Principle:InternLM Lmdeploy Response Processing
Implementations
- Implementation:Huggingface Transformers Load Adapter
- Implementation:Kserve Kserve TrainedModel Full CRD
- Implementation:Cypress io Cypress GetInstallMessage
- Implementation:Webdriverio Webdriverio BrowserStack Util
- Implementation:Kornia Kornia Load Image
- Implementation:Openai CLIP Dataset Preparation Wrapper
- Implementation:Google deepmind Dm control Fruitfly V2
- Implementation:Apache Paimon FileIO
- Implementation:ClickHouse ClickHouse TRAP Macro
- Implementation:Confident ai Deepeval Error Hierarchy
Heuristics
- Heuristic:Openai Openai python Retry Backoff Strategy
- Heuristic:Nightwatchjs Nightwatch Safari Parallel Limitation
- Heuristic:Elevenlabs Elevenlabs python Audio Buffer Sizes
- Heuristic:Microsoft Semantic kernel Experimental Feature Opt In
- Heuristic:Apache Beam Watermark Update Throttling
- Heuristic:Speechbrain Speechbrain Score Normalization Tips
- Heuristic:Pola rs Polars Streaming For Large Datasets
- Heuristic:Fede1024 Rust rdkafka Queue Buffering Priority
- Heuristic:Microsoft Playwright Browser Specific Workarounds
- Heuristic:LMCache LMCache Health Monitor Thresholds
Environments
- Environment:Guardrails ai Guardrails Python 3 10 Runtime
- Environment:Fastai Fastbook NLP SpaCy Environment
- Environment:Run llama Llama index Sentence Transformers Finetuning
- Environment:Kubeflow Pipelines KFP Backend Deployment
- Environment:Astronomer Astronomer cosmos Kubernetes Provider
- Environment:Huggingface Datasets PyTorch Integration
- Environment:Interpretml Interpret Blackbox Explainer Dependencies
- Environment:Anthropics Anthropic sdk python Python SDK Core Environment
- Environment:Datahub project Datahub Java 17 Backend Environment
- Environment:Langfuse Langfuse S3 Compatible Storage