Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Openclaw Openclaw Agent Message Loop
- Workflow:Bentoml BentoML Multi Model Composition
- Workflow:Hiyouga LLaMA Factory DPO Preference Alignment
- Workflow:Ucbepic Docetl Long Document Chunking
- Workflow:Open compass VLMEvalKit Video Benchmark Evaluation
- Workflow:Sktime Pytorch forecasting TFT Demand Forecasting
- Workflow:Predibase Lorax Multi Adapter Merging
- Workflow:Sgl project Sglang Frontend Language Multi Turn Chat
- Workflow:PacktPublishing LLM Engineers Handbook LLM Finetuning
- Workflow:AnswerDotAI RAGatouille ColBERT Training
Principles
- Principle:ThreeSR Awesome Inference Time Scaling Paper Detail Retrieval
- Principle:Scikit learn Scikit learn Parameter Space Definition
- Principle:Iterative Dvc Dependency Graph Validation
- Principle:Apache Airflow Security Review Planning
- Principle:Anthropics Anthropic sdk python Response Processing
- Principle:Tensorflow Tfjs Model Compilation
- Principle:OWASP Www project top 10 for large language model applications Automated Vulnerability Scanning
- Principle:Ray project Ray Remote Function Definition
- Principle:Risingwavelabs Risingwave Connector Observability
- Principle:Fastai Fastbook DataLoaders Creation
Implementations
- Implementation:Microsoft Autogen Studio Run View
- Implementation:Haosulab ManiSkill XArm7Ability
- Implementation:Openai Openai node Conversations Resource
- Implementation:Nightwatchjs Nightwatch Chrome Options Type Definitions
- Implementation:Open compass VLMEvalKit OlmOCRBench Evaluator
- Implementation:Groq Groq python JSONL Construction
- Implementation:Langgenius Dify Env Template Copy
- Implementation:Openclaw Openclaw LoadWorkspaceBootstrapFiles
- Implementation:Infiniflow Ragflow File Util
- Implementation:Treeverse LakeFS Java SDK InternalApi
Heuristics
- Heuristic:Cohere ai Cohere python Tokenizer Cache With TTL
- Heuristic:Apache Kafka JVM GC Tuning Defaults
- Heuristic:Testtimescaling Testtimescaling github io Hardcoded IDs vs Registry
- Heuristic:Huggingface Trl QLoRA BF16 Adapter Casting
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Test Train Option Consistency
- Heuristic:Unstructured IO Unstructured Hi Res Model Configuration
- Heuristic:Protectai Modelscan Unknown Opcodes Assume Critical
- Heuristic:Datahub project Datahub Git Worktree Gradle Fix
- Heuristic:Apache Druid Query Error Suggestion Patterns
- Heuristic:Unstructured IO Unstructured Multi Python Matrix
Environments
- Environment:Lm sys FastChat Python Core Dependencies
- Environment:HKUDS AI Trader Browser Runtime
- Environment:Sgl project Sglang CUDA Runtime
- Environment:Ggml org Ggml Vulkan GPU Environment
- Environment:SeleniumHQ Selenium Selenium Manager Runtime
- Environment:DistrictDataLabs Yellowbrick Optional NLP Dependencies
- Environment:Lance format Lance Python Environment
- Environment:Explodinggradients Ragas Google Drive Backend Environment
- Environment:DataExpert io Data engineer handbook Python Development Environment
- Environment:Microsoft Onnxruntime CUDA GPU Environment