Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Iterative Dvc Pipeline Reproduction
- Workflow:Heibaiying BigData Notes Hive Data Warehouse Operations
- Workflow:Tensorflow Serving Batched Inference Pipeline
- Workflow:DataExpert io Data engineer handbook PySpark Iceberg Job Execution
- Workflow:Kornia Kornia ONNX Model Pipeline
- Workflow:Huggingface Open r1 GRPO Reasoning Training
- Workflow:CARLA simulator Carla Traffic Generation
- Workflow:DevExpress Testcafe CLI Test Execution
- Workflow:ARISE Initiative Robosuite Domain Randomization Training
- Workflow:Vllm project Vllm Vision Language Inference
Principles
- Principle:Deepspeedai DeepSpeed Reward Model Training
- Principle:Langchain ai Langgraph Execution Resumption
- Principle:Liu00222 Open Prompt Injection Conditional Probability Computation
- Principle:Snorkel team Snorkel Labeling Function Analysis
- Principle:Predibase Lorax Health Check Verification
- Principle:Helicone Helicone Payment Processing
- Principle:SeleniumHQ Selenium Page Object Interaction Methods
- Principle:Run llama Llama index Agent Execution
- Principle:Openclaw Openclaw Binding Configuration
- Principle:Volcengine Verl Answer Extraction
Implementations
- Implementation:Openai Openai node Responses Resource
- Implementation:Openai Openai python Eval Retrieve Response
- Implementation:TobikoData Sqlmesh EditorInspector
- Implementation:Predibase Lorax Exllama V1 CUDA Bindings
- Implementation:Elevenlabs Elevenlabs python DubbingMetadataResponse
- Implementation:Astronomer Astronomer cosmos Dbt Output Parser
- Implementation:Apache Druid HeaderBar
- Implementation:Openai Openai python Embedding Create Params
- Implementation:NVIDIA DALI RunImpl CUDA Kernel
- Implementation:ARISE Initiative Robosuite ManipulatorModel
Heuristics
- Heuristic:Google deepmind Dm control Rendering Backend Selection Tips
- Heuristic:Google research Deduplicate text datasets Parallel Job Scaling By Data Size
- Heuristic:Mlc ai Mlc llm Metal KV Cache Capacity Limit
- Heuristic:Norrrrrrr lyn WAInjectBench LoRA Rank Alpha Selection
- Heuristic:VainF Torch Pruning Channel Rounding Alignment
- Heuristic:Allenai Open instruct Logprob Clamping
- Heuristic:FlagOpen FlagEmbedding Dynamic Batch Size Reduction
- Heuristic:Mbzuai oryx Awesome LLM Post training API Rate Limit Retry Strategy
- Heuristic:Cohere ai Cohere python Warning Deprecated Legacy Generate API
- Heuristic:FlowiseAI Flowise Edge Connection Type Matching
Environments
- Environment:SqueezeAILab ETS Multi GPU Sglang Runtime
- Environment:LMCache LMCache VLLM Serving Engine
- Environment:Treeverse LakeFS LakeFS Server Environment
- Environment:EvolvingLMMs Lab Lmms eval GPU Compute Environment
- Environment:Shiyu coder Kronos HuggingFace Hub Access
- Environment:Langchain ai Langgraph Docker Deployment Environment
- Environment:Hpcaitech ColossalAI GRPO Distributed Environment
- Environment:Princeton nlp SimPO CUDA Training
- Environment:Unslothai Unsloth CUDA VLLM
- Environment:Google research Deduplicate text datasets Python HuggingFace Environment