Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Spotify Luigi Hadoop MapReduce Pipeline
- Workflow:Iterative Dvc Data Tracking
- Workflow:Langgenius Dify Application Creation
- Workflow:Mit han lab Llm awq TinyChat LLM Deployment
- Workflow:Treeverse LakeFS Garbage Collection
- Workflow:Huggingface Datatrove FineWeb Dataset Creation
- Workflow:Dagster io Dagster Modal Serverless Pipeline
- Workflow:FlowiseAI Flowise Evaluation Pipeline
- Workflow:Intel Ipex llm LoRA Finetuning
- Workflow:Tencent Ncnn PyTorch Model Conversion and Inference
Principles
- Principle:CarperAI Trlx Model Checkpointing
- Principle:FlagOpen FlagEmbedding Distributed Reranker Training
- Principle:Huggingface Optimum Pipeline Inference Execution
- Principle:Spotify Luigi Dependency Analysis
- Principle:Duckdb Duckdb Zstandard Compression
- Principle:Arize ai Phoenix Client Initialization
- Principle:Apache Airflow Post Release Activities
- Principle:Huggingface Transformers Distributed Checkpointing
- Principle:Dotnet Machinelearning Binary Model Evaluation
- Principle:NVIDIA DALI Anchor Box Encoding
Implementations
- Implementation:Sgl project Sglang CPU BMM
- Implementation:Webdriverio Webdriverio Remote Function
- Implementation:Pyro ppl Pyro Distributions Init
- Implementation:Online ml River Datasets Insects
- Implementation:Mage ai Mage ai Chargebee Common Credit Notes Schema
- Implementation:Apache Paimon BlobFormatWriter Write
- Implementation:CrewAIInc CrewAI Apify Actors Tool
- Implementation:Mlc ai Mlc llm Conversation Protocol
- Implementation:Farama Foundation Gymnasium Graph Space
- Implementation:Ray project Ray PlacementGroupCreationOptions
Heuristics
- Heuristic:Sgl project Sglang Schedule Conservativeness Tuning
- Heuristic:MarketSquare Robotframework browser MacOS Sonoma Startup Delay
- Heuristic:AnswerDotAI RAGatouille Searcher Configuration By Collection Size
- Heuristic:Tencent Ncnn Vulkan Pipeline Warmup
- Heuristic:Microsoft Semantic kernel Experimental Feature Opt In
- Heuristic:Mbzuai oryx Awesome LLM Post training API Rate Limit Retry Strategy
- Heuristic:Mlc ai Web llm KV Cache Window Configuration
- Heuristic:CrewAIInc CrewAI RAG Search Defaults
- Heuristic:NVIDIA NeMo Curator Semantic Dedup Cluster Sizing
- Heuristic:Kubeflow Pipelines Resource Sizing For Components
Environments
- Environment:Triton inference server Server Docker Container Build
- Environment:Alibaba ROLL Megatron Training Environment
- Environment:Lucidrains X transformers PyTorch CUDA
- Environment:Huggingface Datasets Audio Video Dependencies
- Environment:Lance format Lance SIMD And Platform Requirements
- Environment:Ray project Ray Docker GPU Environment
- Environment:Isaac sim IsaacGymEnvs IsaacGym Preview 4
- Environment:Openai Whisper Numba
- Environment:Deepspeedai DeepSpeed XPU Environment
- Environment:Mistralai Client python Realtime Transcription Environment