Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:BerriAI Litellm SDK Completion
- Workflow:Bentoml BentoML Multi Model Composition
- Workflow:Huggingface Optimum GPTQ Quantization
- Workflow:Wandb Weave Tracing Setup
- Workflow:Apache Dolphinscheduler RPC Service Communication
- Workflow:LLMBook zh LLMBook zh github io Inference and Quantization
- Workflow:Risingwavelabs Risingwave Sink Connector Pipeline
- Workflow:Hpcaitech ColossalAI LLaMA Continual Pretraining
- Workflow:Groq Groq python Streaming Chat Completion
- Workflow:Dotnet Machinelearning Binary Classification Pipeline
Principles
- Principle:Romsto Speculative Decoding Input Tokenization
- Principle:EvolvingLMMs Lab Lmms eval Data Sharding
- Principle:Microsoft Agent framework Sample Validation
- Principle:DistrictDataLabs Yellowbrick Precision Recall Analysis
- Principle:Hpcaitech ColossalAI RAG Deployment
- Principle:Nautechsystems Nautilus trader Risk Management
- Principle:Promptfoo Promptfoo Provider Resolution
- Principle:Dagster io Dagster Sensor Driven Pipelines
- Principle:Huggingface Open r1 High Concurrency Inference
- Principle:ContextualAI HALOs Environment Setup
Implementations
- Implementation:Kubeflow Pipelines Dsl Pipeline Decorator
- Implementation:FlagOpen FlagEmbedding LLM Embedder Eval ICL
- Implementation:LMCache LMCache Base Cache Policy
- Implementation:Apache Paimon MathUtils
- Implementation:Open compass VLMEvalKit VGRPBench Futoshiki
- Implementation:Evidentlyai Evidently Legacy Text Length Feature
- Implementation:Tencent Ncnn YOLOv3 Example
- Implementation:Datahub project Datahub Entity Metadata Mutations
- Implementation:Ucbepic Docetl UseRestorePipeline
- Implementation:Pyro ppl Pyro GuideMessenger
Heuristics
- Heuristic:CrewAIInc CrewAI MCP Timeout And Retry Strategy
- Heuristic:SeldonIO Seldon core Tracing Latency Tip
- Heuristic:Eric mitchell Direct preference optimization Disable Sampling During Eval
- Heuristic:Dotnet Machinelearning Sparsity Threshold Optimization
- Heuristic:Apache Kafka JVM GC Tuning Defaults
- Heuristic:Openai Openai agents python GPT 5 Reasoning Settings
- Heuristic:Unslothai Unsloth VLLM Memory Utilization
- Heuristic:Apache Spark Warning Deprecated DStream Streaming
- Heuristic:Apache Druid Explore Compare Query Strategy
- Heuristic:Mlc ai Mlc llm OpenCL Memory Floor Workaround
Environments
- Environment:Apache Kafka Docker Build Environment
- Environment:AUTOMATIC1111 Stable diffusion webui Python And PyTorch Runtime
- Environment:Fastai Fastbook Sklearn Environment
- Environment:ArroyoSystems Arroyo Python UDF Runtime
- Environment:Huggingface Diffusers PyTorch CUDA Runtime
- Environment:Apache Dolphinscheduler Node Pnpm Runtime
- Environment:Vibrantlabsai Ragas Python 3 9 Core Environment
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:Unstructured IO Unstructured GitHub Actions
- Environment:Volcengine Verl Ray Distributed Environment