Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lance format Lance Dataset Lifecycle
- Workflow:Volcengine Verl Multi Turn Tool Use Training
- Workflow:Cohere ai Cohere python Model Finetuning
- Workflow:Cohere ai Cohere python Text Embedding
- Workflow:Mage ai Mage ai API Source Extraction
- Workflow:Mistralai Client python Finetuning Job Management
- Workflow:Huggingface Optimum Model Export
- Workflow:Danijar Dreamerv3 Train And Evaluate
- Workflow:CARLA simulator Carla Autonomous Navigation
- Workflow:Unstructured IO Unstructured Chunking And Embedding
Principles
- Principle:NVIDIA NeMo Curator Text Deduplication
- Principle:ClickHouse ClickHouse Optimized Memcpy
- Principle:Pytorch Serve LLM Text Generation
- Principle:Groq Groq python Model Listing
- Principle:Online ml River Bandit Datasets
- Principle:Isaac sim IsaacGymEnvs Robot Controller Configuration
- Principle:MaterializeInc Materialize Dbt Profile Configuration
- Principle:Spotify Luigi Spark Configuration
- Principle:LaurentMazare Tch rs LLM Weight Conversion
- Principle:Langchain ai Langchain Embedding Model Initialization
Implementations
- Implementation:Norrrrrrr lyn WAInjectBench SentenceTransformer Encode
- Implementation:Mlc ai Mlc llm MLCEngine Mobile
- Implementation:LaurentMazare Tch rs Nn Linear
- Implementation:Risingwavelabs Risingwave BinlogHistoryRecordComparator
- Implementation:Datahub project Datahub RequiresMutable
- Implementation:MaterializeInc Materialize Query Fitness Module
- Implementation:NVIDIA NeMo Curator CaptionGenerationStage
- Implementation:Apache Dolphinscheduler WorkerGroupDispatcherCoordinator Dispatch
- Implementation:Scikit learn Scikit learn MultiOutputClassifier
- Implementation:ArroyoSystems Arroyo Mqtt Connector
Heuristics
- Heuristic:Langchain ai Langchain Error Context Preservation
- Heuristic:Alibaba MNN GPU Tuning Modes
- Heuristic:NVIDIA DALI Batch Size Tuning
- Heuristic:Microsoft Agent framework PowerFx Python Version Limit
- Heuristic:Mlc ai Mlc llm Metal KV Cache Capacity Limit
- Heuristic:Ggml org Ggml Memory Allocation Strategy
- Heuristic:Microsoft DeepSpeedExamples SuperOffload NUMA Binding
- Heuristic:Mit han lab Llm awq GPU Memory Management Patterns
- Heuristic:Sgl project Sglang Chunked Prefill OOM Prevention
- Heuristic:Apache Druid Query Error Suggestion Patterns
Environments
- Environment:Mistralai Client python Azure Deployment Environment
- Environment:Cohere ai Cohere python Python SDK Runtime
- Environment:Explodinggradients Ragas Python Runtime Environment
- Environment:Langgenius Dify Python Backend Environment
- Environment:Openai Openai python Realtime WebSocket
- Environment:Fastai Fastbook Sklearn Environment
- Environment:Datahub project Datahub Docker Quickstart Environment
- Environment:Infiniflow Ragflow Docker Infrastructure
- Environment:FMInference FlexLLMGen CUDA GPU
- Environment:Kubeflow Pipelines Kubernetes Cluster