Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Confident ai Deepeval Synthetic Dataset Generation
- Workflow:Openclaw Openclaw Channel Connection
- Workflow:Haosulab ManiSkill Custom Task Development
- Workflow:Unslothai Unsloth QLoRA SFT Finetuning
- Workflow:Microsoft Onnxruntime On Device Training
- Workflow:CarperAI Trlx ILQL Offline Training
- Workflow:NVIDIA NeMo Curator Text Curation Pipeline
- Workflow:Risingwavelabs Risingwave Docker Deployment
- Workflow:Romsto Speculative Decoding Interactive CLI Comparison
- Workflow:Microsoft DeepSpeedExamples VisualChat Multimodal Training
Principles
- Principle:Infiniflow Ragflow Environment Configuration
- Principle:Microsoft Agent framework Magentic Orchestration Pattern
- Principle:Norrrrrrr lyn WAInjectBench JSONL Results Serialization
- Principle:Scikit learn contrib Imbalanced learn Balanced Bagging
- Principle:Online ml River Streaming Adjusted Rand
- Principle:Tencent Ncnn Top K Classification
- Principle:SqueezeAILab ETS Evaluation Reporting
- Principle:HKUDS AI Trader Portfolio Valuation
- Principle:Treeverse LakeFS Hook Result Review
- Principle:Langchain ai Langgraph Interrupt Definition
Implementations
- Implementation:MaterializeInc Materialize Composition Sql Testdrive
- Implementation:Google deepmind Mujoco MJWarp Constraint
- Implementation:Deepseek ai Janus Conversation Template
- Implementation:ClickHouse ClickHouse Install Script
- Implementation:Apache Paimon MemorySliceInput
- Implementation:Scikit learn Scikit learn BaseForest Fit
- Implementation:Datahub project Datahub ProtobufExtensionFieldVisitor
- Implementation:ClickHouse ClickHouse Review Severity Model
- Implementation:Infiniflow Ragflow DialogService Update Retrieval Settings
- Implementation:LMCache LMCache Bloom Filter
Heuristics
- Heuristic:PeterL1n BackgroundMattingV2 Data Augmentation Strategy
- Heuristic:MaterializeInc Materialize CI Retry Strategies
- Heuristic:Recommenders team Recommenders TensorFlow Session Ordering
- Heuristic:Speechbrain Speechbrain Data Augmentation Defaults
- Heuristic:Astronomer Astronomer cosmos Dbt Invocation Mode Selection
- Heuristic:Lm sys FastChat Tokenizer Offset Correction
- Heuristic:Apache Flink Async Sink Timeout And Backpressure Defaults
- Heuristic:Zai org CogVideo Memory Optimization Strategies
- Heuristic:Haotian liu LLaVA Use Cache Training Inference Toggle
- Heuristic:Openai CLIP CLIP Normalization Constants
Environments
- Environment:Datahub project Datahub Python Ingestion
- Environment:Rapidsai Cuml Python RAPIDS Stack
- Environment:Huggingface Trl Quantization Environment
- Environment:Nautechsystems Nautilus trader Binance API Credentials
- Environment:LLMBook zh LLMBook zh github io Data Processing Environment
- Environment:Apache Kafka Gradle Build Environment
- Environment:Rapidsai Cuml Dask Distributed
- Environment:Sgl project Sglang CUDA Runtime
- Environment:Huggingface Datatrove Processing Dependencies
- Environment:Sgl project Sglang Grafana