Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Hudi Flink Batch Incremental Read
- Workflow:Snorkel team Snorkel Weak Supervision Pipeline
- Workflow:Predibase Lorax Server Deployment
- Workflow:Ollama Ollama Model Registry Operations
- Workflow:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Data Ingestion
- Workflow:Eventual Inc Daft Data Lakehouse ETL
- Workflow:Lucidrains X transformers Non Autoregressive Masked Generation
- Workflow:Intel Ipex llm LoRA Finetuning
- Workflow:Mit han lab Llm awq TinyChat LLM Deployment
- Workflow:Huggingface Optimum Automatic Tensor Parallelization
Principles
- Principle:Lucidrains X transformers Autoregressive Wrapper Setup
- Principle:LLMBook zh LLMBook zh github io Loss Masking Tokenization
- Principle:Microsoft BIPIA Dataset Preparation
- Principle:Microsoft Agent framework Edge Condition Pattern
- Principle:Ggml org Llama cpp Apple Platform Build
- Principle:Ray project Ray Application Deployment
- Principle:Langfuse Langfuse Dataset Item Processing
- Principle:Alibaba MNN Operator Fusion Codegen
- Principle:NVIDIA NeMo Aligner Knowledge Distillation Training
- Principle:Cohere ai Cohere python Server Sent Events Decoding
Implementations
- Implementation:Kubeflow Kubeflow Profile CRD RBAC Setup
- Implementation:Arize ai Phoenix Legacy Evaluators
- Implementation:Microsoft Playwright Client Events
- Implementation:Cohere ai Cohere python ToolCallV2 Model
- Implementation:LMCache LMCache SegmentTokenDatabase Process Tokens
- Implementation:Deepspeedai DeepSpeed PipelineEngine Train Batch
- Implementation:Ollama Ollama Convert Gemma
- Implementation:Dagster io Dagster Sensor Decorator
- Implementation:Microsoft Onnxruntime CUDA MixedPrecisionScale
- Implementation:Truera Trulens Feedback Tool Selection
Heuristics
- Heuristic:NVIDIA NeMo Aligner PPO Critic Warmup Tip
- Heuristic:Langfuse Langfuse Eval Loop Prevention
- Heuristic:Microsoft Onnxruntime Memory Recomputation Optimization
- Heuristic:Mit han lab Llm awq Kernel Selection Thresholds
- Heuristic:Treeverse LakeFS S3 Multipart Size Constraint
- Heuristic:Mage ai Mage ai Parallel Sink Concurrency Limit
- Heuristic:Rapidsai Cuml Quantile Split Differences
- Heuristic:DistrictDataLabs Yellowbrick Scikit Learn API Compatibility
- Heuristic:MarketSquare Robotframework browser MacOS Sonoma Startup Delay
- Heuristic:PeterL1n BackgroundMattingV2 Mixed Precision Training
Environments
- Environment:Mistralai Client python Realtime Transcription Environment
- Environment:Tensorflow Tfjs Node Native Runtime
- Environment:Langchain ai Langchain Python 3 10 Runtime
- Environment:Vllm project Vllm AWS ECR
- Environment:BerriAI Litellm Observability Stack
- Environment:Mistralai Client python Python SDK Environment
- Environment:Open compass VLMEvalKit Python Runtime Environment
- Environment:Apache Kafka JVM Runtime Environment
- Environment:Vllm project Vllm NVIDIA CUDA
- Environment:Apache Paimon Optional Extensions