Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Find best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent with the Leeroopedia MCP setup guide, and let the agent search docs, build plans, verify code, and diagnose failures on your behalf.
Go end-to-end. Leeroopedia gives your agent the knowledge. Kapso gives it the ability to act on it: research, experiment, and deploy.
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Marker Inc Korea AutoRAG Evaluation Data Creation
- Workflow:Deepseek ai Janus Multimodal Understanding
- Workflow:NVIDIA TransformerEngine FP8 Training Quickstart
- Workflow:Openai Openai python Chat Completion
- Workflow:Microsoft Agent framework Multi Agent Concurrent Orchestration
- Workflow:Neuml Txtai Agent Execution
- Workflow:Arize ai Phoenix Prompt Management Pipeline
- Workflow:Facebookresearch Habitat lab PointNav PPO Training
- Workflow:ContextualAI HALOs Online Iterative Alignment
- Workflow:Microsoft Onnxruntime Distributed Model Training
Principles
- Principle:Ggml org Llama cpp GrammarConstrained
- Principle:Elevenlabs Elevenlabs python Realtime Text to Speech
- Principle:AUTOMATIC1111 Stable diffusion webui Prompt composition
- Principle:Sdv dev SDV Inequality Constraint
- Principle:Lucidrains X transformers Direct Preference Optimization
- Principle:Ggml org Llama cpp BatchProcessing
- Principle:Ucbepic Docetl LLM Powered Text Extraction
- Principle:AUTOMATIC1111 Stable diffusion webui UI Visual Feedback
- Principle:Tensorflow Serving ML Metadata Integration
- Principle:NVIDIA NeMo Curator Video Embedding
Implementations
- Implementation:Mlc ai Mlc llm JSONFFIEngine Java
- Implementation:Bentoml BentoML SDK Validators
- Implementation:Apache Paimon RenamingSnapshotCommit
- Implementation:DistrictDataLabs Yellowbrick TargetType Utilities
- Implementation:FlowiseAI Flowise RolesView
- Implementation:Ggml org Llama cpp LLGuidance
- Implementation:MarketSquare Robotframework browser Evaluation Grpc Handlers
- Implementation:DataTalksClub Data engineering zoomcamp Java JsonConsumer
- Implementation:PeterL1n BackgroundMattingV2 Displayer
- Implementation:Openai Openai node FineTuning Methods
Heuristics
- Heuristic:Sktime Pytorch forecasting Batch Size Selection
- Heuristic:Scikit learn Scikit learn Random State Management
- Heuristic:Nautechsystems Nautilus trader Order Rate Limiting Configuration
- Heuristic:Ggml org Llama cpp Warning Deprecated Legacy Converters
- Heuristic:Microsoft DeepSpeedExamples LoRA Learning Rate Scaling
- Heuristic:Farama Foundation Gymnasium Shared Memory Vector Env Optimization
- Heuristic:Norrrrrrr lyn WAInjectBench NaN Inf Fallback FP32 Recovery
- Heuristic:Risingwavelabs Risingwave Source Backoff Strategy
- Heuristic:Deepset ai Haystack Document Splitting Defaults
- Heuristic:Protectai Modelscan Nested Zip Not Supported
Environments
- Environment:Astronomer Astronomer cosmos Kubernetes Provider
- Environment:Wandb Weave Python SDK Runtime
- Environment:DataTalksClub Data engineering zoomcamp Dlt BigQuery Environment
- Environment:Kubeflow Pipelines KFP Backend Deployment
- Environment:Liu00222 Open Prompt Injection Python Dependencies
- Environment:Kserve Kserve Kubernetes Cluster
- Environment:Hiyouga LLaMA Factory Distributed Training Environment
- Environment:Alibaba MNN GPU CUDA Environment
- Environment:OpenGVLab InternVL PEFT LoRA
- Environment:Langgenius Dify Vector Database Environment