Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mistralai Client python Streaming Chat Completion
- Workflow:DataExpert io Data engineer handbook Flink Kafka Streaming Pipeline
- Workflow:Trailofbits Fickling PyTorch Format Identification
- Workflow:Kubeflow Pipelines Pipeline Authoring and Compilation
- Workflow:Rapidsai Cuml GPU Clustering
- Workflow:Apache Druid Batch Data Ingestion
- Workflow:Huggingface Trl Reward Model Training
- Workflow:Facebookresearch Habitat lab HITL Interactive Evaluation
- Workflow:Apache Flink File Sink Pipeline
- Workflow:DistrictDataLabs Yellowbrick Classification Model Evaluation
Principles
- Principle:Google deepmind Mujoco Passive Forces
- Principle:Evidentlyai Evidently Dataset Score Extraction
- Principle:Google deepmind Dm control Props and Targets
- Principle:Deepseek ai Janus CFG Input Preparation for Flow
- Principle:Sgl project Sglang Streaming Response Handling
- Principle:ARISE Initiative Robosuite Omniverse Rendering
- Principle:Apache Shardingsphere SPI Refresher Loading
- Principle:Microsoft DeepSpeedExamples Multi Dataset VQA Preparation
- Principle:Speechbrain Speechbrain Audio Interpretability
- Principle:Huggingface Transformers Training Execution
Implementations
- Implementation:LaurentMazare Tch rs Layer Norm
- Implementation:DataTalksClub Data engineering zoomcamp Kestra PostgreSQL CopyIn
- Implementation:Online ml River FeatureSelection VarianceThreshold
- Implementation:FlagOpen FlagEmbedding BGE Coder CorpusGenerator
- Implementation:Apache Airflow Timezone Utilities
- Implementation:Lance format Lance Java StorageOptionsProvider
- Implementation:Hpcaitech ColossalAI Train ORPO Script
- Implementation:Google deepmind Mujoco mj name2id
- Implementation:Hiyouga LLaMA Factory WebUI Common
- Implementation:Bentoml BentoML Cloud ModelAPI
Heuristics
- Heuristic:AUTOMATIC1111 Stable diffusion webui VRAM Management Strategies
- Heuristic:SeldonIO Seldon core Tracing Latency Tip
- Heuristic:Openclaw Openclaw Warning Suppression For Known Deprecations
- Heuristic:Promptfoo Promptfoo Retry With Jitter
- Heuristic:Axolotl ai cloud Axolotl FSDP Configuration Guide
- Heuristic:Haosulab ManiSkill Initial Pose Performance
- Heuristic:Bitsandbytes foundation Bitsandbytes Outlier Threshold Detection
- Heuristic:Microsoft BIPIA LLAMA Pad Token Workaround
- Heuristic:NVIDIA DALI NVJPEG Memory Preallocation
- Heuristic:Langfuse Langfuse LLM Rate Limit 24h Abandon
Environments
- Environment:Alibaba MNN HuggingFace Ecosystem Environment
- Environment:Huggingface Open r1 CUDA Environment
- Environment:Zai org CogVideo Video Captioning Environment
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:LLMBook zh LLMBook zh github io Data Processing Environment
- Environment:Microsoft Agent framework Core Package Dependencies
- Environment:Bigscience workshop Petals CUDA Server
- Environment:Langgenius Dify Docker Compose Environment
- Environment:Unstructured IO Unstructured Profiling Tools
- Environment:Shiyu coder Kronos Qlib Data Environment