Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Hudi Flink MOR Compaction
- Workflow:Eventual Inc Daft Multimodal AI Batch Inference
- Workflow:Huggingface Diffusers Model Quantization
- Workflow:Marker Inc Korea AutoRAG Evaluation Data Creation
- Workflow:Volcengine Verl PPO Training With Reward Model
- Workflow:Huggingface Diffusers Text to Image Inference
- Workflow:DataExpert io Data engineer handbook PySpark Job Testing
- Workflow:Kubeflow Pipelines Pipeline Control Flow
- Workflow:Elevenlabs Elevenlabs python Realtime TTS Streaming
- Workflow:Princeton nlp SimPO On Policy Data Generation
Principles
- Principle:Openai Openai node Response Stream Processing
- Principle:Datajuicer Data juicer Custom Operator Configuration
- Principle:Datahub project Datahub Spark Lineage Configuration
- Principle:Webdriverio Webdriverio UtilityPattern
- Principle:Lucidrains X transformers Entropy Based Segmentation
- Principle:Kubeflow Pipelines Iterative Training Termination
- Principle:Ggml org Ggml Vision Encoder Execution
- Principle:LaurentMazare Tch rs PyO3 Tensor Bridge
- Principle:Pyro ppl Pyro Bayesian Module Integration
- Principle:Langfuse Langfuse Data Streaming from ClickHouse
Implementations
- Implementation:Mlc ai Web llm Grammar Matcher Decoding
- Implementation:Tencent Ncnn ModelWriter
- Implementation:Microsoft Onnxruntime CPU GatherElementsGrad
- Implementation:Risingwavelabs Risingwave Graph Algorithms
- Implementation:SeleniumHQ Selenium Closure SafeUrl
- Implementation:FlagOpen FlagEmbedding EvalReranker Call
- Implementation:TobikoData Sqlmesh Metadata Component
- Implementation:MaterializeInc Materialize Deploy Promote Pattern
- Implementation:Open compass VLMEvalKit MMIF Function And Compare
- Implementation:FlowiseAI Flowise Pnpm Lock
Heuristics
- Heuristic:Lance format Lance Warning Deprecated Java APIs
- Heuristic:LaurentMazare Tch rs MPS Weight Loading Workaround
- Heuristic:Groq Groq python Timeout Configuration
- Heuristic:Recommenders team Recommenders SAR Cold Start Items
- Heuristic:Ggml org Ggml Gradient Accumulation Batch Sizing
- Heuristic:Diagram of thought Diagram of thought Strict Vs Flexible Critic Rigor
- Heuristic:Vibrantlabsai Ragas Warning Deprecated V1 Metrics
- Heuristic:Huggingface Datatrove Gopher Quality Thresholds
- Heuristic:Huggingface Datasets Parquet Shard Sizing
- Heuristic:Apache Spark Build Fallback Strategies
Environments
- Environment:OpenGVLab InternVL PyTorch CUDA
- Environment:Apache Kafka Gradle Build Environment
- Environment:Langchain ai Langgraph Python Runtime Environment
- Environment:Speechbrain Speechbrain Multi GPU DDP
- Environment:Promptfoo Promptfoo Python Runtime
- Environment:Microsoft DeepSpeedExamples SuperOffload Runtime
- Environment:Datahub project Datahub Docker Runtime
- Environment:Alibaba MNN GPU OpenCL Environment
- Environment:Teamcapybara Capybara Ruby And Gem Dependencies
- Environment:Danijar Dreamerv3 JAX CUDA