Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Openai Evals Running a single eval
- Workflow:Fede1024 Rust rdkafka Mock Cluster Testing
- Workflow:Open compass VLMEvalKit Adding Custom VLM
- Workflow:MaterializeInc Materialize Upgrade Testing
- Workflow:Apache Shardingsphere Dynamic Rule Configuration Change
- Workflow:Openai Openai agents python Tool Integrated Agent
- Workflow:Eventual Inc Daft Multimodal AI Batch Inference
- Workflow:Open compass VLMEvalKit Video Benchmark Evaluation
- Workflow:Tencent Ncnn PyTorch Model Conversion and Inference
- Workflow:Huggingface Open r1 GRPO Reasoning Training
Principles
- Principle:Zai org CogVideo 3D Video Encoding
- Principle:OpenHands OpenHands Payload Parsing
- Principle:Shiyu coder Kronos Tokenizer Encoding
- Principle:Sgl project Sglang Visual Input Preparation
- Principle:Risingwavelabs Risingwave CDC Pipeline Coordination
- Principle:AUTOMATIC1111 Stable diffusion webui Canvas UI Interaction
- Principle:Sktime Pytorch forecasting Series Decomposition
- Principle:Huggingface Transformers Device Mesh Topology
- Principle:Huggingface Datasets Struct Flattening
- Principle:Huggingface Datasets Arrow File Reading
Implementations
- Implementation:ArroyoSystems Arroyo Planner Extensions
- Implementation:LMCache LMCache XPU Connector
- Implementation:Apache Druid NamedExpressionsInput
- Implementation:Guardrails ai Guardrails Rail Schema
- Implementation:Lance format Lance Dataset Checkout Version
- Implementation:SeleniumHQ Selenium ExecutableFinder
- Implementation:TobikoData Sqlmesh Pyproject Toml
- Implementation:NVIDIA NeMo Aligner Custom Checkpoint Callback
- Implementation:Hiyouga LLaMA Factory WebUI Export Component
- Implementation:FlowiseAI Flowise CompStyleOverride
Heuristics
- Heuristic:Scikit learn contrib Imbalanced learn Sampling Before Split Leakage
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:ArroyoSystems Arroyo Stateful Operator TTL
- Heuristic:FlagOpen FlagEmbedding Length Sorted Batching
- Heuristic:Allenai Open instruct Pre Init Torch Distributed
- Heuristic:FMInference FlexLLMGen OOM Memory Management
- Heuristic:Haosulab ManiSkill Num Envs Backend Selection
- Heuristic:Astronomer Astronomer cosmos Deprecation Migration Paths
- Heuristic:Facebookresearch Audiocraft FSDP Distributed Training Tips
- Heuristic:Avdvg InjectGuard Dataset Coverage Recall Bound
Environments
- Environment:Microsoft DeepSpeedExamples RLHF Training Environment
- Environment:Apache Paimon Cloud Storage Credentials
- Environment:Evidentlyai Evidently Python Core Environment
- Environment:Deepset ai Haystack OpenAI API Environment
- Environment:Openai Openai node OpenAI API Credentials
- Environment:Langgenius Dify Docker Compose Environment
- Environment:Deepspeedai DeepSpeed NVMe Environment
- Environment:Arize ai Phoenix Phoenix Server Runtime
- Environment:Langchain ai Langchain LangSmith Tracing Config
- Environment:Mbzuai oryx Awesome LLM Post training Python Matplotlib