Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Deepspeedai DeepSpeed ZeRO Distributed Training
- Workflow:Isaac sim IsaacGymEnvs Policy Inference and Evaluation
- Workflow:Open compass VLMEvalKit Adding Custom VLM
- Workflow:ChenghaoMou Text dedup Benchmark Evaluation
- Workflow:Confident ai Deepeval Synthetic Dataset Generation
- Workflow:Alibaba ROLL Agentic RL Training Pipeline
- Workflow:Explodinggradients Ragas Metric Prompt Optimization
- Workflow:Microsoft Agent framework Multi Agent Concurrent Orchestration
- Workflow:Spcl Graph of thoughts Custom GoT Use Case Integration
- Workflow:TobikoData Sqlmesh Environment management
Principles
- Principle:Huggingface Diffusers Training Dataset Preparation
- Principle:Ucbepic Docetl LLM Powered Text Extraction
- Principle:Microsoft Agent framework Custom Aggregation Pattern
- Principle:Scikit learn Scikit learn Search Execution
- Principle:Deepspeedai DeepSpeed Pipeline Evaluation
- Principle:Tensorflow Serving Test Model Export
- Principle:Snorkel team Snorkel Label Quality Evaluation
- Principle:Arize ai Phoenix Evaluation Data Preparation
- Principle:Isaac sim IsaacGymEnvs Checkpoint Export and Logging
- Principle:Togethercomputer Together python Audio Translation
Implementations
- Implementation:Pyro ppl Pyro SV DKL
- Implementation:Microsoft Onnxruntime CPU GatherNDGrad
- Implementation:Apache Druid ServicePropertiesTable
- Implementation:Neuml Txtai Similarity Pipeline
- Implementation:NVIDIA TransformerEngine JAX Cpp Quantization
- Implementation:Fede1024 Rust rdkafka AdminClient
- Implementation:FlagOpen FlagEmbedding LLM Embedder Eval QReCC
- Implementation:Datahub project Datahub Sidebars Config
- Implementation:Openclaw Openclaw Platform Credential Helpers
- Implementation:Online ml River Compose Select
Heuristics
- Heuristic:Google research Deduplicate text datasets Variable Width Pointer Optimization
- Heuristic:Princeton nlp SimPO Left Truncation Strategy
- Heuristic:FMInference FlexLLMGen OOM Memory Management
- Heuristic:Isaac sim IsaacGymEnvs JIT Profiling Optimization
- Heuristic:Google research Deduplicate text datasets Ulimit File Descriptors For Merge
- Heuristic:Facebookresearch Habitat lab Mini Batch Environment Divisibility
- Heuristic:Avhz RustQuant Learning Rate Tuning
- Heuristic:Scikit learn contrib Imbalanced learn Sampling Before Split Leakage
- Heuristic:ArroyoSystems Arroyo Parallelism Configuration
- Heuristic:Speechbrain Speechbrain Score Normalization Tips
Environments
- Environment:Astronomer Astronomer cosmos Kubernetes Provider
- Environment:OpenGVLab InternVL DeepSpeed
- Environment:Haosulab ManiSkill Python SAPIEN Core
- Environment:ClickHouse ClickHouse Python3 Test Environment
- Environment:PacktPublishing LLM Engineers Handbook Python 3 11 Poetry Environment
- Environment:Sktime Pytorch forecasting Cpflows MQF2 Dependencies
- Environment:Allenai Open instruct Docker Container
- Environment:Langchain ai Langgraph Python Runtime Environment
- Environment:Mistralai Client python Realtime Transcription Environment
- Environment:Mage ai Mage ai Singer SDK And Joblib Runtime