Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Spotify Luigi Spark Processing Pipeline
- Workflow:Webdriverio Webdriverio Custom Plugin Development
- Workflow:PacktPublishing LLM Engineers Handbook Feature Engineering
- Workflow:Sktime Pytorch forecasting NBeats Univariate Forecasting
- Workflow:Protectai Llm guard API Server Deployment
- Workflow:Openai Openai python Fine Tuning Job Management
- Workflow:MaterializeInc Materialize Docker Image Build
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Apache Paimon Vector Similarity Search
- Workflow:Pyro ppl Pyro MCMC Inference
Principles
- Principle:ChenghaoMou Text dedup MinHash Fingerprinting
- Principle:CrewAIInc CrewAI Specialist Agent Definition
- Principle:Facebookresearch Habitat lab Data Recording
- Principle:CARLA simulator Carla Simulation Recording
- Principle:Apache Dolphinscheduler DataSource Channel Pattern
- Principle:NVIDIA TransformerEngine Gemma Weight Loading
- Principle:ClickHouse ClickHouse Optimized Memcpy
- Principle:Helicone Helicone Database Schema
- Principle:Astronomer Astronomer cosmos Profile Configuration
- Principle:Tencent Ncnn Vulkan GPU Detection
Implementations
- Implementation:Openai Openai agents python Image Tool Output Pattern
- Implementation:Openai Openai agents python ApplyPatchTool Pattern
- Implementation:Pyro ppl Pyro MiniPyro
- Implementation:Mistralai Client python Mistral Init
- Implementation:LMCache LMCache PySocket Channel
- Implementation:Predibase Lorax FP8 Linear
- Implementation:Ray project Ray Serve Shutdown
- Implementation:OpenGVLab InternVL M4C Evaluator
- Implementation:Apache Druid SampleForParser
- Implementation:Microsoft Onnxruntime CgManifest
Heuristics
- Heuristic:Neuml Txtai Batch Size And Sorting Tip
- Heuristic:Neuml Txtai Model Quantization Defaults
- Heuristic:Treeverse LakeFS Warning Deprecated InternalApi Methods
- Heuristic:NVIDIA DALI Distributed Sharding Strategy
- Heuristic:Interpretml Interpret EBM Hyperparameter Tuning Guide
- Heuristic:CarperAI Trlx Delta Rewards
- Heuristic:EvolvingLMMs Lab Lmms eval Distributed Padding Strategy
- Heuristic:ARISE Initiative Robosuite XML Reset Method Tradeoff
- Heuristic:Predibase Lorax Quantization Backend Selection
- Heuristic:Lm sys FastChat Greedy Decoding Temperature Threshold
Environments
- Environment:Togethercomputer Together python API Credentials
- Environment:Deepspeedai DeepSpeed Python Runtime Environment
- Environment:Google deepmind Mujoco MJX Warp CUDA Environment
- Environment:Ucbepic Docetl LLM API Keys
- Environment:Sktime Pytorch forecasting Core Python Dependencies
- Environment:Eric mitchell Direct preference optimization Python Dependencies
- Environment:Apache Hudi Docker Demo Environment
- Environment:Allenai Open instruct Beaker Cluster
- Environment:Datahub project Datahub Python Ingestion
- Environment:Online ml River Build Toolchain