Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lm sys FastChat ShareGPT Data Pipeline
- Workflow:Sgl project Sglang Structured Output Generation
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Apache Dolphinscheduler Datasource Plugin Development
- Workflow:PrefectHQ Prefect Per Worker Task Concurrency
- Workflow:Onnx Onnx Model Validation
- Workflow:Haifengl Smile Data Loading Pipeline
- Workflow:Mlc ai Web llm Function Calling
- Workflow:Allenai Open instruct Reward Model Training
- Workflow:CARLA simulator Carla Simulation Setup and First Steps
Principles
- Principle:Facebookresearch Audiocraft Pretrained Model Loading
- Principle:Openai Whisper Dynamic Time Warping
- Principle:Treeverse LakeFS Imported Data Verification
- Principle:Zai org CogVideo SAT Weight Export
- Principle:PrefectHQ Prefect AI Approval Gate
- Principle:Huggingface Datasets TensorFlow Formatting
- Principle:Avhz RustQuant Geometric Brownian Motion
- Principle:Cypress io Cypress Launchpad Initialization
- Principle:Huggingface Datatrove JSONL Data Writing
- Principle:AUTOMATIC1111 Stable diffusion webui Extra Networks
Implementations
- Implementation:SeleniumHQ Selenium Closure KeyCodes
- Implementation:Langgenius Dify RetrievalConfig Type
- Implementation:Pyro ppl Pyro LKJ
- Implementation:Pytorch Serve DLRMFactory
- Implementation:Bigscience workshop Petals PTuneMixin
- Implementation:FlowiseAI Flowise OpenAIAssistantLayout
- Implementation:NVIDIA DALI EfficientNet Model
- Implementation:Apache Shardingsphere ClusterProcessPersistService Persist
- Implementation:EvolvingLMMs Lab Lmms eval VATEX Utils
- Implementation:Apache Kafka Docker Buildx Create
Heuristics
- Heuristic:Axolotl ai cloud Axolotl Sample Packing Best Practices
- Heuristic:Onnx Onnx Opset Version Selection
- Heuristic:Bitsandbytes foundation Bitsandbytes Blocksize Platform Defaults
- Heuristic:Togethercomputer Together python Fine Tuning Parameter Validation
- Heuristic:Apache Hudi Record Level Index Optimization
- Heuristic:NVIDIA NeMo Curator Semantic Dedup Cluster Sizing
- Heuristic:Mlfoundations Open flamingo Deterministic Shard Shuffling
- Heuristic:Pytorch Serve Ampere Tensor Core Optimization
- Heuristic:MarketSquare Robotframework browser Shared Node Process For Parallel
- Heuristic:LMCache LMCache Chunk Size And Default Config
Environments
- Environment:Predibase Lorax CUDA GPU Runtime
- Environment:Apache Paimon Optional Extensions
- Environment:TobikoData Sqlmesh Dbt Compatibility
- Environment:Sktime Pytorch forecasting Core Python Dependencies
- Environment:Volcengine Verl Ray Distributed Environment
- Environment:Sgl project Sglang Multimodal
- Environment:Cypress io Cypress Node Runtime Environment
- Environment:TobikoData Sqlmesh Python Runtime
- Environment:Norrrrrrr lyn WAInjectBench Conda Python 39 CUDA Environment
- Environment:DataExpert io Data engineer handbook Python Development Environment