Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft Onnxruntime Distributed Model Training
- Workflow:Rapidsai Cuml Multi GPU Distributed ML
- Workflow:Huggingface Datasets Dataset Loading and Exploration
- Workflow:Vespa engine Vespa Document indexing pipeline
- Workflow:Infiniflow Ragflow Document Processing Pipeline
- Workflow:Datajuicer Data juicer Text Data Processing Pipeline
- Workflow:Norrrrrrr lyn WAInjectBench Text Prompt Injection Detection
- Workflow:Mbzuai oryx Awesome LLM Post training Deep Paper Collection
- Workflow:Rapidsai Cuml Time Series Forecasting
- Workflow:Openai Evals Creating a model graded eval
Principles
- Principle:AUTOMATIC1111 Stable diffusion webui Inpainting Pipeline
- Principle:Microsoft Onnxruntime Nodejs Output Processing
- Principle:HKUDS AI Trader LLM Invocation
- Principle:Haotian liu LLaVA Custom Dataset Formatting
- Principle:Cleanlab Cleanlab Token Label Issue Filtering
- Principle:Nautechsystems Nautilus trader Order Position Event Handling
- Principle:Tensorflow Serving Kubernetes Resource Deployment
- Principle:Nautechsystems Nautilus trader Risk Management
- Principle:Huggingface Datasets Dataset Object Construction
- Principle:Openai Openai agents python Stream Event Consumption
Implementations
- Implementation:Protectai Llm guard Output BanCode
- Implementation:Webdriverio Webdriverio Cloud Service Hooks
- Implementation:InternLM Lmdeploy Core Math
- Implementation:Farama Foundation Gymnasium FunctionalJaxEnv
- Implementation:Lance format Lance Java ColumnAlteration
- Implementation:Cleanlab Cleanlab Datalab Init
- Implementation:TobikoData Sqlmesh EditorTabs
- Implementation:Langgenius Dify CreateApp Workflow
- Implementation:Ray project Ray TaskExecutor
- Implementation:Nautechsystems Nautilus trader Pyproject Configuration
Heuristics
- Heuristic:Fede1024 Rust rdkafka Commit Mode Sync Vs Async
- Heuristic:Bigscience workshop Petals Batch Splitting Threshold
- Heuristic:Speechbrain Speechbrain Score Normalization Tips
- Heuristic:Google research Deduplicate text datasets Ulimit File Descriptors For Merge
- Heuristic:Truera Trulens Trace Compression Token Limits
- Heuristic:Predibase Lorax Flash Attention Backend Selection
- Heuristic:Haosulab ManiSkill Num Envs Backend Selection
- Heuristic:Infiniflow Ragflow Hybrid Search Fallback Strategy
- Heuristic:Apache Druid Auto Granularity Selection
- Heuristic:Open compass VLMEvalKit API Retry With Random Delay
Environments
- Environment:Mlc ai Web llm WebGPU Browser Runtime
- Environment:PacktPublishing LLM Engineers Handbook Selenium Chrome Crawler Environment
- Environment:Apache Airflow Docker Container Environment
- Environment:Huggingface Datatrove IO Dependencies
- Environment:Bentoml BentoML BentoCloud Credentials
- Environment:OpenHands OpenHands Integration Credentials
- Environment:Microsoft Agent framework Python 3 10 Runtime
- Environment:Infiniflow Ragflow Docker Infrastructure
- Environment:Huggingface Peft BitsAndBytes Quantization
- Environment:Vllm project Vllm Environment Variables