Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Zai org CogVideo Diffusers Image to Video Inference
- Workflow:MarketSquare Robotframework browser Library Development and Release
- Workflow:Huggingface Datatrove Minhash Deduplication
- Workflow:ChenghaoMou Text dedup Benchmark Evaluation
- Workflow:Groq Groq python Batch Processing
- Workflow:ChenghaoMou Text dedup SimHash Deduplication
- Workflow:Lance format Lance Version Management
- Workflow:Interpretml Interpret Model Explanation And Visualization
- Workflow:Deepset ai Haystack Document Preprocessing Pipeline
- Workflow:Nautechsystems Nautilus trader Backtest with BacktestEngine
Principles
- Principle:AUTOMATIC1111 Stable diffusion webui Merged model saving
- Principle:Intel Ipex llm DPO Model Export
- Principle:MarketSquare Robotframework browser Response Formatting and Validation
- Principle:Dagster io Dagster Dynamic Partitioning
- Principle:Hpcaitech ColossalAI GRPO Consumer Setup
- Principle:Scikit learn Scikit learn Ranking Metrics
- Principle:Huggingface Alignment handbook Model Loading
- Principle:Huggingface Datasets Column Removal
- Principle:Fastai Fastbook DataLoaders Creation
- Principle:Webdriverio Webdriverio Session Cleanup
Implementations
- Implementation:Infiniflow Ragflow MetadataManageModal Hooks
- Implementation:Sktime Pytorch forecasting DLinear V2
- Implementation:Google deepmind Dm control Binding Generator
- Implementation:Heibaiying BigData Notes FlinkToMySQLSink Implementation
- Implementation:SeleniumHQ Selenium Closure Testing StackTrace
- Implementation:Webdriverio Webdriverio Launcher Class
- Implementation:Helicone Helicone AI Gateway OpenAPI Spec
- Implementation:FlowiseAI Flowise DeleteDocStoreDialog
- Implementation:Scikit learn Scikit learn ConsensusScore
- Implementation:Interpretml Interpret Process Terms For Merge
Heuristics
- Heuristic:Zai org CogVideo Data Preparation Best Practices
- Heuristic:Norrrrrrr lyn WAInjectBench Balanced Class Weights Imbalanced Data
- Heuristic:Fastai Fastbook Weight Decay Tuning
- Heuristic:Langfuse Langfuse Ingestion Date Boundary Delay
- Heuristic:Microsoft Onnxruntime Graph Optimization Level Selection
- Heuristic:Testtimescaling Testtimescaling github io Dual JSON Sync
- Heuristic:Rapidsai Cuml GPU Cache Alignment
- Heuristic:Mlfoundations Open flamingo Loss Masking Strategy
- Heuristic:DataTalksClub Data engineering zoomcamp Dbt Materialization Strategy
- Heuristic:Anthropics Anthropic sdk python Streaming For Long Requests
Environments
- Environment:PacktPublishing LLM Engineers Handbook Selenium Chrome Crawler Environment
- Environment:Hiyouga LLaMA Factory Distributed Training Environment
- Environment:Alibaba MNN GPU OpenCL Environment
- Environment:Neuml Txtai GPU Accelerator Detection
- Environment:MaterializeInc Materialize Buildkite CI Runtime
- Environment:ClickHouse ClickHouse OpenSSL Runtime
- Environment:Evidentlyai Evidently Python Core Environment
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:LLMBook zh LLMBook zh github io HuggingFace Transformers Stack
- Environment:FMInference FlexLLMGen NVMe Disk