Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Avhz RustQuant Yield Curve Construction
- Workflow:Fede1024 Rust rdkafka Transactional Produce Consume
- Workflow:FlagOpen FlagEmbedding Embedder Finetuning
- Workflow:ARISE Initiative Robomimic Trained Policy Evaluation
- Workflow:Mbzuai oryx Awesome LLM Post training Deep Paper Collection
- Workflow:Webdriverio Webdriverio Cloud Service Integration
- Workflow:Google research Deduplicate text datasets Wiki40B TFDS deduplication
- Workflow:MaterializeInc Materialize dbt Integration
- Workflow:Ggml org Ggml MNIST Training And Evaluation
- Workflow:Langgenius Dify Docker Deployment
Principles
- Principle:Ggml org Llama cpp Sampling
- Principle:Neuml Txtai Base Model Configuration
- Principle:Pola rs Polars Advanced SQL Features
- Principle:MarketSquare Robotframework browser Project Build Pipeline
- Principle:Apache Flink PyFlink Build Distribution
- Principle:Openclaw Openclaw Routing Verification
- Principle:Microsoft Onnxruntime Distributed Data Loading
- Principle:Puppeteer Puppeteer Platform Detection
- Principle:Getgauge Taiko Gauge Project Setup
- Principle:PrefectHQ Prefect Durable AI Execution
Implementations
- Implementation:OpenGVLab InternVL Trainer Save Model
- Implementation:Huggingface Transformers Pipeline Preprocess
- Implementation:FlagOpen FlagEmbedding MLVU Count Data
- Implementation:Ucbepic Docetl CodeOperations
- Implementation:Hiyouga LLaMA Factory V1 Launcher
- Implementation:Pola rs Polars Polars Buffer Lib
- Implementation:Datajuicer Data juicer VideoNSFWFilter
- Implementation:Online ml River Docs Parse
- Implementation:Langchain ai Langchain BaseRateLimiter Acquire
- Implementation:Langgenius Dify Marketplace Contract
Heuristics
- Heuristic:Run llama Llama index Evaluator LLM Selection
- Heuristic:AnswerDotAI RAGatouille Collection Size Index Tuning
- Heuristic:Evidentlyai Evidently Column Type Inference Rules
- Heuristic:Scikit learn Scikit learn Working Memory Tuning
- Heuristic:Trailofbits Fickling Injection Mode Selection
- Heuristic:Apache Shardingsphere Worker ID Reservation Strategy
- Heuristic:Pola rs Polars GPU Aggregation Join Speedup
- Heuristic:TobikoData Sqlmesh Execution Time Caching
- Heuristic:ChenghaoMou Text dedup Mersenne Prime Backward Compatibility
- Heuristic:Intel Ipex llm Use Cache Training Vs Inference
Environments
- Environment:Truera Trulens Streamlit Dashboard Environment
- Environment:ARISE Initiative Robomimic Robosuite Simulation Backend
- Environment:Sktime Pytorch forecasting Cpflows MQF2 Dependencies
- Environment:DataTalksClub Data engineering zoomcamp Kestra Orchestration Environment
- Environment:Kubeflow Kubeflow Istio Certmanager Dex Environment
- Environment:CarperAI Trlx DeepSpeed Multi GPU
- Environment:Huggingface Alignment handbook BitsAndBytes CUDA
- Environment:Hiyouga LLaMA Factory Optional Inference Backends
- Environment:FlowiseAI Flowise Docker Environment
- Environment:Wandb Weave Python SDK Runtime