Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ucbepic Docetl Long Document Chunking
- Workflow:Zai org CogVideo SAT Finetuning
- Workflow:Huggingface Open r1 Reasoning Data Generation
- Workflow:Sktime Pytorch forecasting DeepAR Probabilistic Forecasting
- Workflow:Sktime Pytorch forecasting NBeats Univariate Forecasting
- Workflow:Getgauge Taiko Network Request Interception
- Workflow:Apache Hudi Flink Batch Incremental Read
- Workflow:Intel Ipex llm RAG With LangChain
- Workflow:Trailofbits Fickling Pickle Safety Analysis
- Workflow:Turboderp org Exllamav2 Multimodal Vision Inference
Principles
- Principle:Google deepmind Dm control MJCF Model Export
- Principle:Interpretml Interpret Linear Model Explanation
- Principle:CarperAI Trlx Reward Function Design
- Principle:Cleanlab Cleanlab Token Issue Display
- Principle:Sdv dev SDV Documentation Build Configuration
- Principle:Huggingface Transformers Model Saving
- Principle:ClickHouse ClickHouse DNS Resolution
- Principle:NVIDIA DALI Image Normalization
- Principle:Snorkel team Snorkel Multitask Evaluation Prediction
- Principle:Apache Kafka Broker Invocation
Implementations
- Implementation:Googleapis Python genai Files Upload
- Implementation:Treeverse LakeFS Java SDK StagingApi
- Implementation:Puppeteer Puppeteer BrowserProvider
- Implementation:Rapidsai Cuml PR Workflow
- Implementation:Apache Spark ProxyUtils
- Implementation:Cohere ai Cohere python ChatConnector Model
- Implementation:Astronomer Astronomer cosmos PostgresUserPasswordProfileMapping
- Implementation:Tensorflow Serving HTTPServer Interface
- Implementation:Microsoft Onnxruntime CPU PoolGrad
- Implementation:Bigscience workshop Petals Choose Best Blocks
Heuristics
- Heuristic:Truera Trulens Rate Limiting And Retry Strategy
- Heuristic:Lance format Lance Warning Deprecated Java APIs
- Heuristic:Kornia Kornia Avoid Inplace Ops Compile
- Heuristic:OWASP Www project top 10 for large language model applications Warning Deprecated Markdown To PDF Convert
- Heuristic:DevExpress Testcafe Docker Chrome Tab Retry
- Heuristic:Bigscience workshop Petals Batch Splitting Threshold
- Heuristic:Google deepmind Dm control Prop Settling Physics Tuning
- Heuristic:Datahub project Datahub Secret Handling And Deprecation Patterns
- Heuristic:Cohere ai Cohere python Embed Auto Batching Strategy
- Heuristic:Fede1024 Rust rdkafka Regular Polling Required
Environments
- Environment:LLMBook zh LLMBook zh github io Bitsandbytes Quantization Environment
- Environment:Isaac sim IsaacGymEnvs Python CUDA Runtime
- Environment:Heibaiying BigData Notes Hadoop CDH Environment
- Environment:Recommenders team Recommenders Spark Environment
- Environment:Datahub project Datahub Spark Lineage Environment
- Environment:Apache Shardingsphere Etcd Cluster Coordination
- Environment:Getgauge Taiko Chromium Browser
- Environment:Snorkel team Snorkel PySpark
- Environment:Vllm project Vllm NVIDIA CUDA
- Environment:DataExpert io Data engineer handbook Statsig API Environment