Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Run llama Llama index OpenAI LLM Finetuning
- Workflow:Apache Shardingsphere Dynamic Rule Configuration Change
- Workflow:Protectai Modelscan Programmatic Model Scanning
- Workflow:Alibaba ROLL Knowledge Distillation Pipeline
- Workflow:Gretelai Gretel synthetics DGAN Timeseries Generation
- Workflow:Turboderp org Exllamav2 Interactive Chat
- Workflow:Getgauge Taiko Headless Browser Testing
- Workflow:Mlflow Mlflow GenAI Evaluation
- Workflow:Apache Druid SQL Query Execution
- Workflow:Datahub project Datahub Metadata Actions Pipeline
Principles
- Principle:Heibaiying BigData Notes Hive Database Creation
- Principle:NVIDIA NeMo Aligner DPO Reference Policy Management
- Principle:FlowiseAI Flowise Evaluator Definition
- Principle:Promptfoo Promptfoo Provider Resolution
- Principle:Astronomer Astronomer cosmos Graph Parsing and Task Generation
- Principle:Treeverse LakeFS Retention Rule Configuration
- Principle:Recommenders team Recommenders SAR Model Training
- Principle:Triton inference server Server Ensemble Inference
- Principle:Google deepmind Mujoco Pipeline Architecture
- Principle:Bitsandbytes foundation Bitsandbytes FP8 Simulated Quantization Matmul
Implementations
- Implementation:Haosulab ManiSkill EmptyEnv
- Implementation:Microsoft Semantic kernel JiraPlugin OpenAPI
- Implementation:Microsoft BIPIA Clean Response Inference
- Implementation:DataTalksClub Data engineering zoomcamp Docker Build Run
- Implementation:Guardrails ai Guardrails Guard Load
- Implementation:Duckdb Duckdb Generate Auxiliary
- Implementation:Apache Paimon RestClient
- Implementation:Openai Openai node ChatCompletionCreateParams
- Implementation:Sktime Pytorch forecasting Encoder
- Implementation:CARLA simulator Carla Timestamp Class
Heuristics
- Heuristic:Langfuse Langfuse LLM Rate Limit 24h Abandon
- Heuristic:Norrrrrrr lyn WAInjectBench LoRA Rank Alpha Selection
- Heuristic:Datahub project Datahub Docker Memory Preflight
- Heuristic:Bentoml BentoML Adaptive Batching Tuning
- Heuristic:Apache Airflow Task Idempotency Pattern
- Heuristic:Mit han lab Llm awq Skip QK Projection Clipping
- Heuristic:NVIDIA NeMo Aligner Warning Deprecated Repository
- Heuristic:Spotify Luigi Parameter Propagation Decorators
- Heuristic:Pyro ppl Pyro Guide Initialization Strategy
- Heuristic:Vllm project Vllm KV Cache Block Size Selection
Environments
- Environment:Treeverse LakeFS LakeFS Server Environment
- Environment:Astronomer Astronomer cosmos Python Airflow Runtime
- Environment:Snorkel team Snorkel PyTorch
- Environment:Lance format Lance Python Environment
- Environment:Ucbepic Docetl LLM API Keys
- Environment:Astronomer Astronomer cosmos Kubernetes Provider
- Environment:MaterializeInc Materialize Dbt Materialize Runtime
- Environment:Datahub project Datahub Docker Runtime
- Environment:ChenghaoMou Text dedup Suffix Array External Tools
- Environment:Lance format Lance Rust Toolchain