Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Explodinggradients Ragas Agent Evaluation
- Workflow:Risingwavelabs Risingwave Iceberg Lakehouse Ingestion
- Workflow:NVIDIA NeMo Curator Text Curation Pipeline
- Workflow:Cypress io Cypress CI Pipeline Integration
- Workflow:Risingwavelabs Risingwave Streaming ETL Pipeline
- Workflow:Puppeteer Puppeteer Web Scraping And Interaction
- Workflow:Elevenlabs Elevenlabs python Realtime TTS Streaming
- Workflow:Haotian liu LLaVA Benchmark Evaluation
- Workflow:Google deepmind Dm control Multi Agent Soccer Setup
- Workflow:Datahub project Datahub Java SDK V2 Entity Management
Principles
- Principle:Openai Openai agents python Computer Use
- Principle:Huggingface Datasets SQL Dataset Building
- Principle:EvolvingLMMs Lab Lmms eval Distributed Environment Setup
- Principle:Langgenius Dify Dataset Creation
- Principle:Langfuse Langfuse Data Streaming from ClickHouse
- Principle:Liu00222 Open Prompt Injection Configuration Loading
- Principle:Mit han lab Llm awq Streaming Text Generation
- Principle:Promptfoo Promptfoo Configuration Loading
- Principle:Tensorflow Tfjs Model Compilation
- Principle:Explodinggradients Ragas Tool Call F1 Evaluation
Implementations
- Implementation:Langfuse Langfuse Batch Export Stream Transformations
- Implementation:Intel Ipex llm Transformers Trainer QLoRA
- Implementation:Sdv dev SDV GaussianCopulaSynthesizer Init
- Implementation:Infiniflow Ragflow Http Client
- Implementation:Treeverse LakeFS Java SDK ObjectsApi
- Implementation:Eventual Inc Daft Arrow Utils
- Implementation:CARLA simulator Carla Buffer
- Implementation:Mlc ai Mlc llm OLMo Loader
- Implementation:Microsoft Playwright LaunchApp
- Implementation:Hiyouga LLaMA Factory SFT Workflow
Heuristics
- Heuristic:InternLM Lmdeploy KV Cache Memory Tuning
- Heuristic:Haotian liu LLaVA Gradient Checkpointing Memory Optimization
- Heuristic:Recommenders team Recommenders Test Timing Budgets
- Heuristic:Dagster io Dagster Batch Size Tuning
- Heuristic:Webdriverio Webdriverio Click Interception Workaround
- Heuristic:Unstructured IO Unstructured Multi Python Matrix
- Heuristic:Pola rs Polars Lazy Over Eager Preference
- Heuristic:Sdv dev SDV Version Compatibility
- Heuristic:Truera Trulens Trace Compression Token Limits
- Heuristic:Rapidsai Cuml Batch Size Memory Tradeoff
Environments
- Environment:Tensorflow Serving Kubernetes Deployment Environment
- Environment:Norrrrrrr lyn WAInjectBench Conda Python 39 CUDA Environment
- Environment:Heibaiying BigData Notes Flink 1 9 Environment
- Environment:Openai Openai python Voice Helpers
- Environment:ARISE Initiative Robomimic PyTorch CUDA Environment
- Environment:DevExpress Testcafe Firefox Marionette
- Environment:Princeton nlp Tree of thought llm Python OpenAI
- Environment:SeldonIO Seldon core Kubernetes Cluster Environment
- Environment:ThreeSR Awesome Inference Time Scaling Python Runtime Environment
- Environment:Ggml org Ggml Vulkan GPU Environment