Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Truera Trulens Custom App Instrumentation And Evaluation
- Workflow:Haotian liu LLaVA Web Demo Deployment
- Workflow:Openai Openai agents python Multi Agent Handoff
- Workflow:Microsoft Agent framework Graph Based Workflow Execution
- Workflow:Neuml Txtai Semantic Search
- Workflow:Kserve Kserve InferenceGraph Pipeline
- Workflow:Openai Openai agents python Streaming Agent Execution
- Workflow:Pola rs Polars Data IO and Format Conversion
- Workflow:Snorkel team Snorkel Multitask Classification
- Workflow:LMCache LMCache CacheBlend KV Reuse
Principles
- Principle:DataTalksClub Data engineering zoomcamp Schema Normalization
- Principle:Vllm project Vllm Draft Model Acquisition
- Principle:Alibaba ROLL Agentic Validation
- Principle:Datahub project Datahub Connection Validation
- Principle:OpenHands OpenHands Webhook Acknowledgment
- Principle:Treeverse LakeFS Documentation Site Configuration
- Principle:Webdriverio Webdriverio BrowserStack Extension Management
- Principle:Openai Openai node Client Initialization
- Principle:Googleapis Python genai Generation Configuration
- Principle:Helicone Helicone Mapper Type Detection
Implementations
- Implementation:InternLM Lmdeploy LanguageModel
- Implementation:Lance format Lance FullZipCompressor
- Implementation:Langchain ai Langgraph Pregel Runner
- Implementation:Google research Deduplicate text datasets Make Suffix Array
- Implementation:Recommenders team Recommenders NCF Dataset Init
- Implementation:Online ml River Stats IQR
- Implementation:Duckdb Duckdb Mbedtls Hash
- Implementation:Apache Beam Twister2BatchPipelineTranslator
- Implementation:Lucidrains X transformers NeoMLP
- Implementation:Puppeteer Puppeteer Injected PQuerySelector
Heuristics
- Heuristic:CARLA simulator Carla PID Controller Tuning
- Heuristic:Allenai Open instruct Warning Archived Dev Scripts
- Heuristic:Marker Inc Korea AutoRAG Deterministic Evaluation Generation
- Heuristic:SeleniumHQ Selenium Warning Deprecated Proxy FTP Methods
- Heuristic:ArroyoSystems Arroyo Async UDF Concurrency
- Heuristic:Tensorflow Serving Servable Handle Lifetime
- Heuristic:Datahub project Datahub Git Worktree Gradle Fix
- Heuristic:Heibaiying BigData Notes Spark Streaming Local Threads Tip
- Heuristic:Marker Inc Korea AutoRAG Batch Size Tuning
- Heuristic:NVIDIA DALI Thread Affinity Optimization
Environments
- Environment:Allenai Open instruct CUDA GPU Training
- Environment:Wandb Weave Trace Server Infrastructure
- Environment:Datajuicer Data juicer Ray Cluster Environment
- Environment:DataTalksClub Data engineering zoomcamp Dlt BigQuery Environment
- Environment:Cleanlab Cleanlab Datalab Dependencies
- Environment:Bitsandbytes foundation Bitsandbytes HPU Gaudi Runtime
- Environment:Microsoft DeepSpeedExamples SuperOffload Runtime
- Environment:Lm sys FastChat API Keys And Credentials
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:ChenghaoMou Text dedup Python 3 12 Environment