Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Arize ai Phoenix Span Annotation Pipeline
- Workflow:Princeton nlp Tree of thought llm ToT BFS experiment
- Workflow:Langgenius Dify Plugin Management
- Workflow:Langchain ai Langchain Adding Partner Integration
- Workflow:Eventual Inc Daft Distributed UDF Processing
- Workflow:Lm sys FastChat ShareGPT Data Pipeline
- Workflow:TA Lib Ta lib python Streaming Indicator Computation
- Workflow:Apache Dolphinscheduler Workflow Failover Recovery
- Workflow:Google deepmind Dm control Control Suite RL Training
- Workflow:Dagster io Dagster Modal Serverless Pipeline
Principles
- Principle:Google deepmind Dm control Task Registry Discovery
- Principle:OWASP Www project top 10 for large language model applications Threat Modeling Against Top 10
- Principle:Langgenius Dify Prompt Template Design
- Principle:ClickHouse ClickHouse Remote Syslog Logging
- Principle:Huggingface Diffusers Video Pipeline Selection
- Principle:Apache Paimon Developer Tooling
- Principle:FlowiseAI Flowise Node Initialization
- Principle:Tensorflow Tfjs Base Model Loading
- Principle:Dagster io Dagster Resource Management
- Principle:Online ml River Online Recommendation
Implementations
- Implementation:Haifengl Smile Neighbor Record API
- Implementation:InternLM Lmdeploy KvCacheUtils
- Implementation:ArroyoSystems Arroyo Error Types
- Implementation:CrewAIInc CrewAI Firecrawl Scrape Tool
- Implementation:Alibaba MNN PyMNN Output Processing
- Implementation:CrewAIInc CrewAI PDF Search Tool
- Implementation:SeldonIO Seldon core Hodometer Collect
- Implementation:Alibaba MNN PyMNN Module Forward
- Implementation:LMCache LMCache Internal API Server
- Implementation:Turboderp org Exllamav2 ExLlamaV2WebSocketServer
Heuristics
- Heuristic:BerriAI Litellm Connection Pooling Memory Management
- Heuristic:Marker Inc Korea AutoRAG Batch Size Tuning
- Heuristic:Volcengine Verl Inplace Operations OOM Prevention
- Heuristic:Dagster io Dagster Batch Size Tuning
- Heuristic:Teamcapybara Capybara Async Waiting And Retry
- Heuristic:Princeton nlp Tree of thought llm Value Caching
- Heuristic:Sgl project Sglang Chunked Prefill OOM Prevention
- Heuristic:Openai Openai python Fine Tuning Data Preparation Tips
- Heuristic:Tensorflow Serving Servable Handle Lifetime
- Heuristic:Cypress io Cypress Timeout Tuning
Environments
- Environment:Nightwatchjs Nightwatch Selenium WebDriver 4
- Environment:Microsoft DeepSpeedExamples SuperOffload Runtime
- Environment:Princeton nlp SimPO VLLM Inference
- Environment:Fastai Fastbook Python FastAI Environment
- Environment:Treeverse LakeFS LakeFS Server Environment
- Environment:Huggingface Datasets TensorFlow Integration
- Environment:Tencent Ncnn Build Environment
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:Spcl Graph of thoughts Local LLaMA GPU Inference
- Environment:Mbzuai oryx Awesome LLM Post training Python Matplotlib