Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for machine learning and data engineering, covering 1,000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search best practices and skills for ML/AI
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Liu00222 Open Prompt Injection DataSentinel Detection
- Workflow:NVIDIA TransformerEngine Accelerate HF Gemma With TE
- Workflow:Scikit learn contrib Imbalanced learn Imbalanced Model Evaluation
- Workflow:Confident ai Deepeval AI Agent Evaluation
- Workflow:SeleniumHQ Selenium Page Object Pattern Testing
- Workflow:Anthropics Anthropic sdk python Extended Thinking Reasoning
- Workflow:Groq Groq python Streaming Chat Completion
- Workflow:Online ml River Online Clustering
- Workflow:Elevenlabs Elevenlabs python Speech to Text Transcription
- Workflow:Neuml Txtai Model Training
Principles
- Principle:AnswerDotAI RAGatouille Document Reranking
- Principle:Ggml org Llama cpp CLI Configuration
- Principle:ARISE Initiative Robosuite Controller Abstraction
- Principle:DataExpert io Data engineer handbook Spark Session Configuration
- Principle:Neuml Txtai Agent Execution
- Principle:Allenai Open instruct Streaming Generation Configuration
- Principle:Tensorflow Serving Multi Version Export
- Principle:Webdriverio Webdriverio Test Spec Authoring
- Principle:Microsoft Agent framework Handler Declaration
- Principle:Farama Foundation Gymnasium Synchronous Vector Execution
Implementations
- Implementation:Treeverse LakeFS Java SDK Model UserList
- Implementation:Ollama Ollama Imagegen Transfer Upload
- Implementation:Apache Paimon DataTypeFamily
- Implementation:Apache Druid Time Manipulation
- Implementation:DataTalksClub Data engineering zoomcamp Spark WithColumnRenamed
- Implementation:Pyro ppl Pyro Regional SIR
- Implementation:NVIDIA TransformerEngine Triton Cross Entropy
- Implementation:Langchain ai Langchain Release Workflow Dispatch
- Implementation:Microsoft Autogen Studio Agentflow
- Implementation:Pyro ppl Pyro LazyJIT
Heuristics
- Heuristic:LaurentMazare Tch rs Device Fallback Pattern
- Heuristic:Scikit learn contrib Imbalanced learn Sampling Before Split Leakage
- Heuristic:NVIDIA NeMo Curator Deduplication Blocksize Tuning
- Heuristic:Microsoft DeepSpeedExamples ZeRO Inference Throughput Tuning
- Heuristic:Hiyouga LLaMA Factory Mixed Precision Training Tips
- Heuristic:Mlflow Mlflow Nested Run Organization
- Heuristic:Duckdb Duckdb Unity Build Strategy
- Heuristic:Allenai Open instruct NCCL CUMEM Disable
- Heuristic:ChenghaoMou Text dedup False Positive Verification Tradeoff
- Heuristic:Kserve Kserve Multinode Replica Calculation
Environments
- Environment:Mit han lab Llm awq Flash Attention Environment
- Environment:Sgl project Sglang Triton
- Environment:Pola rs Polars Rust Build Environment
- Environment:Truera Trulens LangChain LangGraph Environment
- Environment:Apache Flink Hadoop Compatibility Environment
- Environment:Hpcaitech ColossalAI CUDA GPU Environment
- Environment:Speechbrain Speechbrain Speech Enhancement Dependencies
- Environment:Iamhankai Forest of Thought OpenAI API Credentials
- Environment:Apache Spark Kubernetes Runtime
- Environment:Iterative Dvc DVC Environment Variables