Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:ContextualAI HALOs Online Iterative Alignment
- Workflow:ChenghaoMou Text dedup SimHash Deduplication
- Workflow:Microsoft Playwright AI agent driven testing
- Workflow:Microsoft BIPIA Attack Success Rate Evaluation
- Workflow:Gretelai Gretel synthetics DGAN Timeseries Generation
- Workflow:Volcengine Verl PPO Training With Reward Model
- Workflow:Promptfoo Promptfoo Red Team Security Scan
- Workflow:Openai Openai agents python Guardrails Secured Agent
- Workflow:BerriAI Litellm SDK Completion
- Workflow:Googleapis Python genai Image Generation Pipeline
Principles
- Principle:Lucidrains X transformers Variational Latent Language Modeling
- Principle:CARLA simulator Carla Unreal Engine Build
- Principle:Mlc ai Mlc llm Model Library Compilation
- Principle:Hpcaitech ColossalAI Distributed Environment Initialization
- Principle:Apache Beam Pipeline Event Reporting
- Principle:Apache Paimon Indexed Split Result Retrieval
- Principle:Mit han lab Llm awq NVILA Multimodal Architecture
- Principle:Apache Dolphinscheduler Frontend Build Toolchain
- Principle:Duckdb Duckdb Source Amalgamation
- Principle:Alibaba ROLL SFT Configuration
Implementations
- Implementation:Diagram of thought Diagram of thought Summarizer Validated Node Synthesis
- Implementation:FlagOpen FlagEmbedding Matryoshka Mistral Model Compensation
- Implementation:Neuml Txtai Production Deployment Tools
- Implementation:Microsoft Playwright CssTokenizer
- Implementation:Openai Openai node Pagination
- Implementation:Ollama Ollama CreateHandler
- Implementation:Turboderp org Exllamav2 ExLlamaV2MoEMLP
- Implementation:CARLA simulator Carla Python Commands Bindings
- Implementation:Anthropics Anthropic sdk python MessageCountTokensParams
- Implementation:Evidentlyai Evidently Legacy UI App
Heuristics
- Heuristic:Bigscience workshop Petals Batch Splitting Threshold
- Heuristic:ContextualAI HALOs FSDP Sampling Workaround
- Heuristic:Langchain ai Langgraph Retry Policy Configuration
- Heuristic:Cleanlab Cleanlab KNN Distance Metric Selection
- Heuristic:Langchain ai Langgraph Stream Mode Selection
- Heuristic:Ggml org Ggml Gradient Accumulation Batch Sizing
- Heuristic:HKUDS AI Trader Position File Locking
- Heuristic:EvolvingLMMs Lab Lmms eval Limit Flag Testing Only
- Heuristic:Farama Foundation Gymnasium Render Mode Selection Guide
- Heuristic:Puppeteer Puppeteer Chrome Default Launch Arguments
Environments
- Environment:Huggingface Optimum Accelerated Inference Environment
- Environment:Predibase Lorax Python Server Dependencies
- Environment:Huggingface Transformers Python 310 Runtime
- Environment:Allenai Open instruct vLLM Inference
- Environment:Openai Whisper PyTorch CUDA
- Environment:DataTalksClub Data engineering zoomcamp Dlt BigQuery Environment
- Environment:Langchain ai Langchain OpenAI API Credentials
- Environment:Openclaw Openclaw Node 22 Runtime
- Environment:ArroyoSystems Arroyo Object Storage
- Environment:Facebookresearch Audiocraft Python PyTorch CUDA Environment