Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Speechbrain Speechbrain CTC ASR Training
- Workflow:Ollama Ollama Custom Model Creation
- Workflow:Mistralai Client python OCR Document Processing
- Workflow:Langfuse Langfuse Prompt management lifecycle
- Workflow:Datajuicer Data juicer Dataset Quality Analysis
- Workflow:Intel Ipex llm QLoRA Finetuning
- Workflow:Openai Openai agents python Tool Integrated Agent
- Workflow:Openai Openai agents python Multi Agent Handoff
- Workflow:Openai Openai python Embeddings Generation
- Workflow:Duckdb Duckdb Benchmark Execution
Principles
- Principle:Nightwatchjs Nightwatch Client Configuration
- Principle:Langchain ai Langchain Release Triggering
- Principle:Openai Openai node Realtime Conversation Interaction
- Principle:Princeton nlp SimPO Multi Seed Response Generation
- Principle:Openai Evals Persistent Memory Management
- Principle:Apache Kafka Coordinator Read Operations
- Principle:Microsoft Autogen Handoff Routing
- Principle:Nautechsystems Nautilus trader Backtest Engine Configuration
- Principle:ChenghaoMou Text dedup False Positive Verification SimHash
- Principle:Google deepmind Mujoco Sensor Pipeline
Implementations
- Implementation:Langgenius Dify Webapp Auth
- Implementation:Microsoft Autogen Studio Gallery Detail
- Implementation:Googleapis Python genai Models Edit Image
- Implementation:Promptfoo Promptfoo resolveConfigs
- Implementation:Apache Hudi HoodieTableSource GetScanRuntimeProvider
- Implementation:Risingwavelabs Risingwave JniDbzSourceHandler RunJniDbzSourceThread
- Implementation:Mage ai Mage ai Twitter Ads Tap Init
- Implementation:Datajuicer Data juicer Load Formatter
- Implementation:Infiniflow Ragflow UserService
- Implementation:Open compass VLMEvalKit Spatial457
Heuristics
- Heuristic:Microsoft Agent framework Function Invocation Defaults
- Heuristic:LaurentMazare Tch rs MPS Weight Loading Workaround
- Heuristic:Risingwavelabs Risingwave Memory Cache Eviction Policy
- Heuristic:Spcl Graph of thoughts Scoring With Error Counting
- Heuristic:Spotify Luigi Dynamic Requirements Generator
- Heuristic:Openai Evals Event Batching Configuration
- Heuristic:Langfuse Langfuse ClickHouse FINAL Skip Optimization
- Heuristic:Unstructured IO Unstructured Chunk Size Tuning
- Heuristic:Langchain ai Langchain Pydantic V2 Configuration Tips
- Heuristic:Deepseek ai Janus Bfloat16 Dtype Selection
Environments
- Environment:Snorkel team Snorkel PySpark
- Environment:Langchain ai Langgraph Docker Deployment Environment
- Environment:Triton inference server Server TRT LLM Deployment
- Environment:Ray project Ray Java Build Environment
- Environment:Hiyouga LLaMA Factory FP8 Training Environment
- Environment:Rapidsai Cuml CUDA GPU
- Environment:Openai CLIP PyTorch CUDA Runtime
- Environment:Sgl project Sglang CUDA
- Environment:Sgl project Sglang CPU
- Environment:Arize ai Phoenix LLM Provider SDKs