Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Protectai Llm guard LLM Input Output Scanning
- Workflow:Microsoft Onnxruntime ORTModule Training
- Workflow:Groq Groq python Batch Processing
- Workflow:MaterializeInc Materialize Upgrade Testing
- Workflow:CrewAIInc CrewAI Sequential Crew Execution
- Workflow:Groq Groq python Audio Transcription
- Workflow:Microsoft LoRA GPT2 NLG Finetuning
- Workflow:Openclaw Openclaw Agent Message Loop
- Workflow:CrewAIInc CrewAI Knowledge RAG Pipeline
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
Principles
- Principle:Speechbrain Speechbrain Custom Batch Training For Separation
- Principle:Elevenlabs Elevenlabs python Text Source Preparation
- Principle:Eric mitchell Direct preference optimization Batch Data Pipeline
- Principle:Duckdb Duckdb Fast Number Parsing
- Principle:Huggingface Diffusers Selective Test Execution
- Principle:Axolotl ai cloud Axolotl Vision Language Model Loading
- Principle:Spotify Luigi Task Simulation
- Principle:Datahub project Datahub Stack Lifecycle Management
- Principle:Bitsandbytes foundation Bitsandbytes XPU Backend Operations
- Principle:Risingwavelabs Risingwave Iceberg Catalog Operations
Implementations
- Implementation:Intel Ipex llm NPU Multimodal MiniCPM
- Implementation:Huggingface Datatrove ContextShuffler
- Implementation:Mlc ai Mlc llm Package
- Implementation:LaurentMazare Tch rs Tensor Ops
- Implementation:Hpcaitech ColossalAI TableLoader
- Implementation:LMCache LMCache SGLang Adapter
- Implementation:Openclaw Openclaw StartGatewayServer
- Implementation:Apache Paimon ConfigOptions Python
- Implementation:Ucbepic Docetl LeaseContractExplorer
- Implementation:Open compass VLMEvalKit Ristretto
Heuristics
- Heuristic:Mistralai Client python Resource Context Manager
- Heuristic:Lm sys FastChat GPU Memory Allocation Strategy
- Heuristic:Elevenlabs Elevenlabs python Audio Buffer Sizes
- Heuristic:Tencent Ncnn Lightmode Memory Optimization
- Heuristic:Neuml Txtai Faiss Index Sizing Tip
- Heuristic:Liu00222 Open Prompt Injection Cosine Similarity Segmentation Threshold
- Heuristic:Heibaiying BigData Notes HBase Connection Thread Safety Tip
- Heuristic:HKUDS AI Trader DeepSeek Tool Args Workaround
- Heuristic:Fede1024 Rust rdkafka Multi Version Dependency Hazard
- Heuristic:Mbzuai oryx Awesome LLM Post training Excel Sheet Name Truncation
Environments
- Environment:Apache Druid Druid Cluster Api
- Environment:Alibaba ROLL Megatron Training Environment
- Environment:DistrictDataLabs Yellowbrick Optional NLP Dependencies
- Environment:Apache Kafka Committer Tools Environment
- Environment:Recommenders team Recommenders Spark Environment
- Environment:Treeverse LakeFS Web UI Environment
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:BerriAI Litellm Observability Stack
- Environment:Spcl Graph of thoughts Local LLaMA GPU Inference
- Environment:Apache Spark Release Build Environment