Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Risingwavelabs Risingwave Sink Connector Pipeline
- Workflow:Apache Hudi Docker Demo Setup
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Apache Spark Application Submission
- Workflow:Neuml Txtai Agent Orchestration
- Workflow:Vespa engine Vespa Config subscription lifecycle
- Workflow:Langgenius Dify Knowledge Base Management
- Workflow:Hpcaitech ColossalAI Distributed GRPO Training
- Workflow:Tensorflow Tfjs Pretrained Model Conversion And Inference
- Workflow:Online ml River Streaming Anomaly Detection
Principles
- Principle:Sktime Pytorch forecasting Quantile Loss
- Principle:Ollama Ollama Model Manifest Creation
- Principle:LaurentMazare Tch rs Deep Deterministic Policy Gradient
- Principle:PacktPublishing LLM Engineers Handbook Chunking And Embedding
- Principle:Mbzuai oryx Awesome LLM Post training Paper Categorization
- Principle:Tensorflow Serving TFRT Model Management
- Principle:Langgenius Dify ErrorHandling
- Principle:Mlfoundations Open flamingo Visual Question Answering Evaluation
- Principle:Speechbrain Speechbrain Speaker Diarization Pipeline
- Principle:Speechbrain Speechbrain Whisper Dataset Preparation
Implementations
- Implementation:Deepspeedai DeepSpeed IO Handle
- Implementation:Pyro ppl Pyro Trace Struct
- Implementation:LMCache LMCache Chunk Statistics Lookup Client
- Implementation:Infiniflow Ragflow MetadataManageValuesModal Component
- Implementation:Huggingface Diffusers LoRA Training Loop
- Implementation:Puppeteer Puppeteer Bidi Realm
- Implementation:Scikit learn Scikit learn FetchOpenml
- Implementation:Mlflow Mlflow Set Model Version Tag
- Implementation:Treeverse LakeFS Java SDK Model AuthenticationToken
- Implementation:CARLA simulator Carla InstallPrerequisites Script
Heuristics
- Heuristic:Huggingface Peft Warning Deprecated Bone
- Heuristic:Openai Whisper Median Word Duration Clamping
- Heuristic:ClickHouse ClickHouse Debug Build Tips
- Heuristic:Spcl Graph of thoughts GoT Decompose Sort Merge Strategy
- Heuristic:Helicone Helicone Provider URL Regex Priority
- Heuristic:Huggingface Alignment handbook EOS Token Alignment
- Heuristic:Lm sys FastChat GPU Memory Allocation Strategy
- Heuristic:Google research Deduplicate text datasets HACKSIZE Overlap Buffer
- Heuristic:TA Lib Ta lib python STOCHRSI Vs STOCH RSI
- Heuristic:BerriAI Litellm Connection Pooling Memory Management
Environments
- Environment:Volcengine Verl Megatron Core Environment
- Environment:NVIDIA TransformerEngine CUDA Toolkit Requirements
- Environment:Sktime Pytorch forecasting Optuna Tuning Dependencies
- Environment:Duckdb Duckdb Code Formatting Tools
- Environment:Spotify Luigi SQLAlchemy Database
- Environment:Unslothai Unsloth CUDA BitsAndBytes
- Environment:Huggingface Datatrove Processing Dependencies
- Environment:PrefectHQ Prefect Prefect Server Database
- Environment:Online ml River Build Toolchain
- Environment:Guardrails ai Guardrails OpenTelemetry Tracing