Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Sgl project Sglang Multimodal Vision Language Inference
- Workflow:Princeton nlp SimPO SimPO Training
- Workflow:Groq Groq python Audio Transcription
- Workflow:Princeton nlp SimPO Model Inference
- Workflow:Romsto Speculative Decoding Ngram Assisted Speculative Decoding
- Workflow:LMCache LMCache CacheBlend KV Reuse
- Workflow:Google deepmind Dm control MJCF Model Composition
- Workflow:Open compass VLMEvalKit Adding Custom Benchmark
- Workflow:Apache Paimon Blob Storage With Descriptors
- Workflow:Alibaba ROLL Reward Flow Diffusion Pipeline
Principles
- Principle:Bentoml BentoML Model Cleanup
- Principle:OpenHands OpenHands Business Route Mounting
- Principle:Triton inference server Server ORCA Load Reporting
- Principle:Googleapis Python genai Pagination
- Principle:Fastai Fastbook Language Model Data
- Principle:Recommenders team Recommenders ALS Matrix Factorization
- Principle:Huggingface Datatrove Document Filtering Framework
- Principle:FMInference FlexLLMGen Model Replacement Policy
- Principle:Ggml org Llama cpp Multimodal Encoding And Generation
- Principle:Spotify Luigi HPC Batch Execution
Implementations
- Implementation:Ggml org Llama cpp Peg Parser Header
- Implementation:Infiniflow Ragflow LinkDataPipeline Component
- Implementation:Huggingface Trl Get Dataset GRPO
- Implementation:Triton inference server Server Optimal Config Application
- Implementation:Pyro ppl Pyro ProfileHMM
- Implementation:ArroyoSystems Arroyo Kinesis Sink
- Implementation:Hpcaitech ColossalAI ExperienceMaker Base
- Implementation:Langfuse Langfuse Comments Repository
- Implementation:Triton inference server Server GenQaReshapeModels
- Implementation:Ggml org Ggml Opencl backend
Heuristics
- Heuristic:Nautechsystems Nautilus trader Inflight Order Check Threshold
- Heuristic:CrewAIInc CrewAI Rate Limiting Strategy
- Heuristic:Lucidrains X transformers Rotary Position Embedding Selection
- Heuristic:Mlc ai Web llm Penalty Parameter Defaults
- Heuristic:Huggingface Datasets Parquet Shard Sizing
- Heuristic:Groq Groq python Retry Backoff Strategy
- Heuristic:Sktime Pytorch forecasting Batch Size Selection
- Heuristic:Treeverse LakeFS Action Cache Wait Tip
- Heuristic:Cohere ai Cohere python ToolCallV2 Auto UUID Override
- Heuristic:Turboderp org Exllamav2 Quantization Conversion Tips
Environments
- Environment:Microsoft Onnxruntime Distributed Training Environment
- Environment:Apache Dolphinscheduler Java Runtime
- Environment:Sgl project Sglang Multimodal
- Environment:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Python Environment
- Environment:Vibrantlabsai Ragas Optional NLP Metrics Environment
- Environment:Isaac sim IsaacGymEnvs Python CUDA Runtime
- Environment:ClickHouse ClickHouse Linux Build Environment
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Intel Ipex llm Portable Environment
- Environment:Bentoml BentoML Triton Inference Server