Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Google research Deduplicate text datasets Cross dataset deduplication
- Workflow:Promptfoo Promptfoo Custom Provider Integration
- Workflow:Nautechsystems Nautilus trader Live trading deployment
- Workflow:PrefectHQ Prefect API Sourced ETL
- Workflow:Diagram of thought Diagram of thought DoT Trace Extraction
- Workflow:Mlfoundations Open flamingo Data Preparation
- Workflow:Langgenius Dify Visual Workflow Builder
- Workflow:Infiniflow Ragflow Knowledge Base Document Ingestion
- Workflow:Sktime Pytorch forecasting NBeats Univariate Forecasting
- Workflow:Openai Evals Building a custom eval
Principles
- Principle:Deepset ai Haystack Text File Conversion
- Principle:AUTOMATIC1111 Stable diffusion webui Weight interpolation methods
- Principle:SqueezeAILab ETS Hyperparameter Configuration
- Principle:FlowiseAI Flowise Chat Widget Embedding
- Principle:Farama Foundation Gymnasium Video Frame Saving
- Principle:Ray project Ray Actor Class Definition
- Principle:Spotify Luigi SQL Query Execution
- Principle:Ray project Ray Application Deployment
- Principle:Haifengl Smile Spatial Index Construction
- Principle:Webdriverio Webdriverio BrowserStack Extension Management
Implementations
- Implementation:Mlc ai Mlc llm Prefix Cache
- Implementation:Speechbrain Speechbrain Hparams AISHELL1 Transformer
- Implementation:Treeverse LakeFS Java SDK Model ObjectStats
- Implementation:ARISE Initiative Robosuite TextureModder
- Implementation:Risingwavelabs Risingwave Await Tree Page
- Implementation:Ucbepic Docetl FileOperations
- Implementation:Scikit learn Scikit learn OPTICS
- Implementation:Huggingface Datatrove InferenceServer
- Implementation:Fastai Fastbook Untar Data Text
- Implementation:LMCache LMCache Custom IPC Types
Heuristics
- Heuristic:ChenghaoMou Text dedup Suffix Array Merge Strategy
- Heuristic:Speechbrain Speechbrain Nonfinite Loss Handling
- Heuristic:ARISE Initiative Robomimic Rollout Horizon Selection
- Heuristic:Facebookresearch Audiocraft Codebook Dead Code Expiration
- Heuristic:Explodinggradients Ragas Embedding Batch Size Tuning
- Heuristic:NVIDIA DALI NVJPEG Memory Preallocation
- Heuristic:Trailofbits Fickling Allowlist Maintenance
- Heuristic:OpenHands OpenHands Redis Distributed Locking
- Heuristic:Openai CLIP Linear Probe Regularization C
- Heuristic:OpenGVLab InternVL Multi GPU ViT Device Mapping
Environments
- Environment:Infiniflow Ragflow Python Runtime
- Environment:Huggingface Datatrove Python Runtime
- Environment:Mbzuai oryx Awesome LLM Post training Python Pandas
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Huggingface Trl DeepSpeed Environment
- Environment:Unstructured IO Unstructured Libmagic
- Environment:Huggingface Open r1 vLLM Server
- Environment:Deepset ai Haystack Python Runtime Environment
- Environment:Mistralai Client python Agents Environment
- Environment:Apache Druid Druid Cluster Api