Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:TobikoData Sqlmesh Model development and testing
- Workflow:Langgenius Dify Knowledge Base Creation
- Workflow:Huggingface Alignment handbook SFT DPO Alignment Pipeline
- Workflow:Run llama Llama index Evaluation Pipeline
- Workflow:Iterative Dvc Data Tracking
- Workflow:Junyanz Pytorch CycleGAN and pix2pix CycleGAN Training
- Workflow:DistrictDataLabs Yellowbrick Classification Model Evaluation
- Workflow:Anthropics Anthropic sdk python Structured Output Extraction
- Workflow:PeterL1n BackgroundMattingV2 Model export
- Workflow:OpenRLHF OpenRLHF Rejection Sampling
Principles
- Principle:Neuml Txtai Pipeline Definition
- Principle:Vespa engine Vespa Runtime Level Control
- Principle:ClickHouse ClickHouse Binary Linking
- Principle:DataTalksClub Data engineering zoomcamp Dbt Source Declaration
- Principle:Roboflow Rf detr Image Preprocessing
- Principle:LaurentMazare Tch rs Generated FFI Bindings
- Principle:Ggml org Ggml Transformer Graph Construction
- Principle:Pytorch Serve Parallelism Strategy
- Principle:EvolvingLMMs Lab Lmms eval Job Submission
- Principle:Google research Deduplicate text datasets Dataset Serialization TFDS
Implementations
- Implementation:OpenGVLab InternVL ScienceQA Prompt Conversion
- Implementation:Neuml Txtai Agent Init
- Implementation:Avhz RustQuant AnalyticOptionPricer Greeks
- Implementation:Junyanz Pytorch CycleGAN and pix2pix CycleGANModel Optimize Parameters
- Implementation:Microsoft Onnxruntime OrtEnvironment
- Implementation:Apache Druid Type Registry
- Implementation:Speechbrain Speechbrain Prepare Switchboard LM
- Implementation:Ucbepic Docetl ExtractOperation Execute
- Implementation:Risingwavelabs Risingwave ConfigurableOffsetBackingStore
- Implementation:Webdriverio Webdriverio Workers Types
Heuristics
- Heuristic:Neuml Txtai LLM Context Window Fallback
- Heuristic:Triton inference server Server Documentation Standards
- Heuristic:Ucbepic Docetl Token Counting And Truncation
- Heuristic:Bentoml BentoML Warning Deprecated Runner Class
- Heuristic:Huggingface Diffusers Dtype Precision Selection
- Heuristic:CarperAI Trlx Batch Size Tuning
- Heuristic:Apache Shardingsphere Version Cleanup After Switch
- Heuristic:Fastai Fastbook Random Forest Defaults
- Heuristic:Spotify Luigi Dynamic Requirements Generator
- Heuristic:Vespa engine Vespa KStemmer Dictionary Loading
Environments
- Environment:Langgenius Dify Docker Compose Environment
- Environment:OWASP Www project top 10 for large language model applications GenAI Red Team Environment
- Environment:Nautechsystems Nautilus trader Asyncio Uvloop Event Loop
- Environment:Puppeteer Puppeteer Configuration Environment Variables
- Environment:Tensorflow Serving Build Environment
- Environment:Farama Foundation Gymnasium Python 3 10 Runtime
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:Microsoft Onnxruntime Distributed Training Environment
- Environment:Turboderp org Exllamav2 CUDA GPU Runtime
- Environment:Infiniflow Ragflow Python Runtime