Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Obss Sahi COCO Evaluation
- Workflow:Facebookresearch Habitat lab HITL Interactive Evaluation
- Workflow:Spotify Luigi Local Batch Pipeline
- Workflow:Cohere ai Cohere python AWS Bedrock Deployment
- Workflow:Datahub project Datahub Python Metadata Emission
- Workflow:Datahub project Datahub Java SDK Metadata Emission
- Workflow:Trailofbits Fickling PyTorch Payload Injection
- Workflow:Arize ai Phoenix Dataset and Experiment Lifecycle
- Workflow:Ggml org Llama cpp LoRA Adapter Workflow
- Workflow:Pytorch Serve Model Deployment
Principles
- Principle:ClickHouse ClickHouse CMake Build Configuration
- Principle:Nightwatchjs Nightwatch Lifecycle Hooks
- Principle:Nightwatchjs Nightwatch Module Import
- Principle:Huggingface Diffusers DreamBooth Export
- Principle:PrefectHQ Prefect HTML Fetching
- Principle:Bitsandbytes foundation Bitsandbytes CPU SIMD Dequantization
- Principle:Zai org CogVideo Lookup Free Quantization
- Principle:Tensorflow Serving HTTP Server Setup
- Principle:NVIDIA NeMo Aligner SFT Data Preparation
- Principle:Apache Kafka SVN Artifact Staging
Implementations
- Implementation:Speechbrain Speechbrain Compute Embeddings
- Implementation:Datajuicer Data juicer OverallAnalysis Analyze
- Implementation:SeleniumHQ Selenium NetworkInterceptor Constructor
- Implementation:Astronomer Astronomer cosmos DbtRunAirflowAsyncBigqueryOperator
- Implementation:Neuml Txtai ImageHash
- Implementation:Ollama Ollama MLXRunner KV Cache
- Implementation:Pyro ppl Pyro Predictive MCMC
- Implementation:ARISE Initiative Robosuite LidObject
- Implementation:Vespa engine Vespa VespaLogHandler Publish
- Implementation:Microsoft Playwright Server Playwright
Heuristics
- Heuristic:Openai CLIP Template Ensemble For Zero Shot
- Heuristic:Axolotl ai cloud Axolotl Gradient Checkpointing Reentrant Rules
- Heuristic:Unstructured IO Unstructured Strategy Fallback Chain
- Heuristic:Lm sys FastChat Flash Attention GPU Requirements
- Heuristic:Puppeteer Puppeteer Mac Silicon Performance
- Heuristic:Ggml org Ggml Quantization Type Selection
- Heuristic:Huggingface Datasets Flatten Indices Performance
- Heuristic:Ucbepic Docetl Validation Retry Strategy
- Heuristic:Mbzuai oryx Awesome LLM Post training Checkpoint Every 3 Papers
- Heuristic:Norrrrrrr lyn WAInjectBench Zero Vector Fallback Failed Embeddings
Environments
- Environment:Vllm project Vllm CUDA
- Environment:FlowiseAI Flowise Database Environment
- Environment:Alibaba MNN GPU OpenCL Environment
- Environment:Confident ai Deepeval LLM Provider Credentials
- Environment:Online ml River Build Toolchain
- Environment:DataTalksClub Data engineering zoomcamp Kafka Confluent Environment
- Environment:Liu00222 Open Prompt Injection Python Dependencies
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:Mlfoundations Open flamingo WebDataset Training Dependencies
- Environment:Snorkel team Snorkel SpaCy NLP