Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Pola rs Polars Lazy Query Pipeline
- Workflow:Junyanz Pytorch CycleGAN and pix2pix Pix2pix Training
- Workflow:Mage ai Mage ai SQL Database Source Extraction
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit Optimizer Training
- Workflow:Volcengine Verl Data Preprocessing For RL
- Workflow:Diagram of thought Diagram of thought DoT Trace Extraction
- Workflow:Huggingface Transformers Model Training With Trainer
- Workflow:Mlflow Mlflow LLM Tracing
- Workflow:Junyanz Pytorch CycleGAN and pix2pix CycleGAN Training
- Workflow:Fastai Fastbook Tabular Modeling
Principles
- Principle:Online ml River Stream Data Sources
- Principle:Apache Airflow Task Execution TEI
- Principle:Ollama Ollama ML Backend Abstraction
- Principle:ClickHouse ClickHouse HTTP Client Communication
- Principle:Huggingface Datasets CSV Dataset Building
- Principle:Langchain ai Langchain Release Preparation
- Principle:Nautechsystems Nautilus trader Order Position Event Handling
- Principle:EvolvingLMMs Lab Lmms eval Baseline Comparison
- Principle:Heibaiying BigData Notes Kafka Consumer Configuration
- Principle:Kornia Kornia Inlier Extraction
Implementations
- Implementation:Openai Openai python Realtime Connect
- Implementation:Google deepmind Dm control CMU Subsets
- Implementation:Mage ai Mage ai Source Connector Directory Pattern
- Implementation:Sail sg LongSpec APPS Code Evaluator
- Implementation:Puppeteer Puppeteer Injected TextContent
- Implementation:Ray project Ray Serve ReplicaContext
- Implementation:Deepset ai Haystack SentenceTransformersTextEmbedder
- Implementation:Run llama Llama index BatchEvalRunner Evaluate Queries
- Implementation:ARISE Initiative Robosuite TransformUtils
- Implementation:Mage ai Mage ai GitHub PR Commits Schema
Heuristics
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:Apache Kafka JVM GC Tuning Defaults
- Heuristic:Sgl project Sglang Chunked Prefill OOM Prevention
- Heuristic:Predibase Lorax Quantization Backend Selection
- Heuristic:OpenHands OpenHands Fail Open Rate Limiting
- Heuristic:Tencent Ncnn Vulkan Pipeline Warmup
- Heuristic:PeterL1n BackgroundMattingV2 Training Batch Size And Resolution
- Heuristic:Langgenius Dify Token Refresh Loop Prevention
- Heuristic:Googleapis Python genai AFC Max Remote Calls Limit
- Heuristic:Mage ai Mage ai Sorted Data Bookmark Strategy
Environments
- Environment:Vespa engine Vespa POSIX Mmap Log Control
- Environment:NVIDIA NeMo Curator NVIDIA DALI
- Environment:Huggingface Transformers Flash Attention 2 Env
- Environment:Apache Kafka Gradle Build Environment
- Environment:Webdriverio Webdriverio Cloud Service Credentials
- Environment:FlowiseAI Flowise Docker Environment
- Environment:Facebookresearch Audiocraft AudioCraft Environment Variables
- Environment:Dagster io Dagster PostgreSQL Storage
- Environment:Lakeraai Pint benchmark Python 310 With Pandas
- Environment:Apache Shardingsphere Etcd Cluster Coordination