Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Deepspeedai DeepSpeed ZeRO Distributed Training
- Workflow:Google research Deduplicate text datasets Single file deduplication
- Workflow:Run llama Llama index Embedding Finetuning
- Workflow:Predibase Lorax OpenAI Chat Completion
- Workflow:Run llama Llama index RAG Query Pipeline
- Workflow:Cypress io Cypress Project Setup and Configuration
- Workflow:LLMBook zh LLMBook zh github io Supervised Finetuning
- Workflow:NVIDIA NeMo Curator Image Curation Pipeline
- Workflow:Datahub project Datahub Docker Quickstart Deployment
- Workflow:SeleniumHQ Selenium Selenium Grid Deployment
Principles
- Principle:HKUDS AI Trader Frontend Cache Management
- Principle:Triton inference server Server Jetson Edge Deployment
- Principle:Snorkel team Snorkel Augmented Data Combination
- Principle:Lm sys FastChat Worker Dispatch Control
- Principle:Huggingface Diffusers Video Memory Management
- Principle:Webdriverio Webdriverio Test Spec Authoring
- Principle:Ggml org Llama cpp KVCache
- Principle:TA Lib Ta lib python Pattern Signal Integration
- Principle:Google deepmind Mujoco CPU Model Loading
- Principle:Ggml org Llama cpp HF to GGUF Conversion
Implementations
- Implementation:Evidentlyai Evidently Grafana Dashboard Config
- Implementation:Datajuicer Data juicer DWposeDetector
- Implementation:NVIDIA DALI GPU Affinity
- Implementation:Iterative Dvc To Json
- Implementation:Marker Inc Korea AutoRAG Make Basic Gen Gt
- Implementation:Scikit learn Scikit learn LatentDirichletAllocation
- Implementation:Spcl Graph of thoughts Thought
- Implementation:Eventual Inc Daft Session Constructor
- Implementation:Open compass VLMEvalKit VGRPBench Score
- Implementation:Langfuse Langfuse Dataset Run Items Converters
Heuristics
- Heuristic:Avdvg InjectGuard Sim K Threshold Tuning
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:Princeton nlp Tree of thought llm API Request Batching
- Heuristic:Puppeteer Puppeteer Chrome Default Launch Arguments
- Heuristic:Huggingface Alignment handbook Gradient Checkpointing Use Cache
- Heuristic:Vespa engine Vespa KStemmer Dictionary Loading
- Heuristic:Speechbrain Speechbrain Gradient Clipping Strategy
- Heuristic:ARISE Initiative Robosuite Hard Reset Vs Soft Reset
- Heuristic:Liu00222 Open Prompt Injection Defense Strategy Selection
- Heuristic:OpenBMB UltraFeedback API Retry Strategy
Environments
- Environment:OpenGVLab InternVL PyTorch CUDA
- Environment:Huggingface Datatrove S3 Storage Environment
- Environment:Googleapis Python genai Gemini API Key Authentication
- Environment:InternLM Lmdeploy CUDA GPU Runtime
- Environment:Lm sys FastChat GPU CUDA Inference
- Environment:Huggingface Trl PEFT LoRA Environment
- Environment:Openai Whisper FFmpeg
- Environment:LMCache LMCache VLLM Serving Engine
- Environment:Diagram of thought Diagram of thought Python Graph Libraries
- Environment:Duckdb Duckdb Code Generation Tools