Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:PeterL1n BackgroundMattingV2 Realtime webcam matting
- Workflow:SeleniumHQ Selenium Selenium Grid Deployment
- Workflow:Openai Openai agents python Multi Agent Handoff
- Workflow:VainF Torch Pruning Vision Transformer Pruning
- Workflow:Huggingface Alignment handbook SFT DPO Alignment Pipeline
- Workflow:Liu00222 Open Prompt Injection Prompt Injection Experiment
- Workflow:Hpcaitech ColossalAI Supervised Finetuning
- Workflow:Apache Spark Release Process
- Workflow:Heibaiying BigData Notes Storm Topology Development
- Workflow:Predibase Lorax Multi Adapter Merging
Principles
- Principle:TobikoData Sqlmesh Web UI
- Principle:Allenai Open instruct Reward Model Evaluation
- Principle:Mlc ai Mlc llm Weight Conversion and Quantization
- Principle:Kserve Kserve Multi Model Prediction
- Principle:Langgenius Dify Model Provider Management
- Principle:AUTOMATIC1111 Stable diffusion webui Webui Performance Profiling
- Principle:Axolotl ai cloud Axolotl SFT Training Execution
- Principle:Tensorflow Tfjs Pretrained Model Loading
- Principle:Hiyouga LLaMA Factory Rotary Position Embedding
- Principle:Webdriverio Webdriverio Custom Service Development
Implementations
- Implementation:Risingwavelabs Risingwave SourceHandler Interface
- Implementation:Mlc ai Mlc llm Serve Data
- Implementation:ArroyoSystems Arroyo Validate UDF
- Implementation:Neuml Txtai Graph Query
- Implementation:Google research Deduplicate text datasets Load Dataset TFDS
- Implementation:CARLA simulator Carla Client Start Recorder
- Implementation:Vllm project Vllm CPU Types X86
- Implementation:Facebookresearch Habitat lab VER RolloutStorage
- Implementation:Sdv dev SDV DayZSynthesizer Multi Table
- Implementation:Haosulab ManiSkill Sim2RealEnv
Heuristics
- Heuristic:Google deepmind Dm control MJCF Model Composition Gotchas
- Heuristic:Cohere ai Cohere python Tokenizer Cache With TTL
- Heuristic:TA Lib Ta lib python Thread Safety With Abstract API
- Heuristic:Ggml org Ggml Thread Count Selection
- Heuristic:Openai CLIP Class Name Curation
- Heuristic:Langgenius Dify SQL Escape Backslash First
- Heuristic:Mage ai Mage ai HTTP Request Timeout Defaults
- Heuristic:Tencent Ncnn FP16 Precision Selection
- Heuristic:Rapidsai Cuml Batch Size Memory Tradeoff
- Heuristic:Cleanlab Cleanlab Confident Threshold Heuristic
Environments
- Environment:Apache Airflow Python Runtime Environment
- Environment:Marker Inc Korea AutoRAG API Keys And Credentials
- Environment:Deepseek ai Janus CUDA GPU Environment
- Environment:Datajuicer Data juicer Ray Cluster Environment
- Environment:Apache Dolphinscheduler Java Runtime
- Environment:Spcl Graph of thoughts OpenAI API Access
- Environment:Huggingface Trl PEFT LoRA Environment
- Environment:Iterative Dvc Python Runtime
- Environment:AUTOMATIC1111 Stable diffusion webui GPU Compute Backend
- Environment:SeleniumHQ Selenium Selenium Manager Runtime