Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Teamcapybara Capybara Element Finding And Interaction
- Workflow:Huggingface Datatrove FineWeb Dataset Creation
- Workflow:EvolvingLMMs Lab Lmms eval Custom Model Integration
- Workflow:Turboderp org Exllamav2 Text Generation
- Workflow:Deepset ai Haystack Hybrid Document Search
- Workflow:Gretelai Gretel synthetics DGAN Timeseries Generation
- Workflow:Openai Openai python Realtime Conversation
- Workflow:Puppeteer Puppeteer Cross Browser Automation
- Workflow:Openai Whisper Audio Transcription
- Workflow:LLMBook zh LLMBook zh github io DPO Alignment
Principles
- Principle:Huggingface Datasets Dataset From Pandas Construction
- Principle:FMInference FlexLLMGen Prediction Evaluation Metrics
- Principle:Mit han lab Llm awq Quantized Linear Module
- Principle:Allenai Open instruct Ray Cluster Setup
- Principle:Apache Airflow Local Testing
- Principle:Apache Airflow Provider Documentation
- Principle:Huggingface Peft Prefix Tuning
- Principle:Microsoft LoRA Distributed LoRA Training
- Principle:Fastai Fastbook Tokenization
- Principle:SqueezeAILab ETS ILP Node Selection
Implementations
- Implementation:InternLM Lmdeploy Gemm TunerParams
- Implementation:Getgauge Taiko RadioButton Element
- Implementation:Apache Druid HjsonCompletions
- Implementation:Cypress io Cypress DetectFramework
- Implementation:Apache Paimon View
- Implementation:Ggml org Llama cpp Download
- Implementation:Cohere ai Cohere python ToolCall Model
- Implementation:Apache Kafka Kafka Run Class Exec
- Implementation:HKUDS AI Trader A Stock Daily Price Data
- Implementation:Alibaba ROLL Compute Response Level Rewards
Heuristics
- Heuristic:HKUDS AI Trader Sortino Ratio Capping
- Heuristic:DataExpert io Data engineer handbook Flink Checkpointing Interval Tuning
- Heuristic:Rapidsai Cuml Batch Size Memory Tradeoff
- Heuristic:CARLA simulator Carla Sensor Queue Synchronization Pattern
- Heuristic:Huggingface Trl Distributed Device Map Override
- Heuristic:Diagram of thought Diagram of thought Strict Vs Flexible Critic Rigor
- Heuristic:ArroyoSystems Arroyo Stateful Operator TTL
- Heuristic:Run llama Llama index Finetuning Warmup Steps
- Heuristic:Vibrantlabsai Ragas Concurrency And Retry Configuration
- Heuristic:Openai Evals Eval Resumption Strategy
Environments
- Environment:DataExpert io Data engineer handbook Python Development Environment
- Environment:Huggingface Optimum GPTQ Quantization Environment
- Environment:Astronomer Astronomer cosmos Cloud Provider Dependencies
- Environment:Langgenius Dify Credentials And Env Vars
- Environment:Togethercomputer Together python API Credentials
- Environment:Nautechsystems Nautilus trader Python Cython Rust Runtime
- Environment:Getgauge Taiko Docker Container
- Environment:Lm sys FastChat Python Core Dependencies
- Environment:Diagram of thought Diagram of thought Python Graph Libraries
- Environment:Google deepmind Dm control EGL Headless Rendering