Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lm sys FastChat LoRA QLoRA Finetuning
- Workflow:Elevenlabs Elevenlabs python Speech to Text Transcription
- Workflow:Alibaba ROLL Supervised Finetuning Pipeline
- Workflow:Nightwatchjs Nightwatch Cucumber BDD Integration
- Workflow:OpenRLHF OpenRLHF Iterative DPO
- Workflow:BerriAI Litellm Response Caching
- Workflow:ClickHouse ClickHouse Server Deployment
- Workflow:Ggml org Ggml Vision Model Inference
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:Kserve Kserve InferenceGraph Pipeline
Principles
- Principle:Huggingface Trl PPO Argument Configuration
- Principle:AUTOMATIC1111 Stable diffusion webui Component Reuse
- Principle:Google deepmind Mujoco Collision Detection
- Principle:Huggingface Datatrove Sentence Level Statistics
- Principle:Facebookresearch Audiocraft Training Checkpoint Management
- Principle:Onnx Onnx Tensor Specification
- Principle:DataExpert io Data engineer handbook Event Tracking
- Principle:Hiyouga LLaMA Factory Muon Optimization
- Principle:Microsoft Agent framework Custom Aggregation Pattern
- Principle:Huggingface Transformers Benchmark Orchestration V1
Implementations
- Implementation:Langfuse Langfuse API Comments Schema
- Implementation:Microsoft Onnxruntime Module TrainStep
- Implementation:Datahub project Datahub KafkaEmitterConfig
- Implementation:Microsoft DeepSpeedExamples LossTracker AverageMeter Accuracy
- Implementation:Apache Druid QueryTab ProcessQuery
- Implementation:Ggml org Llama cpp Get Evaluation Dataset
- Implementation:Mlc ai Mlc llm Bundle weight
- Implementation:Intel Ipex llm NPU BCE Embedding
- Implementation:Treeverse LakeFS Java SDK Model Setup
- Implementation:ARISE Initiative Robosuite HumanoidModel
Heuristics
- Heuristic:Google deepmind Dm control Physics Timestep Configuration
- Heuristic:Apache Druid Capability Detection Strategy
- Heuristic:ChenghaoMou Text dedup SimHash Optimization Ceiling
- Heuristic:Microsoft Semantic kernel Prompt Injection Safety
- Heuristic:Togethercomputer Together python Repetition Penalty Conflict
- Heuristic:Cohere ai Cohere python Embed Auto Batching Strategy
- Heuristic:Snorkel team Snorkel Precision Init Prior
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
- Heuristic:Facebookresearch Habitat lab Force Single Threaded PyTorch
- Heuristic:ThreeSR Awesome Inference Time Scaling API Rate Limiting Tip
Environments
- Environment:Microsoft Autogen Extension Optional Dependencies
- Environment:Openai CLIP Python Dependencies
- Environment:Run llama Llama index Sentence Transformers Finetuning
- Environment:Deepset ai Haystack Python Runtime Environment
- Environment:DataTalksClub Data engineering zoomcamp Dbt DuckDB Environment
- Environment:Webdriverio Webdriverio Cloud Service Credentials
- Environment:Dotnet Machinelearning Platform Architecture Support
- Environment:Explodinggradients Ragas Optional Metrics Environment
- Environment:MarketSquare Robotframework browser Docker Container
- Environment:Wandb Weave Python SDK Runtime