Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DataTalksClub Data engineering zoomcamp Spark Batch Processing
- Workflow:Kornia Kornia Image Feature Matching
- Workflow:Scikit learn contrib Imbalanced learn SMOTE Resampling Pipeline
- Workflow:Microsoft Semantic kernel Kernel Setup And Chat Completion
- Workflow:Openai Openai python Responses API Text Generation
- Workflow:Lucidrains X transformers DPO Preference Alignment
- Workflow:Junyanz Pytorch CycleGAN and pix2pix Pretrained Inference
- Workflow:OpenRLHF OpenRLHF Reward Model Training
- Workflow:Openclaw Openclaw Channel Connection
- Workflow:Anthropics Anthropic sdk python Basic Message Conversation
Principles
- Principle:Apache Flink File Connector Table Integration
- Principle:Facebookresearch Habitat lab Task Dataset Selection
- Principle:Protectai Modelscan Model Security Scanning
- Principle:Huggingface Datatrove Exact Substring Deduplication
- Principle:Ollama Ollama CLIStartup
- Principle:Ggml org Llama cpp Public C API
- Principle:Sktime Pytorch forecasting Tensor Utilities
- Principle:Apache Shardingsphere Federation Metadata Refresh
- Principle:Apache Paimon Global Index Scan Building
- Principle:Arize ai Phoenix Evaluator Design
Implementations
- Implementation:Google deepmind Mujoco Engine Init
- Implementation:OpenHands OpenHands LinearManager
- Implementation:Eventual Inc Daft Daft Sql
- Implementation:Apache Dolphinscheduler AbstractDataSourceProcessor Extension
- Implementation:Speechbrain Speechbrain Eval ESC50 Interpret
- Implementation:SeleniumHQ Selenium Closure Dom Forms
- Implementation:Lance format Lance StructEncoding
- Implementation:OpenBMB UltraFeedback Multi Backend Inference
- Implementation:Apache Shardingsphere ClusterContextManagerBuilder Build
- Implementation:Togethercomputer Together python Together Client Init
Heuristics
- Heuristic:Apache Flink Bin Packing Complexity Guard
- Heuristic:Testtimescaling Testtimescaling github io Persist Credentials False
- Heuristic:Helicone Helicone Rate Limiting Fail Open
- Heuristic:Huggingface Alignment handbook Global Batch Size Scaling
- Heuristic:Haotian liu LLaVA Flash Attention GPU Requirement
- Heuristic:Obss Sahi Match Threshold Tuning
- Heuristic:Apache Druid Query Error Suggestion Patterns
- Heuristic:ChenghaoMou Text dedup Suffix Array Merge Strategy
- Heuristic:Cypress io Cypress Timeout Tuning
- Heuristic:Hiyouga LLaMA Factory Gradient Checkpointing Memory Optimization
Environments
- Environment:Vllm project Vllm GitHub
- Environment:Google deepmind Dm control OSMesa Software Rendering
- Environment:Guardrails ai Guardrails Python 3 10 Runtime
- Environment:Treeverse LakeFS LakeFS Server Environment
- Environment:Facebookresearch Audiocraft Python PyTorch CUDA Environment
- Environment:Langgenius Dify Vector Database Environment
- Environment:Google deepmind Mujoco Python Bindings Environment
- Environment:Mlfoundations Open flamingo PyTorch CUDA Distributed
- Environment:ThreeSR Awesome Inference Time Scaling Python Runtime Environment
- Environment:Getgauge Taiko Node Runtime