Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Teamcapybara Capybara Form Testing
- Workflow:Datajuicer Data juicer Distributed Ray Processing
- Workflow:Langgenius Dify Knowledge Base Management
- Workflow:Microsoft Autogen Swarm Agent Handoff
- Workflow:Rapidsai Cuml Random Forest Training And Inference
- Workflow:Hiyouga LLaMA Factory DPO Preference Alignment
- Workflow:Diagram of thought Diagram of thought DoT Iterative Reasoning
- Workflow:Pola rs Polars Data IO and Format Conversion
- Workflow:PeterL1n BackgroundMattingV2 Video matting inference
- Workflow:Langchain ai Langchain Vector Store Operations
Principles
- Principle:Alibaba ROLL Diffusion Worker Initialization
- Principle:Ggml org Ggml GGUF Tensor Serialization
- Principle:ClickHouse ClickHouse Stateless Test Execution
- Principle:Huggingface Datatrove Pipeline Type Constants
- Principle:Google deepmind Dm control Rendering Backend Configuration
- Principle:Eventual Inc Daft Data Ingestion Parquet
- Principle:EvolvingLMMs Lab Lmms eval Task Selection
- Principle:OpenBMB UltraFeedback Score Validation and Correction
- Principle:Googleapis Python genai Cache Management
- Principle:Avdvg InjectGuard Evaluation And Metrics
Implementations
- Implementation:Helicone Helicone Filters
- Implementation:Haosulab ManiSkill MP Solutions Pattern
- Implementation:Mage ai Mage ai Google Cloud Storage Source
- Implementation:Mistralai Client python GCP Models Init
- Implementation:Ggml org Llama cpp Unicode
- Implementation:BerriAI Litellm Get Model Cost Map
- Implementation:Avdvg InjectGuard CSVLoader Load
- Implementation:Tensorflow Tfjs Recurrent Layers
- Implementation:Google deepmind Mujoco Studio App
- Implementation:EvolvingLMMs Lab Lmms eval Flickr30k Utils
Heuristics
- Heuristic:OpenGVLab InternVL Gradient Checkpointing Memory
- Heuristic:Duckdb Duckdb Build Parallelism Tuning
- Heuristic:Turboderp org Exllamav2 Memory Optimization Techniques
- Heuristic:Microsoft Semantic kernel Experimental Feature Opt In
- Heuristic:OWASP Www project top 10 for large language model applications Sandbox Containerization Pattern
- Heuristic:Facebookresearch Audiocraft Audio Normalization Strategies
- Heuristic:Huggingface Open r1 Test Batch Early Termination
- Heuristic:NVIDIA NeMo Aligner Adam State Offloading Tip
- Heuristic:Openai Evals Event Batching Configuration
- Heuristic:Zai org CogVideo Frame Count and Resolution Constraints
Environments
- Environment:VainF Torch Pruning CUDA GPU Benchmarking
- Environment:Ollama Ollama CGo Runtime
- Environment:Mistralai Client python GCP Deployment Environment
- Environment:Infiniflow Ragflow Data Source Credentials
- Environment:HKUDS AI Trader MCP Services
- Environment:SeldonIO Seldon core Kafka Messaging Environment
- Environment:Huggingface Diffusers Quantization Environment
- Environment:Mbzuai oryx Awesome LLM Post training Git CLI
- Environment:Truera Trulens OpenAI Provider Environment
- Environment:Apache Dolphinscheduler ZooKeeper Registry