Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ggml org Ggml MNIST Training And Evaluation
- Workflow:Vllm project Vllm OpenAI Compatible Serving
- Workflow:Ucbepic Docetl Python API Pipeline
- Workflow:Huggingface Peft LoRA Causal LM Finetuning
- Workflow:Hpcaitech ColossalAI DPO Alignment
- Workflow:Dagster io Dagster Bluesky Analytics
- Workflow:Anthropics Anthropic sdk python Extended Thinking Reasoning
- Workflow:Mlfoundations Open flamingo Few Shot Evaluation
- Workflow:Googleapis Python genai Context Caching
- Workflow:Vllm project Vllm Speculative Decoding
Principles
- Principle:Mbzuai oryx Awesome LLM Post training Paper Categorization
- Principle:Ggml org Llama cpp Conversion Environment Setup
- Principle:Spcl Graph of thoughts Ground Truth Evaluation
- Principle:Triton inference server Server Ensemble Configuration
- Principle:Huggingface Datatrove Bloom Filter Deduplication
- Principle:Neuml Txtai Agent Execution
- Principle:Lance format Lance Approximate Nearest Neighbor Search
- Principle:FMInference FlexLLMGen Distributed Pipeline Parallel Inference
- Principle:Confident ai Deepeval Automated Changelog Generation
- Principle:Apache Flink Congestion Control Rate Limiting
Implementations
- Implementation:Hpcaitech ColossalAI RewardModelTrainer
- Implementation:Datajuicer Data juicer DiversityAnalysis
- Implementation:Lm sys FastChat Split Long Conversation
- Implementation:Vespa engine Vespa Bootstrap Cmake Sh
- Implementation:Datahub project Datahub Run Quickstart Preflight Checks
- Implementation:HKUDS AI Trader Get Daily Price Crypto
- Implementation:Datahub project Datahub ProtobufExtensionUtil
- Implementation:PeterL1n BackgroundMattingV2 Torch onnx export
- Implementation:ArroyoSystems Arroyo Join Planner
- Implementation:DevExpress Testcafe TestFileParserBase
Heuristics
- Heuristic:Cleanlab Cleanlab Object Detection Scoring Constants
- Heuristic:Huggingface Trl DeepSpeed ZeRO3 Generation Tradeoff
- Heuristic:Vespa engine Vespa Warning Deprecated Cloud API Constructors
- Heuristic:Apache Airflow Memory Management Tips
- Heuristic:CarperAI Trlx KL Coefficient Adaptation
- Heuristic:Heibaiying BigData Notes Hive ORC Parquet Storage Tip
- Heuristic:Apache Beam GC Thrashing Detection
- Heuristic:Microsoft Onnxruntime Convergence Debugging Tips
- Heuristic:Farama Foundation Gymnasium Sync Vs Async VectorEnv Selection
- Heuristic:Cleanlab Cleanlab Multiprocessing Platform Strategy
Environments
- Environment:AUTOMATIC1111 Stable diffusion webui Xformers Attention
- Environment:Apache Dolphinscheduler ZooKeeper Registry
- Environment:DataExpert io Data engineer handbook PostgreSQL Docker Environment
- Environment:Huggingface Datasets Image Dependencies
- Environment:Run llama Llama index OpenAI API Configuration
- Environment:DevExpress Testcafe Node Runtime
- Environment:Nautechsystems Nautilus trader Databento API Credentials
- Environment:Testtimescaling Testtimescaling github io Python 3 Runtime
- Environment:Snorkel team Snorkel PyTorch
- Environment:Eventual Inc Daft AI Provider Dependencies