Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Togethercomputer Together python Fine Tuning
- Workflow:Duckdb Duckdb Benchmark Execution
- Workflow:MarketSquare Robotframework browser API Testing via Browser
- Workflow:PeterL1n BackgroundMattingV2 Model export
- Workflow:Tencent Ncnn Object Detection Inference
- Workflow:Mlflow Mlflow Model Serving
- Workflow:Marker Inc Korea AutoRAG Pipeline Deployment
- Workflow:Neuml Txtai API Deployment
- Workflow:Openai Evals Running an eval set
- Workflow:Pytorch Serve LLM Deployment vLLM
Principles
- Principle:BerriAI Litellm Integration Selection
- Principle:ContextualAI HALOs Feedback Labeling
- Principle:Marker Inc Korea AutoRAG Query Generation
- Principle:Isaac sim IsaacGymEnvs Checkpoint Export and Logging
- Principle:NVIDIA DALI Custom Operator Build System
- Principle:Scikit learn contrib Imbalanced learn Cluster Centroid Under Sampling
- Principle:Turboderp org Exllamav2 Bit Allocation Optimization
- Principle:NVIDIA NeMo Curator File Partitioning
- Principle:Huggingface Datasets Pandas Conversion
- Principle:Googleapis Python genai Client Initialization
Implementations
- Implementation:OpenRLHF OpenRLHF DeepspeedStrategy save model
- Implementation:Run llama Llama index BaseSelector
- Implementation:Huggingface Trl PPOTrainer Train
- Implementation:OpenGVLab InternVL LLaVA Model Worker
- Implementation:InternLM Lmdeploy Serve Proxy
- Implementation:Ucbepic Docetl BaseSchemas
- Implementation:Openai Openai python CLI Entry Point
- Implementation:Google deepmind Dm control TCP Initializer
- Implementation:Datajuicer Data juicer ImageAestheticsFilter
- Implementation:Langchain ai Langchain FireworksLLM
Heuristics
- Heuristic:Openai Evals Event Batching Configuration
- Heuristic:Mbzuai oryx Awesome LLM Post training Depth Limit Recursion At 2
- Heuristic:Openclaw Openclaw Cache TTL Asymmetric Strategy
- Heuristic:AnswerDotAI RAGatouille Searcher Configuration By Collection Size
- Heuristic:Axolotl ai cloud Axolotl Memory Optimization Tips
- Heuristic:NVIDIA NeMo Curator GPU Memory Resource Allocation
- Heuristic:Alibaba MNN Weight Quantization Strategy
- Heuristic:Google research Deduplicate text datasets Ulimit File Descriptors For Merge
- Heuristic:Microsoft BIPIA Torch Compile Platform Guard
- Heuristic:Scikit learn contrib Imbalanced learn KNeighbors Selection Tips
Environments
- Environment:ChenghaoMou Text dedup Python 3 12 Environment
- Environment:Google research Deduplicate text datasets Python HuggingFace Environment
- Environment:Duckdb Duckdb Release Publishing Env
- Environment:Ucbepic Docetl Python Runtime
- Environment:TA Lib Ta lib python Python Build Environment
- Environment:Junyanz Pytorch CycleGAN and pix2pix DDP Multi GPU
- Environment:Kornia Kornia CUDA GPU Environment
- Environment:Mbzuai oryx Awesome LLM Post training Python Requests
- Environment:Sail sg LongSpec Inference Environment
- Environment:Duckdb Duckdb Code Generation Tools