Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search across ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Dolphinscheduler Datasource Plugin Development
- Workflow:Hiyouga LLaMA Factory LoRA SFT Finetuning
- Workflow:NVIDIA TransformerEngine Accelerate HF Gemma With TE
- Workflow:Obss Sahi COCO Evaluation
- Workflow:Isaac sim IsaacGymEnvs Factory Assembly Training
- Workflow:Unstructured IO Unstructured Performance Profiling
- Workflow:Mage ai Mage ai API Source Extraction
- Workflow:Heibaiying BigData Notes Storm Topology Development
- Workflow:Hiyouga LLaMA Factory PPO RLHF Training
- Workflow:Onnx Onnx External Data Handling
Principles
- Principle:Ggml org Ggml Tensor Context Management
- Principle:Pyro ppl Pyro Custom Distribution Framework
- Principle:Explodinggradients Ragas Evaluation Dataset Preparation
- Principle:Ggml org Ggml Neural Network Graph Building
- Principle:Datahub project Datahub Emitter Initialization
- Principle:Triton inference server Server Ensemble Configuration
- Principle:Facebookresearch Audiocraft Audio Tokenizer Selection
- Principle:Mlc ai Web llm Cross Thread Streaming
- Principle:AUTOMATIC1111 Stable diffusion webui Extra Networks Framework
- Principle:Google deepmind Dm control Locomotion Visualization
Implementations
- Implementation:Recommenders team Recommenders K8s Utils
- Implementation:Openai Openai node Ecosystem CLI
- Implementation:Volcengine Verl Compute Value Loss
- Implementation:Spotify Luigi DataprocTask
- Implementation:DataTalksClub Data engineering zoomcamp Spark UnionAll
- Implementation:Ggml org Llama cpp Llama Memory Seq Pos Max
- Implementation:Hiyouga LLaMA Factory WebUI Export Component
- Implementation:Kubeflow Pipelines Profile Controller Sync
- Implementation:Langgenius Dify SendCompletionMessage
- Implementation:Risingwavelabs Risingwave Explain DistSQL Page
Heuristics
- Heuristic:Promptfoo Promptfoo Adaptive Concurrency Tuning
- Heuristic:Huggingface Datasets Cache Fingerprinting Tips
- Heuristic:Microsoft BIPIA OpenAI Rate Limit Retry
- Heuristic:Nautechsystems Nautilus trader Streaming Mode For Large Backtests
- Heuristic:Obss Sahi Class Agnostic vs Per Class NMS
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:HKUDS AI Trader DeepSeek Tool Args Workaround
- Heuristic:Huggingface Alignment handbook Global Batch Size Scaling
- Heuristic:Datajuicer Data juicer Checkpoint Resumption Strategy
- Heuristic:Princeton nlp Tree of thought llm Global State Token Counting
Environments
- Environment:Intel Ipex llm vLLM XPU Serving Environment
- Environment:BerriAI Litellm Provider API Credentials
- Environment:Sdv dev SDV GPU CUDA Support
- Environment:Vllm project Vllm CUDA Hopper
- Environment:Sktime Pytorch forecasting Core Python Dependencies
- Environment:Vllm project Vllm Environment Variables
- Environment:Isaac sim IsaacGymEnvs Pip Dependencies
- Environment:Apache Shardingsphere Etcd Cluster Coordination
- Environment:Treeverse LakeFS Web UI Environment
- Environment:Marker Inc Korea AutoRAG API Keys Configuration