Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Wandb Weave LLM Integration Tracing
- Workflow:Lucidrains X transformers DPO Preference Alignment
- Workflow:Google research Deduplicate text datasets Suffix array querying
- Workflow:PeterL1n BackgroundMattingV2 Training pipeline
- Workflow:Huggingface Transformers Pipeline Inference
- Workflow:Fede1024 Rust rdkafka At Least Once Processing
- Workflow:LLMBook zh LLMBook zh github io Inference and Quantization
- Workflow:Guardrails ai Guardrails Server Deployment
- Workflow:ARISE Initiative Robomimic Trained Policy Evaluation
- Workflow:Sgl project Sglang ModelOpt Quantization And Export
Principles
- Principle:Scikit learn contrib Imbalanced learn Condensed Nearest Neighbour
- Principle:Spcl Graph of thoughts Result Serialization
- Principle:Arize ai Phoenix Span Annotation
- Principle:ARISE Initiative Robosuite Gripper Control Abstraction
- Principle:Intel Ipex llm LLM Initialization LangChain
- Principle:Nautechsystems Nautilus trader Catalog Data Writing
- Principle:Onnx Onnx External Data Saving
- Principle:OpenGVLab InternVL Image Transform Pipeline
- Principle:Bigscience workshop Petals Data Preparation
- Principle:OpenHands OpenHands Middleware Configuration
Implementations
- Implementation:Hpcaitech ColossalAI DetachedTrainer
- Implementation:Speechbrain Speechbrain Prepare Switchboard Root
- Implementation:Haosulab ManiSkill RoboCasaAccessories
- Implementation:FlagOpen FlagEmbedding LLM Embedder Retrieval Args
- Implementation:AUTOMATIC1111 Stable diffusion webui Image Viewer
- Implementation:Tensorflow Serving Bundle Factory Util
- Implementation:Ray project Ray BaseTaskCaller
- Implementation:Openclaw Openclaw Chrome Extension Background
- Implementation:MarketSquare Robotframework browser Wait For Navigation
- Implementation:PeterL1n BackgroundMattingV2 ImageSequenceWriter
Heuristics
- Heuristic:Allenai Open instruct BFloat16 Training
- Heuristic:Zai org CogVideo Decord Import Order Bug
- Heuristic:Scikit learn Scikit learn Data Leakage Prevention
- Heuristic:Ray project Ray NaN Score Filtering In PBT
- Heuristic:Haotian liu LLaVA Flash Attention GPU Requirement
- Heuristic:Infiniflow Ragflow Hybrid Search Fallback Strategy
- Heuristic:Vllm project Vllm GPU Memory Utilization Tuning
- Heuristic:Treeverse LakeFS Batch Delay Tuning
- Heuristic:LLMBook zh LLMBook zh github io DPO Beta Hyperparameter
- Heuristic:Apache Hudi Record Level Index Optimization
Environments
- Environment:Dotnet Machinelearning Native Build Toolchain
- Environment:OpenGVLab InternVL PyTorch CUDA
- Environment:Kserve Kserve VLLM Runtime
- Environment:Onnx Onnx Cpp Build Environment
- Environment:Apache Spark Kubernetes Runtime
- Environment:Spotify Luigi Python Runtime
- Environment:Nautechsystems Nautilus trader Databento API Credentials
- Environment:Apache Druid Integration Test Docker
- Environment:CARLA simulator Carla Python API Runtime
- Environment:Junyanz Pytorch CycleGAN and pix2pix Python PyTorch Runtime