Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:AnswerDotAI RAGatouille In Memory Retrieval
- Workflow:ArroyoSystems Arroyo SQL Pipeline Lifecycle
- Workflow:Explodinggradients Ragas RAG Evaluation
- Workflow:Recommenders team Recommenders News Recommendation NRMS
- Workflow:Sktime Pytorch forecasting TFT Hyperparameter Optimization
- Workflow:Openai Openai node Chat Completion
- Workflow:Hpcaitech ColossalAI DPO Alignment
- Workflow:Allenai Open instruct GRPO Reinforcement Learning
- Workflow:Cohere ai Cohere python Chat Completion
- Workflow:Allenai Open instruct Reward Model Training
Principles
- Principle:DataTalksClub Data engineering zoomcamp Dlt Pipeline Execution
- Principle:ClickHouse ClickHouse HTTP Authentication
- Principle:Huggingface Alignment handbook Direct Preference Optimization
- Principle:PacktPublishing LLM Engineers Handbook SageMaker Training Orchestration
- Principle:DistrictDataLabs Yellowbrick Text Feature Visualization
- Principle:Princeton nlp Tree of thought llm Task Instantiation
- Principle:Microsoft Onnxruntime Model Conversion to ONNX
- Principle:Apache Kafka Broker Logging Configuration
- Principle:Google deepmind Mujoco Scene Rendering
- Principle:Iterative Dvc Plot Definition Collection
Implementations
- Implementation:Kornia Kornia Unsharp Mask
- Implementation:Alibaba MNN FlatBuffers IDL Gen Go
- Implementation:Vibrantlabsai Ragas Tokenizers
- Implementation:Iterative Dvc Index Graph Build
- Implementation:Online ml River Neighbors LazySearch
- Implementation:Langgenius Dify Use Explore
- Implementation:Norrrrrrr lyn WAInjectBench Validation TPR Selection
- Implementation:LaurentMazare Tch rs Stable Diffusion Pipeline
- Implementation:Ucbepic Docetl Directive OperatorFusion
- Implementation:Ollama Ollama Imagegen CLI
Heuristics
- Heuristic:Kubeflow Pipelines Resource Sizing For Components
- Heuristic:ContextualAI HALOs FSDP Sampling Workaround
- Heuristic:Microsoft Autogen Warning Deprecated JSON Env Files
- Heuristic:Ray project Ray Graceful Shutdown Timing
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:Recommenders team Recommenders TensorFlow Session Ordering
- Heuristic:Confident ai Deepeval Async Concurrency Tuning
- Heuristic:Onnx Onnx Protobuf 2GB Limit Workaround
- Heuristic:Bentoml BentoML Adaptive Batching Tuning
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Batch Size One Default
Environments
- Environment:Scikit learn Scikit learn Python Runtime Environment
- Environment:Princeton nlp SimPO VLLM Inference
- Environment:Nightwatchjs Nightwatch Node 18 Runtime
- Environment:Vllm project Vllm Python
- Environment:NVIDIA NeMo Aligner PyTriton Serving Environment
- Environment:Pyro ppl Pyro Python PyTorch Core
- Environment:Kserve Kserve SRIOV RDMA Network
- Environment:Mbzuai oryx Awesome LLM Post training Python Matplotlib
- Environment:Testtimescaling Testtimescaling github io Python 3 Runtime
- Environment:Datahub project Datahub Python Ingestion