Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Spark Standalone Cluster Deployment
- Workflow:Huggingface Peft Seq2Seq AdaLoRA Finetuning
- Workflow:PacktPublishing LLM Engineers Handbook Digital Data ETL
- Workflow:Openai Openai node Streaming To Client
- Workflow:VainF Torch Pruning Object Detection Pruning
- Workflow:Microsoft Semantic kernel Vector Store RAG Pipeline
- Workflow:ARISE Initiative Robosuite Teleoperation
- Workflow:Facebookresearch Audiocraft Model Export And Deployment
- Workflow:Huggingface Open r1 Dataset Pass Rate Filtering
- Workflow:Hiyouga LLaMA Factory Model Inference and Serving
Principles
- Principle:Treeverse LakeFS Import Source Preparation
- Principle:Alibaba ROLL Agentic Advantage Estimation
- Principle:Microsoft Onnxruntime On Device Training Loop
- Principle:Danijar Dreamerv3 Data Collection And Training
- Principle:Scikit learn Scikit learn Regression Metrics
- Principle:Bigscience workshop Petals Prompt Tuning
- Principle:Microsoft Agent framework Human in the Loop Request
- Principle:Predibase Lorax Adapter Merge Strategies
- Principle:Langchain ai Langgraph Edge Configuration
- Principle:Scikit learn contrib Imbalanced learn Combined Over Under Sampling
Implementations
- Implementation:BerriAI Litellm Sensitive Data Masker
- Implementation:Haosulab ManiSkill Pose
- Implementation:Mit han lab Llm awq NVILA Benchmark
- Implementation:SeldonIO Seldon core Transformers Pipeline Save Pretrained
- Implementation:Openai Openai agents python ItemHelpers
- Implementation:AUTOMATIC1111 Stable diffusion webui Hypertile Optimization
- Implementation:Apache Druid Kubernetes Operator Main
- Implementation:EvolvingLMMs Lab Lmms eval SciBench Utils
- Implementation:Tensorflow Serving StreamingBatchScheduler
- Implementation:Teamcapybara Capybara Capybara Add Selector
Heuristics
- Heuristic:EvolvingLMMs Lab Lmms eval Request Caching Strategy
- Heuristic:SqueezeAILab ETS Thread Parallelism Suppression
- Heuristic:TA Lib Ta lib python NaN Propagation Behavior
- Heuristic:Intel Ipex llm Use Cache Training Vs Inference
- Heuristic:Mage ai Mage ai Sorted Data Bookmark Strategy
- Heuristic:Liu00222 Open Prompt Injection PPL Threshold Tuning
- Heuristic:BerriAI Litellm Batch Size Flush Interval Tuning
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:BerriAI Litellm Cooldown Threshold Tuning
- Heuristic:Snorkel team Snorkel Precision Init Prior
Environments
- Environment:Lucidrains X transformers Python Environment
- Environment:Onnx Onnx Python Runtime Environment
- Environment:Zai org CogVideo Diffusers Finetuning Environment
- Environment:FlagOpen FlagEmbedding GPU Accelerator Environment
- Environment:Gretelai Gretel synthetics PyTorch CUDA Environment
- Environment:OpenBMB UltraFeedback vLLM Multi GPU Environment
- Environment:CARLA simulator Carla Simulation Runtime
- Environment:TobikoData Sqlmesh Snowflake Connection
- Environment:Bentoml BentoML Triton Inference Server
- Environment:NVIDIA TransformerEngine CUDA Toolkit Requirements