Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Kafka Release Candidate Staging
- Workflow:Microsoft Agent framework Multi Agent Concurrent Orchestration
- Workflow:Apache Beam Local Pipeline Execution
- Workflow:InternLM Lmdeploy LLM Offline Batch Inference
- Workflow:Alibaba MNN Stable Diffusion Deployment
- Workflow:Hiyouga LLaMA Factory DPO Preference Alignment
- Workflow:Protectai Llm guard LLM Input Output Scanning
- Workflow:Axolotl ai cloud Axolotl Full Finetuning Distributed
- Workflow:EvolvingLMMs Lab Lmms eval End to End Evaluation
- Workflow:Obss Sahi COCO Evaluation
Principles
- Principle:Ggml org Llama cpp Input Provenance Tracking
- Principle:Sktime Pytorch forecasting Samformer Architecture
- Principle:Kserve Kserve Canary Traffic Splitting
- Principle:LLMBook zh LLMBook zh github io Rotary Position Embedding
- Principle:Fastai Fastbook Language Model Fine Tuning
- Principle:Dotnet Machinelearning SSA Model Fitting
- Principle:Apache Druid Dimension Measure Configuration
- Principle:Huggingface Diffusers Memory Optimization
- Principle:Bentoml BentoML Deployment Termination
- Principle:Marker Inc Korea AutoRAG Evaluator Initialization
Implementations
- Implementation:Alibaba MNN SourceModule
- Implementation:Google deepmind Mujoco mjr readPixels
- Implementation:OpenRLHF OpenRLHF Blending datasets
- Implementation:FlagOpen FlagEmbedding MLVU Topic Reasoning Data
- Implementation:Ollama Ollama Imagegen MLX C
- Implementation:Apache Druid DruidSqlAceMode
- Implementation:Openai Openai node Browser Webpack Lockfile
- Implementation:Vespa engine Vespa Publish Artifacts Sh
- Implementation:Microsoft Semantic kernel AddOpenAIChatClient
- Implementation:Kserve Kserve Triton Runtime
Heuristics
- Heuristic:NVIDIA TransformerEngine FP8 Recipe Auto Selection
- Heuristic:Bigscience workshop Petals Prompt Embeddings Float32 Precision
- Heuristic:Unslothai Unsloth VLLM Memory Utilization
- Heuristic:Huggingface Peft DoRA Inference Caching
- Heuristic:Turboderp org Exllamav2 Paged Cache Configuration
- Heuristic:Unstructured IO Unstructured Strategy Fallback Chain
- Heuristic:VainF Torch Pruning Pruning Ratio vs Parameter Ratio
- Heuristic:ARISE Initiative Robosuite Observation Key Selection
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:Infiniflow Ragflow Agent Max Rounds Strategy
Environments
- Environment:LLMBook zh LLMBook zh github io Data Processing Environment
- Environment:Marker Inc Korea AutoRAG Korean NLP Dependencies
- Environment:Sgl project Sglang CUDA Runtime
- Environment:Huggingface Optimum Accelerated Inference Environment
- Environment:Huggingface Datatrove Slurm Cluster Environment
- Environment:Huggingface Alignment handbook BitsAndBytes CUDA
- Environment:Langfuse Langfuse Redis 7 Queue Cache
- Environment:Speechbrain Speechbrain Multi GPU DDP
- Environment:Triton inference server Server GPU CUDA Runtime
- Environment:Mlfoundations Open flamingo PyTorch CUDA Distributed