Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:ContextualAI HALOs Reward Model Training
- Workflow:OpenRLHF OpenRLHF DPO Training
- Workflow:Apache Hudi Flink Schema Evolution
- Workflow:LLMBook zh LLMBook zh github io DPO Alignment
- Workflow:Ray project Ray Remote Task Execution
- Workflow:Bigscience workshop Petals Prompt Tuning Classification
- Workflow:Openai Openai node Structured Output Parsing
- Workflow:Vllm project Vllm Multi LoRA Serving
- Workflow:Scikit learn contrib Imbalanced learn SMOTE Resampling Pipeline
- Workflow:Microsoft LoRA LoRA Integration
Principles
- Principle:Speechbrain Speechbrain Noisy Speech Data Preparation
- Principle:Arize ai Phoenix Experiment Result Analysis
- Principle:Lm sys FastChat Condensed Rotary Embedding
- Principle:Huggingface Trl DPO Preference Dataset Loading
- Principle:Spotify Luigi External API Integration
- Principle:MarketSquare Robotframework browser Plugin Loading Mechanism
- Principle:Online ml River Estimator Base Architecture
- Principle:Trailofbits Fickling Pickle VM Tracing
- Principle:Openclaw Openclaw Channel Setup
- Principle:Nightwatchjs Nightwatch Test Execution
Implementations
- Implementation:LaurentMazare Tch rs Torch Api H
- Implementation:Openai CLIP LogisticRegression Wrapper
- Implementation:Bentoml BentoML Testing Server
- Implementation:Run llama Llama index RagDatasetGenerator
- Implementation:Mlc ai Mlc llm Attention Op
- Implementation:Ollama Ollama Imagegen Cache TeaCache
- Implementation:Anthropics Anthropic sdk python Package Init
- Implementation:ArroyoSystems Arroyo Sse Connector
- Implementation:Ucbepic Docetl Pipeline Optimize
- Implementation:MaterializeInc Materialize Optbench SQL Module
Heuristics
- Heuristic:OpenBMB UltraFeedback API Retry Strategy
- Heuristic:Vibrantlabsai Ragas Nest Asyncio Uvloop Compatibility
- Heuristic:Volcengine Verl Layered Summon Memory Tradeoff
- Heuristic:Fede1024 Rust rdkafka Queue Buffering Priority
- Heuristic:Marker Inc Korea AutoRAG Hybrid Retrieval Score Normalization
- Heuristic:Explodinggradients Ragas LLM Temperature Defaults
- Heuristic:AUTOMATIC1111 Stable diffusion webui Cross Attention Memory Slicing
- Heuristic:Mbzuai oryx Awesome LLM Post training Checkpoint Every 3 Papers
- Heuristic:Apache Flink False Positive Availability Optimization
- Heuristic:Obss Sahi Auto Slice Resolution
Environments
- Environment:Google deepmind Dm control OSMesa Software Rendering
- Environment:Arize ai Phoenix Frontend Node 22
- Environment:Vllm project Vllm Distributed
- Environment:Allenai Open instruct CUDA GPU Training
- Environment:VainF Torch Pruning PyTorch Python Core
- Environment:Webdriverio Webdriverio Browser Driver Environment
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:Huggingface Datatrove S3 Storage Environment
- Environment:FMInference FlexLLMGen CUDA GPU
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime