Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Astronomer Astronomer cosmos Local dbt DAG rendering
- Workflow:OpenRLHF OpenRLHF Reward Model Training
- Workflow:NVIDIA NeMo Aligner RLHF PPO Training
- Workflow:Kubeflow Kubeflow Platform Deployment
- Workflow:Eventual Inc Daft Distributed UDF Processing
- Workflow:Astronomer Astronomer cosmos Kubernetes dbt execution
- Workflow:Volcengine Verl GRPO Training Pipeline
- Workflow:Ggml org Ggml GPT2 Text Generation
- Workflow:Datahub project Datahub Python Metadata Emission
- Workflow:Bentoml BentoML Model Store Management
Principles
- Principle:Ucbepic Docetl LLM Powered Text Extraction
- Principle:Alibaba MNN Weight Quantization
- Principle:Norrrrrrr lyn WAInjectBench Ensemble Aggregation Text
- Principle:Langfuse Langfuse Post Ingestion Side Effects
- Principle:Cleanlab Cleanlab Clean Model Inference
- Principle:Langgenius Dify Dataset Creation
- Principle:HKUDS AI Trader Benchmark Index Analysis
- Principle:Danijar Dreamerv3 Distributed Logger Aggregation
- Principle:Protectai Llm guard Programming Language Detection
- Principle:Cleanlab Cleanlab Coteaching Algorithm
Implementations
- Implementation:OpenGVLab InternVL Evaluate Sh
- Implementation:Sktime Pytorch forecasting Encoder
- Implementation:ArroyoSystems Arroyo Nexmark Operator
- Implementation:ClickHouse ClickHouse Hex Encoding
- Implementation:Google deepmind Mujoco Render GL2 Header
- Implementation:Google deepmind Dm control MJCF Export
- Implementation:Openai Openai python Compacted Response
- Implementation:Ollama Ollama MLXRunner Fast
- Implementation:Heibaiying BigData Notes StreamExecutionEnvironment GetExecutionEnvironment
- Implementation:Google deepmind Mujoco WASM Bindings
Heuristics
- Heuristic:OpenBMB UltraFeedback GPU Memory Utilization
- Heuristic:SeleniumHQ Selenium Thread Safety With ThreadGuard
- Heuristic:Langgenius Dify Gevent Monkey Patching Order
- Heuristic:Sgl project Sglang Attention Backend Selection
- Heuristic:Getgauge Taiko Browser Launch Flags
- Heuristic:Apache Kafka Unknown Record Type Upgrade Safety
- Heuristic:Huggingface Peft LoRA Initialization Strategy Selection
- Heuristic:Microsoft Agent framework Declaration Only Tools Pattern
- Heuristic:Cleanlab Cleanlab Multiprocessing Platform Strategy
- Heuristic:Ray project Ray Graceful Shutdown Timing
Environments
- Environment:Ucbepic Docetl Python Runtime
- Environment:Unslothai Unsloth CUDA BitsAndBytes
- Environment:Langchain ai Langchain LangSmith Tracing Config
- Environment:Googleapis Python genai Python 3 10 SDK Runtime
- Environment:Puppeteer Puppeteer Cross Platform Browser Environment
- Environment:Langfuse Langfuse Redis 7 Queue Cache
- Environment:Mistralai Client python GCP Deployment Environment
- Environment:Huggingface Open r1 Slurm Cluster
- Environment:LLMBook zh LLMBook zh github io Data Processing Environment
- Environment:Ray project Ray Java Build Environment