Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Wandb Weave LLM Integration Tracing
- Workflow:Treeverse LakeFS Data Version Control With Branches
- Workflow:Gretelai Gretel synthetics DataFrame Batch Synthesis
- Workflow:OpenRLHF OpenRLHF Math Reasoning Training
- Workflow:Volcengine Verl Vision Language Model RL Training
- Workflow:Kserve Kserve LLM Disaggregated Serving
- Workflow:Explodinggradients Ragas Metric Prompt Optimization
- Workflow:Apache Beam Portable Pipeline Submission
- Workflow:Microsoft DeepSpeedExamples RLHF Training Pipeline
- Workflow:TA Lib Ta lib python Installation And Setup
Principles
- Principle:Puppeteer Puppeteer Cross Browser Configuration
- Principle:Apache Shardingsphere Active Version Switching
- Principle:SeldonIO Seldon core Explanation Generation
- Principle:Mlflow Mlflow Trace Destination Configuration
- Principle:HKUDS AI Trader LangChain Agent Initialization
- Principle:Huggingface Datasets Hub Dataset Deletion
- Principle:Apache Airflow DAG Deployment
- Principle:Nautechsystems Nautilus trader Execution Reconciliation
- Principle:Trailofbits Fickling Analysis Result Serialization
- Principle:Heibaiying BigData Notes MapReduce Reduce Phase
Implementations
- Implementation:Deepset ai Haystack SentenceTransformersDocumentEmbedder
- Implementation:TobikoData Sqlmesh ModelColumns
- Implementation:Openai Openai node Eval Run OutputItems
- Implementation:Kornia Kornia Lovasz Softmax Loss
- Implementation:Apache Druid QueryTab ProcessQuery
- Implementation:Open compass VLMEvalKit DREAM
- Implementation:Scikit learn Scikit learn Cross Val Predict
- Implementation:Alibaba MNN Diffusion Demo CLI
- Implementation:Ggml org Llama cpp Common Init From Params Target
- Implementation:Lm sys FastChat Clean ShareGPT
Heuristics
- Heuristic:NVIDIA NeMo Curator GPU Memory Resource Allocation
- Heuristic:Ray project Ray Autoscaling Delay Tuning
- Heuristic:Vespa engine Vespa KStemmer Dictionary Loading
- Heuristic:LLMBook zh LLMBook zh github io BF16 Mixed Precision Default
- Heuristic:Mistralai Client python Resource Context Manager
- Heuristic:Mlfoundations Open flamingo RICES Feature Caching
- Heuristic:Deepspeedai DeepSpeed Vocabulary Tensor Core Alignment
- Heuristic:Apache Spark Partition Sizing Tips
- Heuristic:OpenBMB UltraFeedback API Retry Strategy
- Heuristic:Cypress io Cypress Browser Version Workarounds
Environments
- Environment:Heibaiying BigData Notes HBase Environment
- Environment:Dotnet Machinelearning OneDal Acceleration
- Environment:PacktPublishing LLM Engineers Handbook Unsloth Finetuning Environment
- Environment:Mlflow Mlflow MLflow Server Environment
- Environment:Ggml org Llama cpp Vulkan GPU Environment
- Environment:OWASP Www project top 10 for large language model applications Pre Commit Hooks Environment
- Environment:Dotnet Machinelearning ONNX Runtime Environment
- Environment:Lucidrains X transformers Python Environment
- Environment:Sgl project Sglang CUDA Runtime
- Environment:Volcengine Verl SGLang Rollout Environment