Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki: best practices and expert-level knowledge for machine learning and data engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Alignment handbook QLoRA Single GPU Finetuning
- Workflow:Princeton nlp Tree of thought llm Baseline comparison
- Workflow:Mlc ai Web llm Structured Output Generation
- Workflow:Apache Druid Visual Data Exploration
- Workflow:Openai Openai python Realtime Conversation
- Workflow:Obss Sahi Sliced Inference Pipeline
- Workflow:Google research Deduplicate text datasets Cross dataset deduplication
- Workflow:Vibrantlabsai Ragas Testset Generation
- Workflow:ClickHouse ClickHouse Contributing Pull Request
- Workflow:NVIDIA NeMo Aligner Supervised Fine Tuning
Principles
- Principle:Truera Trulens Application Recording
- Principle:Langfuse Langfuse ChatML Normalization
- Principle:Mage ai Mage ai Destination Configuration
- Principle:Huggingface Transformers Environment Setup
- Principle:Mbzuai oryx Awesome LLM Post training Category Taxonomy Definition
- Principle:Huggingface Transformers Documentation Metadata Management
- Principle:Huggingface Transformers Adapter Training
- Principle:PrefectHQ Prefect Asset Definition
- Principle:Langchain ai Langgraph Cache Backend Selection
- Principle:Cohere ai Cohere python Rerank Response Processing
Implementations
- Implementation:NVIDIA TransformerEngine TE Autocast
- Implementation:SeleniumHQ Selenium Git Fork And Branch Pattern
- Implementation:Online ml River Bandit BayesUCB
- Implementation:LLMBook zh LLMBook zh github io GPTQConfig Quantization
- Implementation:Hpcaitech ColossalAI LoRAConstructor
- Implementation:Scikit learn Scikit learn FetchLfw
- Implementation:Huggingface Datasets StreamingDownloadManager
- Implementation:Alibaba ROLL ModelUtils
- Implementation:Mlflow Mlflow ML Package Versions Data
- Implementation:Bentoml BentoML Bentos Export Import
Heuristics
- Heuristic:SeleniumHQ Selenium FindElements For Absence Check
- Heuristic:VainF Torch Pruning GQA Head Pruning Constraints
- Heuristic:Huggingface Datasets Batch Size Optimization
- Heuristic:OpenGVLab InternVL Dynamic Resolution Tiling
- Heuristic:EvolvingLMMs Lab Lmms eval Distributed Padding Strategy
- Heuristic:Onnx Onnx Big Endian Byte Order Handling
- Heuristic:Protectai Modelscan Stricter Zip Detection
- Heuristic:Microsoft BIPIA Torch Compile Platform Guard
- Heuristic:DevExpress Testcafe MacOS Browser Launch Serialization
- Heuristic:Dagster io Dagster Retry Strategy Configuration
Environments
- Environment:Huggingface Trl Python Core Dependencies
- Environment:Eventual Inc Daft Ray Distributed Runner
- Environment:Deepspeedai DeepSpeed XPU Environment
- Environment:Haifengl Smile Native BLAS LAPACK ARPACK
- Environment:SeldonIO Seldon core Go Build Toolchain Environment
- Environment:LLMBook zh LLMBook zh github io Bitsandbytes Quantization Environment
- Environment:Openai Evals Python Runtime
- Environment:Cohere ai Cohere python AWS Integration Dependencies
- Environment:Dagster io Dagster GRPC Communication
- Environment:Princeton nlp SimPO VLLM Inference