Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search across ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Sail sg LongSpec GLIDE Draft Model Training
- Workflow:Anthropics Anthropic sdk python Streaming Message Interaction
- Workflow:LMCache LMCache P2P KV Cache Sharing
- Workflow:Ucbepic Docetl YAML Pipeline Execution
- Workflow:Microsoft Autogen Graph Based Agent Orchestration
- Workflow:Duckdb Duckdb Extension Development And Distribution
- Workflow:Openai CLIP Zero shot image classification
- Workflow:Run llama Llama index OpenAI LLM Finetuning
- Workflow:Mlfoundations Open flamingo Distributed Training
- Workflow:Apache Dolphinscheduler Datasource Plugin Development
Principles
- Principle:Apache Hudi Clustering Strategy Configuration
- Principle:Zai org CogVideo Optical Flow Estimation
- Principle:OpenGVLab InternVL Multimodal Data Collation
- Principle:Hiyouga LLaMA Factory Low Rank Adaptation
- Principle:Ggml org Llama cpp Chat Template Types
- Principle:Ucbepic Docetl Pandas Semantic Operations
- Principle:Pyro ppl Pyro Epidemiological Modeling
- Principle:Run llama Llama index Agent Execution
- Principle:DistrictDataLabs Yellowbrick Influential Outlier Detection
- Principle:Shiyu coder Kronos Qlib Training Dataset
Implementations
- Implementation:FMInference FlexLLMGen Batch Query Test
- Implementation:Lance format Lance DecoderBench
- Implementation:NVIDIA NeMo Curator XennaStageAdapter
- Implementation:Online ml River Preprocessing LDA
- Implementation:Deepset ai Haystack TransformersSimilarityRanker
- Implementation:NVIDIA NeMo Curator FastText Filters
- Implementation:Microsoft DeepSpeedExamples BingBert Timer
- Implementation:Cohere ai Cohere python Text Preparation Pattern
- Implementation:Ollama Ollama Imagegen Manifest Weights
- Implementation:FlowiseAI Flowise AgentReasoningCard
Heuristics
- Heuristic:FMInference FlexLLMGen Weight Compression 4bit
- Heuristic:Dagster io Dagster Record Over Dataclass
- Heuristic:Avhz RustQuant Learning Rate Tuning
- Heuristic:Apache Dolphinscheduler Netty Thread Sizing
- Heuristic:Mistralai Client python Stream File Uploads
- Heuristic:Openclaw Openclaw Config Cascade Resolution
- Heuristic:Eric mitchell Direct preference optimization TF32 Matmul Precision
- Heuristic:Duckdb Duckdb Sanitizer Configuration
- Heuristic:Dotnet Machinelearning AutoML SMAC Dimension Limit
- Heuristic:Kornia Kornia CPU GPU Branching Tip
Environments
- Environment:Huggingface Alignment handbook Python Transformers
- Environment:Mistralai Client python Agents Environment
- Environment:Dotnet Machinelearning Dotnet SDK And Runtime
- Environment:Interpretml Interpret Visualization Environment
- Environment:VainF Torch Pruning CUDA GPU Benchmarking
- Environment:LLMBook zh LLMBook zh github io PyTorch CUDA GPU Environment
- Environment:Neuml Txtai Python Core Environment
- Environment:Anthropics Anthropic sdk python GCP Vertex Environment
- Environment:Sgl project Sglang Runtime
- Environment:Datajuicer Data juicer LLM API Credentials Environment