Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Duckdb Duckdb Extension Development And Distribution
- Workflow:Googleapis Python genai Text Content Generation
- Workflow:Evidentlyai Evidently ML Model Quality Report
- Workflow:Microsoft Autogen Graph Based Agent Orchestration
- Workflow:Datahub project Datahub CLI Metadata Ingestion
- Workflow:Neuml Txtai Semantic Search Pipeline
- Workflow:Wandb Weave Prompt Management
- Workflow:Webdriverio Webdriverio Page Object Pattern
- Workflow:EvolvingLMMs Lab Lmms eval Server Mode Evaluation
- Workflow:Vibrantlabsai Ragas Experiment Driven Development
Principles
- Principle:Cypress io Cypress Release Verification
- Principle:Ggml org Llama cpp Inference Context Creation
- Principle:Bitsandbytes foundation Bitsandbytes FSDP Quant State Recovery
- Principle:Mbzuai oryx Awesome LLM Post training Repository Publishing
- Principle:ClickHouse ClickHouse Banned Function Enforcement
- Principle:Kubeflow Pipelines XGBoost Model Training
- Principle:Guardrails ai Guardrails Server Config Scaffolding
- Principle:Online ml River Streaming ROCAUC
- Principle:LaurentMazare Tch rs Generative Adversarial Network
- Principle:FlagOpen FlagEmbedding Query Passage Pair Formatting
Implementations
- Implementation:Open compass VLMEvalKit VLAAThinkerChat
- Implementation:Langchain ai Langchain RecursiveCharacterTextSplitter Split Documents
- Implementation:Eventual Inc Daft ResourceRequest
- Implementation:NVIDIA NeMo Aligner Anneal SDXL
- Implementation:Speechbrain Speechbrain Prepare Switchboard LM
- Implementation:Datajuicer Data juicer ImageDiffusionMapper
- Implementation:EvolvingLMMs Lab Lmms eval Task Utility Interface
- Implementation:InternLM Lmdeploy Gemm Types
- Implementation:Microsoft DeepSpeedExamples Text Generation Test
- Implementation:Eventual Inc Daft DataFrame Groupby
Heuristics
- Heuristic:Obss Sahi Class Agnostic vs Per Class NMS
- Heuristic:Wandb Weave Sentinel Value Handling
- Heuristic:Danijar Dreamerv3 Percentile Return Normalization
- Heuristic:NVIDIA DALI Warning Deprecated C API V1 Functions
- Heuristic:Mlflow Mlflow Nested Run Organization
- Heuristic:Explodinggradients Ragas Retry And Backoff Configuration
- Heuristic:Apache Beam Watermark Update Throttling
- Heuristic:ContextualAI HALOs Online Round Budgeting
- Heuristic:EvolvingLMMs Lab Lmms eval Limit Flag Testing Only
- Heuristic:Ollama Ollama Download Retry Strategy
Environments
- Environment:Shiyu coder Kronos DDP Multi GPU Environment
- Environment:OpenGVLab InternVL PyTorch CUDA
- Environment:Openai Openai agents python MCP Dependencies
- Environment:Recommenders team Recommenders Spark Environment
- Environment:Fede1024 Rust rdkafka CI Test Runner
- Environment:Groq Groq python Python Groq SDK
- Environment:Microsoft BIPIA OpenAI API Environment
- Environment:Kubeflow Kubeflow Kubectl Kustomize CLI Environment
- Environment:Intel Ipex llm RAG LlamaIndex Environment
- Environment:Mlc ai Web llm Node Build Toolchain