Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Avhz RustQuant Monte Carlo Option Pricing
- Workflow:Mlfoundations Open flamingo Few Shot Evaluation
- Workflow:Microsoft Autogen Swarm Agent Handoff
- Workflow:Deepset ai Haystack Extractive QA Pipeline
- Workflow:Fede1024 Rust rdkafka At Least Once Processing
- Workflow:Apache Paimon Data Ingestion With Ray Sink
- Workflow:EvolvingLMMs Lab Lmms eval End to End Evaluation
- Workflow:Dotnet Machinelearning Text Classification
- Workflow:MaterializeInc Materialize Docker Image Build
- Workflow:Onnx Onnx Model Composition
Principles
- Principle:Huggingface Optimum GPTQ Quantizer Configuration
- Principle:Webdriverio Webdriverio Performance Instrumentation
- Principle:Sail sg LongSpec Math Equivalence Evaluation
- Principle:Apache Paimon Indexed Split Result Retrieval
- Principle:Mbzuai oryx Awesome LLM Post training Keyword Data Loading
- Principle:Scikit learn contrib Imbalanced learn Benchmark Dataset Loading
- Principle:Huggingface Trl DPO Training
- Principle:SeldonIO Seldon core HuggingFace Text Inference
- Principle:Eric mitchell Direct preference optimization Model Loading
- Principle:Apache Airflow RC Verification
Implementations
- Implementation:Scikit learn Scikit learn MLPClassifier
- Implementation:DistrictDataLabs Yellowbrick ValidationCurve Visualizer
- Implementation:Kornia Kornia Transpiler
- Implementation:Sgl project Sglang Kernel Ops Header
- Implementation:NVIDIA DALI Fn Decoders Image Random Crop
- Implementation:ARISE Initiative Robosuite TransformUtils
- Implementation:Evidentlyai Evidently Legacy UI Base
- Implementation:Puppeteer Puppeteer Frame Wait Methods
- Implementation:Online ml River Stats Mode
- Implementation:Openai Openai python Vector Store Search Response
Heuristics
- Heuristic:Infiniflow Ragflow Citation Threshold Decay
- Heuristic:Wandb Weave Payload Size Limits
- Heuristic:Mage ai Mage ai Record Deduplication Before Batch Export
- Heuristic:Turboderp org Exllamav2 Paged Cache Configuration
- Heuristic:OpenRLHF OpenRLHF Gradient Checkpointing Memory Tip
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Mbzuai oryx Awesome LLM Post training Checkpoint Every 3 Papers
- Heuristic:Togethercomputer Together python Retry Backoff Strategy
- Heuristic:Cleanlab Cleanlab KNN Distance Metric Selection
- Heuristic:NVIDIA NeMo Aligner Warning Deprecated Repository
Environments
- Environment:Evidentlyai Evidently Spark Engine Environment
- Environment:Unstructured IO Unstructured OpenAI API
- Environment:Neuml Txtai GPU Accelerator Detection
- Environment:DataTalksClub Data engineering zoomcamp Kestra Orchestration Environment
- Environment:Open compass VLMEvalKit GPU CUDA Environment
- Environment:Alibaba MNN GPU OpenCL Environment
- Environment:Getgauge Taiko Node Runtime
- Environment:Pola rs Polars Python Runtime Environment
- Environment:Eventual Inc Daft Python PyArrow Core
- Environment:Turboderp org Exllamav2 Flash Attention Backend