Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Deepspeedai DeepSpeed Hybrid Engine RLHF Training
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:Confident ai Deepeval LLM Tracing and Observability
- Workflow:DataTalksClub Data engineering zoomcamp Kafka Stream Processing
- Workflow:TobikoData Sqlmesh Github CICD automation
- Workflow:Apache Hudi Docker Demo Setup
- Workflow:Dagster io Dagster Dbt Integration
- Workflow:Infiniflow Ragflow Chat Application Setup
- Workflow:Langgenius Dify App Creation and Configuration
- Workflow:Sgl project Sglang Multimodal Vision Language Inference
Principles
- Principle:Testtimescaling Testtimescaling github io Event Driven Step Progression
- Principle:Deepspeedai DeepSpeed Mesh Device Configuration
- Principle:Eric mitchell Direct preference optimization Concatenated Forward Pass
- Principle:Zai org CogVideo Text to Video Generation
- Principle:Microsoft Playwright Verify Expectations with Agent
- Principle:NVIDIA NeMo Aligner Reward Model Data Preparation
- Principle:Lakeraai Pint benchmark Custom Eval Function Interface
- Principle:Datajuicer Data juicer Operator Package Registration
- Principle:Deepseek ai Janus Vision Encoding and Embedding Fusion
- Principle:Neuml Txtai Index Update
Implementations
- Implementation:Microsoft Onnxruntime InferenceSession Get Inputs
- Implementation:CARLA simulator Carla Show Topology Tool
- Implementation:Tensorflow Tfjs LayersModel Compile
- Implementation:EvolvingLMMs Lab Lmms eval WildVision Bench Evaluation Utils
- Implementation:SeleniumHQ Selenium ChromiumDriverLogLevel
- Implementation:Iterative Dvc Dependency Dataset
- Implementation:Explodinggradients Ragas AspectCritic Metric
- Implementation:NVIDIA DALI EfficientNet Backbone
- Implementation:Langchain ai Langgraph Config Graph Paths
- Implementation:Turboderp org Exllamav2 ExLlamaV2TokenEnforcerFilter
Heuristics
- Heuristic:Scikit learn contrib Imbalanced learn Sparse Matrix Handling
- Heuristic:VainF Torch Pruning GQA Head Pruning Constraints
- Heuristic:DataExpert io Data engineer handbook Flink Checkpointing Interval Tuning
- Heuristic:CrewAIInc CrewAI LLM Provider Message Workarounds
- Heuristic:LMCache LMCache Chunk Size And Default Config
- Heuristic:Fede1024 Rust rdkafka Manual Offset Store Pattern
- Heuristic:Apache Dolphinscheduler Load Balancer Strategy Selection
- Heuristic:Heibaiying BigData Notes Kafka Consumer Offset Strategy Tip
- Heuristic:Cypress io Cypress Global Install Warning
- Heuristic:OpenGVLab InternVL Dynamic Resolution Tiling
Environments
- Environment:Togethercomputer Together python API Credentials
- Environment:LMCache LMCache NIXL Transfer Library
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:Run llama Llama index Fsspec Remote Storage
- Environment:Deepspeedai DeepSpeed Multi Accelerator Environment
- Environment:Sgl project Sglang Multi Platform Accelerators
- Environment:PacktPublishing LLM Engineers Handbook Selenium Chrome Crawler Environment
- Environment:Duckdb Duckdb Code Formatting Tools
- Environment:DataTalksClub Data engineering zoomcamp Kafka Confluent Environment
- Environment:Fede1024 Rust rdkafka Kafka Broker Runtime