Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Helicone Helicone Local Development Setup
- Workflow:Datahub project Datahub Java SDK V2 Entity Management
- Workflow:Isaac sim IsaacGymEnvs Custom Task Development
- Workflow:Facebookresearch Habitat lab Custom Task Extension
- Workflow:Apache Flink Stream File Compaction
- Workflow:Explodinggradients Ragas Test Data Generation
- Workflow:Apache Paimon Blob Storage With Descriptors
- Workflow:BerriAI Litellm Fine Tuning Job
- Workflow:Predibase Lorax Structured JSON Output
- Workflow:Bigscience workshop Petals Server Contribution
Principles
- Principle:Protectai Modelscan Model File Abstraction
- Principle:MaterializeInc Materialize Multi Backend Data Ingestion
- Principle:FlowiseAI Flowise Chat Prediction
- Principle:Huggingface Alignment handbook Multi Task SFT Training
- Principle:Datahub project Datahub Protobuf Schema Conversion
- Principle:Liu00222 Open Prompt Injection Model Query Interface
- Principle:Duckdb Duckdb SQL Grammar Generation
- Principle:Nightwatchjs Nightwatch Browser Command Execution
- Principle:Kserve Kserve PD Scheduler Routing
- Principle:AUTOMATIC1111 Stable diffusion webui Extension Architecture
Implementations
- Implementation:SeleniumHQ Selenium Urls
- Implementation:MaterializeInc Materialize Zippy Test Framework
- Implementation:Sktime Pytorch forecasting Check Estimator
- Implementation:Open compass VLMEvalKit MMHelix Base Evaluator
- Implementation:Marker Inc Korea AutoRAG Parser Start Parsing
- Implementation:Spcl Graph of thoughts Aggregate Operation
- Implementation:Intel Ipex llm Deepspeed AutoTP FastAPI Serving
- Implementation:Cypress io Cypress OpenModule Start
- Implementation:AUTOMATIC1111 Stable diffusion webui CLIP Interrogator
- Implementation:Microsoft Semantic kernel Vertex Embeddings TestData
Heuristics
- Heuristic:DataTalksClub Data engineering zoomcamp Kafka Consumer Poll Timeout
- Heuristic:Haosulab ManiSkill Physics Solver Tuning
- Heuristic:Sktime Pytorch forecasting Batch Size Selection
- Heuristic:SeldonIO Seldon core Tracing Latency Tip
- Heuristic:VainF Torch Pruning Over Pruning Prevention
- Heuristic:Scikit learn contrib Imbalanced learn KNeighbors Selection Tips
- Heuristic:DataExpert io Data engineer handbook SparkSession Singleton Pattern
- Heuristic:Confident ai Deepeval Dotenv Loading Order
- Heuristic:Spotify Luigi Atomic File Writes
- Heuristic:VainF Torch Pruning GQA Head Pruning Constraints
Environments
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:Ggml org Ggml C Cpp Build Environment
- Environment:Obss Sahi Python Detection Frameworks
- Environment:Langgenius Dify Credentials And Env Vars
- Environment:Iterative Dvc Remote Storage Backends
- Environment:VainF Torch Pruning PyTorch Python Core
- Environment:Lm sys FastChat Python Core Dependencies
- Environment:InternLM Lmdeploy Python Dependencies
- Environment:TobikoData Sqlmesh GitHub CICD Runner
- Environment:Vespa engine Vespa Java 17 Build Runtime