Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Kafka PR Merge And Backport
- Workflow:Apache Airflow Provider Distribution Development
- Workflow:Huggingface Datasets Dataset Streaming
- Workflow:Pola rs Polars Data IO and Format Conversion
- Workflow:Facebookresearch Audiocraft EnCodec Compression Training
- Workflow:Heibaiying BigData Notes Storm Topology Development
- Workflow:Ray project Ray Build and Release Pipeline
- Workflow:Obss Sahi Sliced Inference Pipeline
- Workflow:Lm sys FastChat LoRA QLoRA Finetuning
- Workflow:Heibaiying BigData Notes Flink Kafka Streaming Pipeline
Principles
- Principle:Sktime Pytorch forecasting V2 Model Base
- Principle:Duckdb Duckdb Source Package Building
- Principle:MarketSquare Robotframework browser Test Execution
- Principle:Ggml org Llama cpp Terminal IO
- Principle:Dagster io Dagster Dynamic Partitioning
- Principle:SeleniumHQ Selenium Element Caching Strategy
- Principle:Fastai Fastbook Backpropagation
- Principle:Ggml org Llama cpp Vocabulary System
- Principle:Testtimescaling Testtimescaling github io Integration Verification
- Principle:Hiyouga LLaMA Factory Gradient Checkpointing Theory
Implementations
- Implementation:OpenGVLab InternVL MPTConfig
- Implementation:Ggml org Ggml Cann aclnn ops
- Implementation:FlowiseAI Flowise ItemCard
- Implementation:OpenHands OpenHands JiraDcIntegrationStore
- Implementation:Online ml River Compose Select
- Implementation:Norrrrrrr lyn WAInjectBench SentenceTransformer Init
- Implementation:Apache Dolphinscheduler BaseAdHocAndPooledClient Extension
- Implementation:Sail sg LongSpec Glide Inference Init
- Implementation:Nautechsystems Nautilus trader ImportableStrategyConfig Init
- Implementation:Speechbrain Speechbrain Hparams CommonVoice Conformer Transducer
Heuristics
- Heuristic:Danijar Dreamerv3 Replay Context Carry Init
- Heuristic:Googleapis Python genai LRO Polling Backoff
- Heuristic:Huggingface Datasets Num Proc Guidelines
- Heuristic:Nautechsystems Nautilus trader Order Rate Limiting Configuration
- Heuristic:Promptfoo Promptfoo Transient Error Classification
- Heuristic:Fede1024 Rust rdkafka Cooperative Rebalance Protocol
- Heuristic:Romsto Speculative Decoding Shared Tokenizer Requirement
- Heuristic:Truera Trulens Temperature Zero For Deterministic Scoring
- Heuristic:Togethercomputer Together python Multipart Upload Strategy
- Heuristic:Huggingface Open r1 Reward Function Tuning
Environments
- Environment:ThreeSR Awesome Inference Time Scaling Git CLI Environment
- Environment:Kserve Kserve VLLM Runtime
- Environment:Spcl Graph of thoughts Python 3 8 Runtime
- Environment:Facebookresearch Audiocraft AudioCraft Environment Variables
- Environment:Haosulab ManiSkill Motion Planning Deps
- Environment:Langchain ai Langgraph Postgres Checkpoint Environment
- Environment:Dagster io Dagster PostgreSQL Storage
- Environment:Apache Dolphinscheduler Node Pnpm Runtime
- Environment:Huggingface Datasets Python PyArrow Core
- Environment:Microsoft BIPIA Python CUDA GPU Environment