Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Gretelai Gretel synthetics ACTGAN Tabular Synthesis
- Workflow:Mlc ai Web llm Structured Output Generation
- Workflow:Apache Flink File Sink Pipeline
- Workflow:Junyanz Pytorch CycleGAN and pix2pix CycleGAN Training
- Workflow:Langfuse Langfuse Batch export pipeline
- Workflow:Marker Inc Korea AutoRAG Evaluation Data Creation
- Workflow:Zai org CogVideo Diffusers Text to Video Inference
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:OpenBMB UltraFeedback GPT4 Preference Annotation
- Workflow:Huggingface Datasets Dataset Preprocessing
Principles
- Principle:Alibaba MNN Runtime Configuration
- Principle:Iamhankai Forest of Thought Answer Equivalence Checking
- Principle:Ggml org Llama cpp Template Capability Detection
- Principle:Gretelai Gretel synthetics Batch Synthetic Generation
- Principle:Avhz RustQuant Exotic Option Contracts
- Principle:LMCache LMCache RoPE Position Recovery
- Principle:Scikit learn Scikit learn Semi Supervised Learning
- Principle:Huggingface Datatrove Text Formatting Framework
- Principle:Mlc ai Web llm Streaming Response Processing
- Principle:Speechbrain Speechbrain Source Separation Evaluation
Implementations
- Implementation:Kserve Kserve Triton MMS Perf Test
- Implementation:Microsoft Playwright APIRequestContext Dispose
- Implementation:Google deepmind Mujoco MJWarp Passive
- Implementation:Microsoft Agent framework Workflow As Agent
- Implementation:Sktime Pytorch forecasting EncoderDecoderTimeSeriesDataModule
- Implementation:Openai Openai python Response Refusal Delta
- Implementation:Huggingface Datatrove FastTextClassifierFilter
- Implementation:Mlc ai Mlc llm Download and cache mlc weights
- Implementation:Teamcapybara Capybara Selector Link
- Implementation:Online ml River Stream Shuffle
Heuristics
- Heuristic:EvolvingLMMs Lab Lmms eval Distributed Padding Strategy
- Heuristic:Openai Openai python Structured Output Strict Schema
- Heuristic:Microsoft Autogen Agent Thread Safety
- Heuristic:Microsoft Semantic kernel Prompt Injection Safety
- Heuristic:Huggingface Peft Gradient Checkpointing With Quantization
- Heuristic:TobikoData Sqlmesh Snapshot TTL Defaults
- Heuristic:Apache Dolphinscheduler Epoll Vs NIO Selection
- Heuristic:Turboderp org Exllamav2 Quantization Conversion Tips
- Heuristic:ARISE Initiative Robosuite XML Reset Method Tradeoff
- Heuristic:Fede1024 Rust rdkafka Partitioner Must Not Block
Environments
- Environment:Apache Dolphinscheduler Netty Runtime
- Environment:Huggingface Datasets SQL Dependencies
- Environment:Lance format Lance Rust Toolchain
- Environment:Openai Openai agents python OpenAI API Credentials
- Environment:EvolvingLMMs Lab Lmms eval Server Mode Environment
- Environment:Iamhankai Forest of Thought Python CUDA Runtime
- Environment:Sgl project Sglang GitHub Actions
- Environment:Huggingface Datasets Image Dependencies
- Environment:Facebookresearch Habitat lab HITL Runtime Environment
- Environment:Google deepmind Dm control EGL Headless Rendering