Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Datahub project Datahub Metadata Ingestion Pipeline
- Workflow:Vllm project Vllm Offline Text Generation
- Workflow:ARISE Initiative Robosuite Teleoperation
- Workflow:Ggml org Ggml GPT2 Text Generation
- Workflow:Openclaw Openclaw Agent Message Loop
- Workflow:Groq Groq python Text To Speech
- Workflow:Arize ai Phoenix Prompt Management Pipeline
- Workflow:Duckdb Duckdb Code Generation Pipeline
- Workflow:Mbzuai oryx Awesome LLM Post training Deep Paper Collection
- Workflow:Truera Trulens LangGraph Agent Evaluation
Principles
- Principle:Sktime Pytorch forecasting Synthetic Data Generation
- Principle:HKUDS AI Trader Frontend Portfolio Display
- Principle:Helicone Helicone Registry Snapshot Testing
- Principle:Deepspeedai DeepSpeed Pipeline Module Construction
- Principle:Cleanlab Cleanlab Automated Issue Detection
- Principle:Getgauge Taiko Runtime Configuration
- Principle:Apache Dolphinscheduler RPC Service Contract Definition
- Principle:Ollama Ollama Anthropic Trace Logging
- Principle:Neuml Txtai Dimensionality Reduction
- Principle:MarketSquare Robotframework browser ES2015 Module Transpilation
Implementations
- Implementation:Alibaba MNN RapidJSON Reader
- Implementation:Diagram of thought Diagram of thought Full Vs Minimal Template Selection
- Implementation:Apache Druid Compaction Config Completions
- Implementation:Ollama Ollama Llama Impl
- Implementation:Neuml Txtai M2V Vectors
- Implementation:LMCache LMCache LMCacheWorker Register
- Implementation:Datajuicer Data juicer ImageTaggingMapper
- Implementation:Ggml org Llama cpp Arg Parser
- Implementation:Pyro ppl Pyro SubsampleMessenger
- Implementation:Ggml org Llama cpp Download
Heuristics
- Heuristic:Lm sys FastChat Vicuna SFT Training Hyperparameters
- Heuristic:Princeton nlp Tree of thought llm API Request Batching
- Heuristic:Kserve Kserve Multinode Replica Calculation
- Heuristic:Microsoft Agent framework Tool Approval Mode Production
- Heuristic:Tensorflow Serving Warning Deprecated CreateTfrtSavedModel Raw
- Heuristic:Cleanlab Cleanlab Label Quality Scoring Method Selection
- Heuristic:Haosulab ManiSkill Initial Pose Performance
- Heuristic:SeleniumHQ Selenium Warning Deprecated HasDownloads GetDownloadableFiles
- Heuristic:Openai Whisper Compression Ratio Threshold
- Heuristic:Deepspeedai DeepSpeed Shared Memory Sizing
Environments
- Environment:Spotify Luigi AWS S3 Storage
- Environment:OpenRLHF OpenRLHF CUDA GPU Environment
- Environment:Allenai Open instruct vLLM Inference
- Environment:FlagOpen FlagEmbedding Finetuning Environment
- Environment:Pytorch Serve Python PyTorch Runtime
- Environment:Huggingface Alignment handbook BitsAndBytes CUDA
- Environment:Dotnet Machinelearning Platform Architecture Support
- Environment:Gretelai Gretel synthetics Python Base Environment
- Environment:Speechbrain Speechbrain Multi GPU DDP
- Environment:ThreeSR Awesome Inference Time Scaling Semantic Scholar API Environment