Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
- Workflow:Mbzuai oryx Awesome LLM Post training Awesome List Curation
- Workflow:Deepset ai Haystack Document Preprocessing Pipeline
- Workflow:InternLM Lmdeploy W4A16 AWQ Quantization
- Workflow:Norrrrrrr lyn WAInjectBench Embedding Classifier Training
- Workflow:OpenRLHF OpenRLHF SFT Training
- Workflow:DataExpert io Data engineer handbook Dimensional Data Modeling Environment Setup
- Workflow:Speechbrain Speechbrain Text to Speech Training
- Workflow:SeldonIO Seldon core Production Monitoring Pipeline
- Workflow:Apache Dolphinscheduler RPC Service Communication
Principles
- Principle:Langchain ai Langchain Release Preparation
- Principle:Puppeteer Puppeteer Browser Version Verification
- Principle:Googleapis Python genai Local Tokenization
- Principle:PrefectHQ Prefect Frontend Build Optimization
- Principle:Open compass VLMEvalKit Results Summarization
- Principle:Apache Dolphinscheduler RPC Server Handler
- Principle:Langchain ai Langgraph Store Batch Operations
- Principle:Ggml org Llama cpp Diffusion Text Generation
- Principle:Interpretml Interpret Debug And Logging
- Principle:Ggml org Llama cpp Terminal IO
Implementations
- Implementation:Openai Openai python Completion Usage Model
- Implementation:Kserve Kserve LLM Worker Template
- Implementation:Google deepmind Dm control Variation Math
- Implementation:Datajuicer Data juicer TsvFormatter
- Implementation:PeterL1n BackgroundMattingV2 VideoDataset
- Implementation:Lm sys FastChat Build Side By Side Arena Named UI
- Implementation:Facebookresearch Habitat lab SessionRecorder init
- Implementation:Apache Dolphinscheduler FailoverCoordinator GlobalFailover
- Implementation:Apache Kafka CoordinatorRuntime OnMetadataUpdate
- Implementation:Huggingface Datasets SparkDatasetReader
Heuristics
- Heuristic:Princeton nlp SimPO Concatenated Forward Pass
- Heuristic:Togethercomputer Together python Fine Tuning Parameter Validation
- Heuristic:DevExpress Testcafe Concurrency Factor Limit
- Heuristic:NVIDIA DALI Batch Size Tuning
- Heuristic:Dotnet Machinelearning Sparsity Threshold Optimization
- Heuristic:Princeton nlp Tree of thought llm API Request Batching
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:BerriAI Litellm Token Counting Buffer
- Heuristic:Neuml Txtai Faiss Index Sizing Tip
- Heuristic:Iamhankai Forest of Thought Tree Iteration Scaling
Environments
- Environment:Langgenius Dify Python Backend Environment
- Environment:BerriAI Litellm PostgreSQL Database
- Environment:MaterializeInc Materialize Buildkite CI Runtime
- Environment:Apache Beam Java Build Environment
- Environment:Risingwavelabs Risingwave Docker Deployment Environment
- Environment:Vespa engine Vespa CMake Cpp23 Build Environment
- Environment:Zai org CogVideo Diffusers Inference Environment
- Environment:Recommenders team Recommenders Spark Environment
- Environment:Microsoft LoRA PyTorch CUDA Environment
- Environment:Datajuicer Data juicer Ray Cluster Environment