Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Kubeflow Pipelines Standalone Deployment
- Workflow:Sgl project Sglang Frontend Language Multi Turn Chat
- Workflow:Puppeteer Puppeteer Page Screenshot Capture
- Workflow:Openclaw Openclaw Channel Connection
- Workflow:FlagOpen FlagEmbedding Embedder Finetuning
- Workflow:PacktPublishing LLM Engineers Handbook Feature Engineering
- Workflow:Pola rs Polars Time Series Analysis
- Workflow:Langchain ai Langgraph CLI Deployment
- Workflow:PrefectHQ Prefect Per Worker Task Concurrency
- Workflow:Microsoft Playwright Network mocking and interception
Principles
- Principle:TA Lib Ta lib python Abstract Function Instantiation
- Principle:Apache Flink Bucket Assignment
- Principle:DistrictDataLabs Yellowbrick Classification Report Visualization
- Principle:Heibaiying BigData Notes Storm Topology Wiring
- Principle:Openai Openai python Realtime WebSocket Connection
- Principle:Kubeflow Kubeflow Installation Method Selection
- Principle:Neuml Txtai Retrieval Augmented Generation
- Principle:DistrictDataLabs Yellowbrick Elbow Method Cluster Selection
- Principle:Langgenius Dify Model and Prompt Configuration
- Principle:Scikit learn Scikit learn Online Learning
Implementations
- Implementation:Apache Airflow Metrics Template
- Implementation:Sgl project Sglang ServerArgs Init
- Implementation:Allenai Open instruct Layer Init
- Implementation:ARISE Initiative Robosuite LightingModder
- Implementation:Ucbepic Docetl AI Chat And Prompt Improvement
- Implementation:Onnx Onnx OpSchema System
- Implementation:Ucbepic Docetl MOAR SearchUtils
- Implementation:Langchain ai Langchain Hub Push Pull
- Implementation:Sgl project Sglang Kernel Utils Header
- Implementation:Facebookresearch Habitat lab MobileManipulator
Heuristics
- Heuristic:Huggingface Diffusers Dtype Precision Selection
- Heuristic:Datahub project Datahub Emitter Selection Strategy
- Heuristic:Spotify Luigi Streaming MapReduce Processing
- Heuristic:Huggingface Peft LoRA Default Configuration
- Heuristic:Spcl Graph of thoughts Scoring With Error Counting
- Heuristic:Open compass VLMEvalKit Video Frame Sampling Configuration
- Heuristic:Microsoft Onnxruntime ORTModule Wrapping Order
- Heuristic:Snorkel team Snorkel NLP Preprocessor Memoization
- Heuristic:Rapidsai Cuml Batch Size Memory Tradeoff
- Heuristic:Langchain ai Langchain Text Splitter Separator Hierarchy
Environments
- Environment:Facebookresearch Audiocraft XFormers Memory Efficient Attention
- Environment:ThreeSR Awesome Inference Time Scaling Git CLI Environment
- Environment:SqueezeAILab ETS Multi GPU Sglang Runtime
- Environment:Microsoft Playwright Platform Support Environment
- Environment:Datahub project Datahub Docker Quickstart Environment
- Environment:Kserve Kserve GPU Accelerator
- Environment:FMInference FlexLLMGen NVMe Disk
- Environment:DataTalksClub Data engineering zoomcamp Dbt DuckDB Environment
- Environment:Mistralai Client python Python SDK Environment
- Environment:Run llama Llama index Python LlamaIndex Core