Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Bigscience workshop Petals Prompt Tuning Chatbot
- Workflow:Fede1024 Rust rdkafka Mock Cluster Testing
- Workflow:Mit han lab Llm awq AWQ Model Quantization
- Workflow:TobikoData Sqlmesh Dbt project migration
- Workflow:SeldonIO Seldon core Model Deployment
- Workflow:Eventual Inc Daft Data Lakehouse ETL
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
- Workflow:MarketSquare Robotframework browser Installation and Setup
- Workflow:Anthropics Anthropic sdk python Extended Thinking Reasoning
- Workflow:Avhz RustQuant Stochastic Process Simulation
Principles
- Principle:Kubeflow Pipelines Parallel Iteration
- Principle:Ggml org Llama cpp Model Architecture Support
- Principle:Elevenlabs Elevenlabs python Audio Source Selection
- Principle:Openai Evals Solver Output Postprocessing
- Principle:Pytorch Serve Speech Recognition Inference
- Principle:Lakeraai Pint benchmark Dataset Preparation
- Principle:Sdv dev SDV Synthesizer Persistence
- Principle:AUTOMATIC1111 Stable diffusion webui Embedding creation
- Principle:FlowiseAI Flowise Lead Capture
- Principle:CrewAIInc CrewAI Semantic Retrieval
Implementations
- Implementation:BerriAI Litellm Responses API
- Implementation:Langgenius Dify Contract Router
- Implementation:CARLA simulator Carla ROS2 Interface
- Implementation:Online ml River Sketch HeavyHitters
- Implementation:TobikoData Sqlmesh EditorPreview
- Implementation:Open compass VLMEvalKit MMIF Function And Compare
- Implementation:Deepspeedai DeepSpeed Transformer CUDA
- Implementation:Ollama Ollama Mtmd Llama4
- Implementation:Microsoft Playwright AndroidDispatcher
- Implementation:Huggingface Optimum TaskProcessor
Heuristics
- Heuristic:Openai Evals Eval Resumption Strategy
- Heuristic:DataTalksClub Data engineering zoomcamp CSV Chunk Size Optimization
- Heuristic:Apache Airflow Variable Access Pattern
- Heuristic:AnswerDotAI RAGatouille Auto Batch Size For Long Documents
- Heuristic:Iamhankai Forest of Thought UCB Exploration Constant
- Heuristic:SeldonIO Seldon core Model Scheduling Preference Tip
- Heuristic:Puppeteer Puppeteer Timeout Hierarchy
- Heuristic:Norrrrrrr lyn WAInjectBench Zero Vector Fallback Failed Embeddings
- Heuristic:Helicone Helicone ClickHouse ReplacingMergeTree FINAL
- Heuristic:Openai Openai python Warning Deprecated Eval Stored Completions
Environments
- Environment:Microsoft BIPIA Python CUDA GPU Environment
- Environment:FlowiseAI Flowise Queue Mode Environment
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Mlfoundations Open flamingo HuggingFace Open CLIP Dependencies
- Environment:Apache Spark Release Build Environment
- Environment:Fastai Fastbook NLP SpaCy Environment
- Environment:LLMBook zh LLMBook zh github io HuggingFace Transformers Stack
- Environment:Apache Dolphinscheduler Database Backend
- Environment:ThreeSR Awesome Inference Time Scaling Python Runtime Environment
- Environment:Dotnet Machinelearning Native Build Toolchain