Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Run llama Llama index ReAct Agent
- Workflow:Run llama Llama index Embedding Finetuning
- Workflow:Dotnet Machinelearning Text Classification
- Workflow:Unslothai Unsloth Vision Model Finetuning
- Workflow:Apache Spark Building and Testing
- Workflow:AUTOMATIC1111 Stable diffusion webui Textual inversion training
- Workflow:DataTalksClub Data engineering zoomcamp Kestra ETL Pipeline
- Workflow:LaurentMazare Tch rs LLM Text Generation
- Workflow:Webdriverio Webdriverio Page Object Pattern
- Workflow:Webdriverio Webdriverio WDIO Testrunner Setup
Principles
- Principle:Norrrrrrr lyn WAInjectBench Ensemble Aggregation Text
- Principle:DataTalksClub Data engineering zoomcamp SQL Revenue Aggregation
- Principle:Ggml org Llama cpp Ngram Speculative Drafting
- Principle:Huggingface Diffusers Quantized Model Loading
- Principle:Eventual Inc Daft Iceberg Writing
- Principle:Facebookresearch Habitat lab Environment Setup
- Principle:FlowiseAI Flowise Workspace Navigation
- Principle:Webdriverio Webdriverio Service Hook Lifecycle
- Principle:Speechbrain Speechbrain Noisy Speech Data Preparation
- Principle:InternLM Lmdeploy Multimodal Inference
Implementations
- Implementation:Online ml River FeatureExtraction Vectorize
- Implementation:Apache Flink FileCommitter Commit
- Implementation:Ucbepic Docetl Directive ClarifyInstructions
- Implementation:Bentoml BentoML GrpcClient
- Implementation:Astronomer Astronomer cosmos SnowflakePrivateKeyProfileMapping
- Implementation:Facebookresearch Habitat lab ClientMessageManager
- Implementation:SeldonIO Seldon core Seldon Model CRD
- Implementation:BerriAI Litellm Least Busy Strategy
- Implementation:Apache Paimon KeyValueDataWriter
- Implementation:ArroyoSystems Arroyo Pipeline Config Modal
Heuristics
- Heuristic:Lance format Lance BM25 FTS Configuration
- Heuristic:Openai Evals Event Batching Configuration
- Heuristic:ARISE Initiative Robosuite XML Reset Method Tradeoff
- Heuristic:Huggingface Alignment handbook Liger Kernel Memory
- Heuristic:Axolotl ai cloud Axolotl Memory Optimization Tips
- Heuristic:CrewAIInc CrewAI MCP Timeout And Retry Strategy
- Heuristic:Marker Inc Korea AutoRAG GPU Memory Cleanup Pattern
- Heuristic:Ggml org Llama cpp GPU Layer Offloading Verification
- Heuristic:Romsto Speculative Decoding KV Cache Instability
- Heuristic:Ray project Ray Serve Concurrency And Backpressure
Environments
- Environment:Datahub project Datahub Docker Runtime
- Environment:Infiniflow Ragflow GPU CUDA Environment
- Environment:Fastai Fastbook NLP SpaCy Environment
- Environment:Puppeteer Puppeteer Configuration Environment Variables
- Environment:Huggingface Alignment handbook DeepSpeed Multi Node
- Environment:OWASP Www project top 10 for large language model applications Pydantic Invoice Agent Runtime
- Environment:Huggingface Open r1 Slurm Cluster
- Environment:Puppeteer Puppeteer Cross Platform Browser Environment
- Environment:Openai Openai agents python Memory Extensions Dependencies
- Environment:Langgenius Dify Credentials And Env Vars