Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Predibase Lorax Single LoRA Inference
- Workflow:Datahub project Datahub Protobuf Schema Ingestion
- Workflow:Getgauge Taiko Interactive Test Recording
- Workflow:Mlflow Mlflow Prompt Management
- Workflow:Mit han lab Llm awq HuggingFace Model Export
- Workflow:ChenghaoMou Text dedup Suffix Array Deduplication
- Workflow:Princeton nlp Tree of thought llm Adding new task
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit Optimizer Training
- Workflow:OWASP Www project top 10 for large language model applications Agentic Security Assessment
- Workflow:Elevenlabs Elevenlabs python Text to Speech Generation
Principles
- Principle:Vespa engine Vespa Indexing Error Handling
- Principle:Ggml org Llama cpp Embedding Computation
- Principle:Sgl project Sglang Schema Constrained Decoding
- Principle:PrefectHQ Prefect HTML Fetching
- Principle:Cleanlab Cleanlab Spurious Correlation Analysis
- Principle:Bentoml BentoML Model Cloud Sync
- Principle:Truera Trulens Dashboard Visualization
- Principle:Huggingface Datasets Text Dataset Building
- Principle:Roboflow Rf detr Model Initialization
- Principle:Mlc ai Web llm Web Worker Engine Handler
Implementations
- Implementation:FMInference FlexLLMGen DeepSpeed Quantization Utils
- Implementation:Bentoml BentoML Models List Get
- Implementation:Predibase Lorax Flash RoBERTa
- Implementation:Webdriverio Webdriverio InsightsHandler Class
- Implementation:Protectai Modelscan CLI Scan Command
- Implementation:Nautechsystems Nautilus trader BacktestEngine Add Strategy
- Implementation:Gretelai Gretel synthetics DGAN Train Numpy
- Implementation:InternLM Lmdeploy Daily Ete Test 3090
- Implementation:Vllm project Vllm SM100 MLA Tile Scheduler
- Implementation:Predibase Lorax Client Init
Heuristics
- Heuristic:Fede1024 Rust rdkafka Multi Version Dependency Hazard
- Heuristic:Huggingface Datasets Cache Fingerprinting Tips
- Heuristic:Lucidrains X transformers Sampling Temperature Strategy
- Heuristic:Unstructured IO Unstructured Chunk Size Tuning
- Heuristic:FlagOpen FlagEmbedding Length Sorted Batching
- Heuristic:Openai CLIP JIT Vs Non JIT Loading
- Heuristic:OWASP Www project top 10 for large language model applications SHA Pinning For GitHub Actions
- Heuristic:Facebookresearch Habitat lab Resume State Config Override
- Heuristic:Iterative Dvc YAML Dual Parser Strategy
- Heuristic:Apache Airflow DAG Complexity Reduction
Environments
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:NVIDIA DALI FFmpeg Environment
- Environment:PacktPublishing LLM Engineers Handbook Docker MongoDB Qdrant Infrastructure
- Environment:Huggingface Alignment handbook DeepSpeed Multi Node
- Environment:Pola rs Polars Python Runtime Environment
- Environment:OWASP Www project top 10 for large language model applications Pydantic Invoice Agent Runtime
- Environment:Promptfoo Promptfoo Node Runtime
- Environment:Intel Ipex llm Pipeline Parallel Environment
- Environment:Openai Openai python Python 3 9 Plus
- Environment:Huggingface Datasets Python PyArrow Core