Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mlc ai Web llm Text Embeddings And RAG
- Workflow:Kubeflow Pipelines XGBoost Training Pipeline
- Workflow:Ggml org Llama cpp Interactive Chat
- Workflow:DataExpert io Data engineer handbook Flink Kafka Streaming Pipeline
- Workflow:ChenghaoMou Text dedup SimHash Deduplication
- Workflow:Huggingface Datatrove Common Crawl Processing
- Workflow:Google research Deduplicate text datasets Cross dataset deduplication
- Workflow:MaterializeInc Materialize CI Pipeline Generation
- Workflow:Allenai Open instruct SFT Finetuning
- Workflow:Mit han lab Llm awq TinyChat LLM Deployment
Principles
- Principle:Unslothai Unsloth AIME Evaluation
- Principle:Gretelai Gretel synthetics Conditional Data Sampling
- Principle:OpenGVLab InternVL Streamlit Chat Interface
- Principle:Ollama Ollama HardwareDiscovery
- Principle:Predibase Lorax Continuous Batching Inference
- Principle:Heibaiying BigData Notes MapReduce Job Assembly
- Principle:Liu00222 Open Prompt Injection Known Answer Detection
- Principle:Puppeteer Puppeteer Screenshot Capture
- Principle:BerriAI Litellm Server Startup
- Principle:Mlflow Mlflow Trace Assessment
Implementations
- Implementation:Online ml River Time Series HoltWinters
- Implementation:NVIDIA NeMo Curator WARC Iterator
- Implementation:FMInference FlexLLMGen DeepSpeed Autotuning Utils
- Implementation:Iterative Dvc Lock
- Implementation:Google deepmind Mujoco MJX Warp Collision Driver
- Implementation:Bentoml BentoML Image Builder
- Implementation:Ray project Ray Repro CI Tool
- Implementation:Onnx Onnx BaseConverter Class
- Implementation:Microsoft Playwright DevtoolsController
- Implementation:Cohere ai Cohere python FinetunedModel Settings
Heuristics
- Heuristic:Zai org CogVideo Frame Count and Resolution Constraints
- Heuristic:CrewAIInc CrewAI LLM Provider Message Workarounds
- Heuristic:LMCache LMCache Chunk Size And Default Config
- Heuristic:Heibaiying BigData Notes Kafka Consumer Offset Strategy Tip
- Heuristic:Norrrrrrr lyn WAInjectBench NaN Inf Fallback FP32 Recovery
- Heuristic:Lucidrains X transformers Sampling Temperature Strategy
- Heuristic:ARISE Initiative Robomimic BatchNorm To GroupNorm For EMA
- Heuristic:PrefectHQ Prefect Task Timeout Thread Limitation
- Heuristic:Kserve Kserve Prefix Cache Consistency
- Heuristic:Apache Paimon Compression Tuning
Environments
- Environment:Cleanlab Cleanlab Python Core Environment
- Environment:Microsoft Onnxruntime CUDA GPU Environment
- Environment:Iterative Dvc Python Runtime
- Environment:Openai Openai agents python Voice Dependencies
- Environment:Wandb Weave Trace Server Infrastructure
- Environment:NVIDIA NeMo Curator NVIDIA DALI
- Environment:OpenGVLab InternVL PEFT LoRA
- Environment:Hiyouga LLaMA Factory Optional Inference Backends
- Environment:MaterializeInc Materialize Kubernetes Helm Runtime
- Environment:Webdriverio Webdriverio Cloud Service Credentials