Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft Autogen Multi Agent Conversation
- Workflow:DistrictDataLabs Yellowbrick Feature Analysis and Selection
- Workflow:Kserve Kserve LLM Inference Serving
- Workflow:Datajuicer Data juicer Distributed Ray Processing
- Workflow:NVIDIA NeMo Aligner RLHF PPO Training
- Workflow:Huggingface Open r1 GRPO Reasoning Training
- Workflow:Dagster io Dagster Bluesky Analytics
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:Treeverse LakeFS Garbage Collection
- Workflow:Speechbrain Speechbrain Text to Speech Training
Principles
- Principle:Datahub project Datahub Client Authentication
- Principle:Eventual Inc Daft Descriptive Statistics
- Principle:Datahub project Datahub Sample Data Loading
- Principle:Gretelai Gretel synthetics Synthetic Data Quality Evaluation
- Principle:Datahub project Datahub Docker Prerequisites
- Principle:Tensorflow Tfjs Model Compilation
- Principle:Apache Paimon Atomic Commit
- Principle:Ggml org Llama cpp Draft Model Loading
- Principle:Liu00222 Open Prompt Injection Configuration Loading
- Principle:Mlflow Mlflow Parameter Logging
Implementations
- Implementation:Ollama Ollama Imagegen MLX Go
- Implementation:BerriAI Litellm Redact Messages
- Implementation:Openai Openai node Translations Resource
- Implementation:Openai Openai node ChatCompletionStream Class
- Implementation:Open compass VLMEvalKit Omni Verifier
- Implementation:Evidentlyai Evidently LLM Judge Descriptors
- Implementation:LMCache LMCache Connector V1
- Implementation:Huggingface Transformers Benchmark V1 Runner
- Implementation:CrewAIInc CrewAI RAG Tool
- Implementation:Predibase Lorax Triton LibEntry
Heuristics
- Heuristic:Datahub project Datahub Validation Cross API
- Heuristic:Apache Spark Memory Tuning Tips
- Heuristic:Spcl Graph of thoughts Scoring With Error Counting
- Heuristic:Apache Paimon Vector Index Configuration Tips
- Heuristic:Mistralai Client python Stream File Uploads
- Heuristic:Romsto Speculative Decoding Ngram Order Selection
- Heuristic:Gretelai Gretel synthetics Memory Chunking For Normalization
- Heuristic:Huggingface Datatrove Gopher Quality Thresholds
- Heuristic:Apache Beam Lock Contention Batching
- Heuristic:OWASP Www project top 10 for large language model applications Deliberately Insecure Code Isolation
Environments
- Environment:Interpretml Interpret Native Libebm Environment
- Environment:Huggingface Open r1 Slurm Cluster
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Deepspeedai DeepSpeed CPU Environment
- Environment:Deepspeedai DeepSpeed Multi Accelerator Environment
- Environment:Vllm project Vllm Environment Variables
- Environment:Facebookresearch Habitat lab SLURM Distributed Environment
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:Alibaba ROLL vLLM Inference Environment
- Environment:Kubeflow Kubeflow Kubernetes Cluster Environment