Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ggml org Ggml MNIST Training And Evaluation
- Workflow:Scikit learn contrib Imbalanced learn SMOTE Resampling Pipeline
- Workflow:HKUDS AI Trader End to End US Stock Trading
- Workflow:Open compass VLMEvalKit Video Benchmark Evaluation
- Workflow:Openai CLIP Zero shot image classification
- Workflow:Infiniflow Ragflow Docker Deployment
- Workflow:Datahub project Datahub Metadata Ingestion Pipeline
- Workflow:Apache Spark Kubernetes Deployment
- Workflow:Apache Dolphinscheduler Datasource Plugin Development
- Workflow:Neuml Txtai Agent Orchestration
Principles
- Principle:Eric mitchell Direct preference optimization Training Loop
- Principle:Intel Ipex llm QLoRA Model Loading
- Principle:Huggingface Trl PPO Trainer Initialization
- Principle:Guardrails ai Guardrails Stream Result Handling
- Principle:Kserve Kserve Controller Deployment
- Principle:Mistralai Client python Stream Event Processing
- Principle:Google deepmind Dm control Manipulation Visualization
- Principle:Eventual Inc Daft Data Preprocessing Regex Extraction
- Principle:Eric mitchell Direct preference optimization Log Probability Extraction
- Principle:Facebookresearch Habitat lab Task Dataset Selection
Implementations
- Implementation:Alibaba MNN FlatBuffers Reflection Generated
- Implementation:FlagOpen FlagEmbedding RetroMAE EnhancedDecoder
- Implementation:LMCache LMCache LRU Evictor
- Implementation:Huggingface Datatrove RegexFilter
- Implementation:Predibase Lorax Base Model
- Implementation:Ggml org Ggml Cpu amx mmq
- Implementation:Liu00222 Open Prompt Injection compute conditional probability
- Implementation:ArroyoSystems Arroyo Tumbling Window
- Implementation:Deepspeedai DeepSpeed ZeRO Init
- Implementation:Google deepmind Mujoco initOpenGL Pattern
Heuristics
- Heuristic:Vespa engine Vespa Document Batch Processing Strategy
- Heuristic:Gretelai Gretel synthetics Mixed Precision Training Tradeoff
- Heuristic:Pola rs Polars Collect All For Diverging Queries
- Heuristic:Danijar Dreamerv3 Percentile Return Normalization
- Heuristic:Kserve Kserve Prefix Cache Consistency
- Heuristic:Tensorflow Tfjs WASM Cross Origin Isolation
- Heuristic:Langfuse Langfuse LLM Rate Limit 24h Abandon
- Heuristic:ContextualAI HALOs TF32 Matmul Acceleration
- Heuristic:NVIDIA DALI Last Batch Policy Selection
- Heuristic:LLMBook zh LLMBook zh github io DPO Beta Hyperparameter
Environments
- Environment:Vllm project Vllm Environment Variables
- Environment:Huggingface Datatrove IO Dependencies
- Environment:Cohere ai Cohere python Python SDK Runtime
- Environment:Vespa engine Vespa FNET Transport Config
- Environment:Axolotl ai cloud Axolotl Multi GPU
- Environment:Unslothai Unsloth Llama Cpp
- Environment:PrefectHQ Prefect Prefect Server Database
- Environment:Treeverse LakeFS S3 Gateway Test Environment
- Environment:Run llama Llama index Sentence Transformers Finetuning
- Environment:Googleapis Python genai Python 3 10 SDK Runtime