Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki: best practices and expert-level knowledge for machine learning and data engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Kserve Kserve InferenceGraph Pipeline
- Workflow:Dotnet Machinelearning AutoML Experiment
- Workflow:Microsoft DeepSpeedExamples VisualChat Multimodal Training
- Workflow:Sgl project Sglang Frontend Language Multi Turn Chat
- Workflow:FMInference FlexLLMGen Single GPU Offloaded Inference
- Workflow:Volcengine Verl Data Preprocessing For RL
- Workflow:SqueezeAILab ETS ETS Experiment Pipeline
- Workflow:Pytorch Serve Large Model Inference
- Workflow:ContextualAI HALOs Online Iterative Alignment
- Workflow:Cleanlab Cleanlab Classification Label Issue Detection
Principles
- Principle:Romsto Speculative Decoding Logits Processing
- Principle:Norrrrrrr lyn WAInjectBench Supervised Training Loop
- Principle:Lucidrains X transformers Non Autoregressive Wrapper Setup
- Principle:Fede1024 Rust rdkafka Concurrent Worker Scaling
- Principle:Langchain ai Langchain Package Configuration
- Principle:Ollama Ollama Local Manifest Management
- Principle:Intel Ipex llm NPU Multimodal Inference
- Principle:Online ml River Online Probability Distributions
- Principle:Huggingface Datatrove Token Statistics
- Principle:Webdriverio Webdriverio WebDriver Bidi Protocol
Implementations
- Implementation:Vllm project Vllm Test AMD Pipeline Config
- Implementation:Google deepmind Mujoco mj saveLastXML
- Implementation:Langchain ai Langgraph Storm Example
- Implementation:Google deepmind Dm control Suite Walker
- Implementation:LMCache LMCache Mem Alloc
- Implementation:CARLA simulator Carla Map API Spec
- Implementation:Ollama Ollama Llama Model BailingMoE
- Implementation:Huggingface Transformers Check Repo
- Implementation:Datajuicer Data juicer Service API
- Implementation:Online ml River Datasets HTTP
Heuristics
- Heuristic:Kserve Kserve Server Side Apply For CRDs
- Heuristic:Huggingface Datatrove Gopher Quality Thresholds
- Heuristic:Alibaba ROLL Gradient Checkpointing Recomputation
- Heuristic:Vespa engine Vespa RPM Zstd Compression Settings
- Heuristic:Protectai Llm guard Token Limit Early Guard
- Heuristic:OWASP Www project top 10 for large language model applications Deliberately Insecure Code Isolation
- Heuristic:Norrrrrrr lyn WAInjectBench LoRA Rank Alpha Selection
- Heuristic:Avdvg InjectGuard Embedding Normalization Cosine Equivalence
- Heuristic:DevExpress Testcafe MacOS Browser Launch Serialization
- Heuristic:Wandb Weave Batch Processing Tuning
Environments
- Environment:TA Lib Ta lib python TA Lib C Library
- Environment:Huggingface Datatrove Processing Dependencies
- Environment:Junyanz Pytorch CycleGAN and pix2pix DDP Multi GPU
- Environment:Marker Inc Korea AutoRAG VLLM Environment
- Environment:Ray project Ray CI Build Matrix Environment
- Environment:Mlflow Mlflow OpenAI LLM Integration Environment
- Environment:Haotian liu LLaVA Python CUDA Training Environment
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Heibaiying BigData Notes Java 8 Maven Environment
- Environment:Langgenius Dify Python Backend Environment