Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — full-stack AI/ML coding agent
- Leeroopedia MCP — knowledge search for any coding agent
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Deepspeedai DeepSpeed Inference Engine Optimization
- Workflow:Cohere ai Cohere python Model Finetuning
- Workflow:Deepset ai Haystack Document Indexing Pipeline
- Workflow:Nautechsystems Nautilus trader Backtest with BacktestEngine
- Workflow:NVIDIA DALI Image Preprocessing Pipeline
- Workflow:Hiyouga LLaMA Factory PPO RLHF Training
- Workflow:Hiyouga LLaMA Factory Model Export and Merging
- Workflow:Deepseek ai Janus Rectified Flow Image Generation
- Workflow:Interpretml Interpret Model Explanation And Visualization
- Workflow:ARISE Initiative Robomimic Hyperparameter Sweep
Principles
- Principle:Haifengl Smile Data Transformation
- Principle:Turboderp org Exllamav2 Device Management
- Principle:FlagOpen FlagEmbedding LLM Dense Retrieval Training
- Principle:Langchain ai Langchain Streaming Method Invocation
- Principle:Datajuicer Data juicer QA Optimization
- Principle:Webdriverio Webdriverio Service Launcher Pattern
- Principle:Run llama Llama index Index Persistence
- Principle:Protectai Llm guard Programming Language Detection
- Principle:Allenai Open instruct Project Dependency Management
- Principle:LaurentMazare Tch rs Sequential Model Definition
Implementations
- Implementation:DataExpert io Data engineer handbook Tumble Over Window
- Implementation:Online ml River Stream Iter Vaex
- Implementation:Facebookresearch Habitat lab ActionSpace
- Implementation:Ollama Ollama Llama KV Cache ISWA
- Implementation:Spcl Graph of thoughts AbstractLanguageModel
- Implementation:Ollama Ollama Imagegen ZImage Transformer
- Implementation:Googleapis Python genai Part From Uri And Bytes
- Implementation:CarperAI Trlx Default SFT Config
- Implementation:Hiyouga LLaMA Factory API Chat
- Implementation:FlagOpen FlagEmbedding BGE Finetune Modeling
Heuristics
- Heuristic:Dotnet Machinelearning Sparsity Threshold Optimization
- Heuristic:Huggingface Trl Gradient Checkpointing Use Reentrant
- Heuristic:Openai Openai agents python Sensitive Data Logging Defaults
- Heuristic:Google deepmind Mujoco MJX Benchmarking Tips
- Heuristic:Protectai Llm guard Token Limit Early Guard
- Heuristic:Pytorch Serve Ampere Tensor Core Optimization
- Heuristic:Huggingface Datatrove Gopher Quality Thresholds
- Heuristic:Duckdb Duckdb Build Parallelism Tuning
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:Gretelai Gretel synthetics Batch Size Divisibility Constraints
Environments
- Environment:Facebookresearch Habitat lab Python 3 9 Core Dependencies
- Environment:Microsoft DeepSpeedExamples CIFAR10 Training Environment
- Environment:Langfuse Langfuse ClickHouse Analytics
- Environment:Iterative Dvc Python Runtime
- Environment:Intel Ipex llm NPU Cpp Environment
- Environment:Arize ai Phoenix OpenTelemetry SDK
- Environment:Hiyouga LLaMA Factory Core Python GPU Environment
- Environment:Apache Kafka Docker Build Environment
- Environment:LMCache LMCache NIXL Transfer Library
- Environment:Rapidsai Cuml Dask Distributed