Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Iterative Dvc Pipeline Reproduction
- Workflow:Shiyu coder Kronos Batch Prediction
- Workflow:Duckdb Duckdb Building From Source
- Workflow:PacktPublishing LLM Engineers Handbook Feature Engineering
- Workflow:Farama Foundation Gymnasium RL Agent Training Loop
- Workflow:Triton inference server Server Model Performance Tuning
- Workflow:Huggingface Alignment handbook Multi Stage Post Training
- Workflow:DataExpert io Data engineer handbook Dimensional Data Modeling Environment Setup
- Workflow:Lakeraai Pint benchmark Custom Dataset Benchmarking
- Workflow:MaterializeInc Materialize dbt Integration
Principles
- Principle:Microsoft Onnxruntime Source Framework Training
- Principle:Google deepmind Mujoco Constraint Solving
- Principle:Pytorch Serve Environment Setup
- Principle:Ollama Ollama GGUF Model Conversion Mistral Causal
- Principle:Huggingface Datatrove Token Statistics
- Principle:AUTOMATIC1111 Stable diffusion webui Attention Optimization
- Principle:InternLM Lmdeploy Generation Configuration
- Principle:OpenRLHF OpenRLHF DeepSpeed Checkpoint Conversion
- Principle:Webdriverio Webdriverio Reporter Pattern
- Principle:Allenai Open instruct Beaker Experiment Launch
Implementations
- Implementation:FMInference FlexLLMGen DeepSpeed Autotuning Utils
- Implementation:Online ml River Tree Splitter QO
- Implementation:ContextualAI HALOs BradleyTerryTrainer Train
- Implementation:Lance format Lance ListEncoding
- Implementation:Lance format Lance Commit Compaction
- Implementation:Mistralai Client python Message Models
- Implementation:Apache Druid Website Redirects
- Implementation:Ray project Ray Deploy Jars
- Implementation:Ggml org Llama cpp Chat Peg Parser Header
- Implementation:PrefectHQ Prefect PrefectDbtRunner Invoke
Heuristics
- Heuristic:Allenai Open instruct NCCL CUMEM Disable
- Heuristic:PeterL1n BackgroundMattingV2 ONNX Patch Method Compatibility
- Heuristic:Ggml org Ggml Gradient Accumulation Batch Sizing
- Heuristic:Huggingface Alignment handbook Liger Kernel Memory
- Heuristic:Dotnet Machinelearning AutoML SMAC Dimension Limit
- Heuristic:Tensorflow Serving Warning Deprecated CreateTfrtSavedModel Raw
- Heuristic:Huggingface Optimum Device Offload Constraints
- Heuristic:Microsoft Agent framework PowerFx Python Version Limit
- Heuristic:Iterative Dvc Shell Execution Pitfalls
- Heuristic:Groq Groq python Streaming Usage Stats
Environments
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:MarketSquare Robotframework browser Python Runtime
- Environment:Arize ai Phoenix OpenTelemetry SDK
- Environment:SqueezeAILab ETS Multi GPU Sglang Runtime
- Environment:Huggingface Datasets TensorFlow Integration
- Environment:Apache Airflow Docker Container Environment
- Environment:LMCache LMCache VLLM Serving Engine
- Environment:FlagOpen FlagEmbedding Finetuning Environment
- Environment:Predibase Lorax Docker Container Runtime
- Environment:Astronomer Astronomer cosmos Python Airflow Runtime