Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Datahub project Datahub Docker Quickstart Deployment
- Workflow:Huggingface Datatrove Synthetic Data Generation
- Workflow:Rapidsai Cuml Random Forest Training And Inference
- Workflow:Zai org CogVideo SAT Finetuning
- Workflow:Ggml org Llama cpp Model Perplexity Evaluation
- Workflow:Apache Flink Async Sink Lifecycle
- Workflow:Lucidrains X transformers DPO Preference Alignment
- Workflow:Langgenius Dify Visual Workflow Builder
- Workflow:AUTOMATIC1111 Stable diffusion webui Image to image generation
- Workflow:DataTalksClub Data engineering zoomcamp Spark Batch Processing
Principles
- Principle:Testtimescaling Testtimescaling github io Badge Data Generation
- Principle:Google deepmind Mujoco Restrict Optimization
- Principle:Langgenius Dify SideEffectHooks
- Principle:CarperAI Trlx PPO Configuration
- Principle:Kubeflow Pipelines Iterative Training Termination
- Principle:MaterializeInc Materialize Upgrade Validation
- Principle:Run llama Llama index Batch Evaluation Setup
- Principle:Marker Inc Korea AutoRAG Trial Summary And Dashboard
- Principle:Microsoft Semantic kernel Event Routing
- Principle:PacktPublishing LLM Engineers Handbook Document Cleaning
Implementations
- Implementation:Huggingface Datatrove TypesHelper
- Implementation:Allenai Open instruct HFDataLoader
- Implementation:Onnx Onnx Compose Merge Models
- Implementation:ClickHouse ClickHouse Musl Log Data
- Implementation:Openclaw Openclaw ResolveSandboxConfigForAgent
- Implementation:AUTOMATIC1111 Stable diffusion webui SD3 Supporting Models
- Implementation:Togethercomputer Together python Check File
- Implementation:Neuml Txtai Hub Cloud
- Implementation:Run llama Llama index LLM Utils
- Implementation:Risingwavelabs Risingwave NoDataRecoverySnapshotter
Heuristics
- Heuristic:Googleapis Python genai LRO Polling Backoff
- Heuristic:Ggml org Ggml Thread Count Selection
- Heuristic:Trailofbits Fickling Severity Threshold Selection
- Heuristic:Microsoft Semantic kernel Experimental Feature Opt In
- Heuristic:OpenGVLab InternVL Loss Reduction Strategy
- Heuristic:Tensorflow Serving Model Warmup Strategy
- Heuristic:CARLA simulator Carla Sensor Queue Synchronization Pattern
- Heuristic:Intel Ipex llm QLoRA Training Hyperparameters
- Heuristic:Puppeteer Puppeteer Navigation Race Condition Avoidance
- Heuristic:PrefectHQ Prefect Retry Backoff Strategy
Environments
- Environment:Microsoft Onnxruntime Distributed Training Environment
- Environment:Huggingface Datasets Search Dependencies
- Environment:Apache Spark JDK Build Environment
- Environment:Norrrrrrr lyn WAInjectBench External Repos Dependencies
- Environment:Treeverse LakeFS Web UI Environment
- Environment:Langgenius Dify Credentials And Env Vars
- Environment:Dagster io Dagster GRPC Communication
- Environment:Allenai Open instruct Python 3 12 Runtime
- Environment:Huggingface Datatrove Python Runtime
- Environment:Mlflow Mlflow Python Runtime Environment