Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mage ai Mage ai Destination Data Loading
- Workflow:Kubeflow Kubeflow AI Lifecycle Pipeline
- Workflow:OWASP Www project top 10 for large language model applications Vulnerability Entry Development
- Workflow:Haosulab ManiSkill Sim2Real Deployment
- Workflow:CrewAIInc CrewAI Flow Based Orchestration
- Workflow:Protectai Llm guard Scanner Benchmarking
- Workflow:Datahub project Datahub Docker Quickstart Deployment
- Workflow:Deepspeedai DeepSpeed Pipeline Parallel Training
- Workflow:Haifengl Smile Matrix Decomposition Pipeline
- Workflow:Guardrails ai Guardrails Structured Data Generation
Principles
- Principle:SqueezeAILab ETS Answer Normalization And Grading
- Principle:Haosulab ManiSkill LeRobot Format Export
- Principle:Iterative Dvc Artifact Resolution
- Principle:AUTOMATIC1111 Stable diffusion webui Diffusion Sampling Methods
- Principle:Isaac sim IsaacGymEnvs Robot Controller Configuration
- Principle:ARISE Initiative Robosuite Operational Space Control
- Principle:ARISE Initiative Robomimic Train Validation Split
- Principle:ClickHouse ClickHouse Network Address Representation
- Principle:ArroyoSystems Arroyo Pipeline Shutdown
- Principle:Helicone Helicone Cost Computation
Implementations
- Implementation:Online ml River Optim Momentum
- Implementation:Infiniflow Ragflow Next Request
- Implementation:FlowiseAI Flowise ExecutionsListTable
- Implementation:AUTOMATIC1111 Stable diffusion webui SD3 Implementations
- Implementation:NVIDIA NeMo Curator RayActorPoolAdapter
- Implementation:Langfuse Langfuse Dataset Run Items Converters
- Implementation:Neuml Txtai NetworkX Graph
- Implementation:Infiniflow Ragflow Admin EmailForm Component
- Implementation:ClickHouse ClickHouse Decimal
- Implementation:FlowiseAI Flowise MarketplaceCanvasNode
Heuristics
- Heuristic:Duckdb Duckdb Test Development Guidelines
- Heuristic:Vibrantlabsai Ragas Warning Deprecated Legacy LLM Wrappers
- Heuristic:Google research Deduplicate text datasets Ulimit File Descriptors For Merge
- Heuristic:Iamhankai Forest of Thought UCB Exploration Constant
- Heuristic:Facebookresearch Habitat lab DDPPO Straggler Preemption
- Heuristic:CARLA simulator Carla Walker Spawn Vertical Offset
- Heuristic:Mlc ai Web llm Service Worker Keep Alive
- Heuristic:Wandb Weave Retry And Error Handling
- Heuristic:Microsoft LoRA LoRA Rank Selection
- Heuristic:Explodinggradients Ragas Embedding Batch Size Tuning
Environments
- Environment:Apache Kafka Docker Build Environment
- Environment:Googleapis Python genai Python 3 10 SDK Runtime
- Environment:Haosulab ManiSkill Motion Planning Deps
- Environment:Dagster io Dagster GRPC Communication
- Environment:LMCache LMCache Python Runtime
- Environment:Apache Paimon Optional Extensions
- Environment:Sgl project Sglang Distributed
- Environment:DataTalksClub Data engineering zoomcamp Dlt BigQuery Environment
- Environment:Protectai Llm guard API Server Deployment
- Environment:Tensorflow Serving Docker Runtime Environment