Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:ARISE Initiative Robomimic Training Policy From Demonstrations
- Workflow:Mistralai Client python Function Calling
- Workflow:Togethercomputer Together python Batch Inference
- Workflow:DataTalksClub Data engineering zoomcamp Kestra ETL Pipeline
- Workflow:Mlflow Mlflow Model Logging and Registry
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:Marker Inc Korea AutoRAG Data Creation Pipeline
- Workflow:Hpcaitech ColossalAI Supervised Finetuning
- Workflow:Anthropics Anthropic sdk python Extended Thinking Reasoning
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
Principles
- Principle:Junyanz Pytorch CycleGAN and pix2pix Dataset Acquisition
- Principle:Truera Trulens LangChain App Wrapping
- Principle:Interpretml Interpret Score Tensor Harmonization
- Principle:DataTalksClub Data engineering zoomcamp Kestra Table Creation
- Principle:Apache Dolphinscheduler RPC Server Handler
- Principle:Ggml org Ggml WebGPU Computation
- Principle:Datahub project Datahub Recipe Configuration
- Principle:HKUDS AI Trader Price Data Fetching
- Principle:Kubeflow Pipelines XGBoost Model Prediction
- Principle:Mage ai Mage ai API Stream Discovery
Implementations
- Implementation:Tencent Ncnn NMS Sorted Bboxes
- Implementation:Facebookresearch Habitat lab SimpleCNN
- Implementation:FlowiseAI Flowise CreateNewAPI
- Implementation:Evidentlyai Evidently Legacy Recsys Preset
- Implementation:Duckdb Duckdb TDigest
- Implementation:Datahub project Datahub RestEmitter Create
- Implementation:Evidentlyai Evidently Legacy Generate Column Metrics
- Implementation:Kserve Kserve InferenceRouterType Enum
- Implementation:Microsoft DeepSpeedExamples DeepSpeed Save Checkpoint
- Implementation:Lucidrains X transformers DPO Policy Model Evaluation
Heuristics
- Heuristic:Apache Kafka JVM GC Tuning Defaults
- Heuristic:AUTOMATIC1111 Stable diffusion webui VRAM Management Strategies
- Heuristic:Getgauge Taiko Browser Launch Flags
- Heuristic:Marker Inc Korea AutoRAG Deterministic Evaluation Generation
- Heuristic:Pyro ppl Pyro MCMC Warmup Adaptation
- Heuristic:Snorkel team Snorkel Binary Only Slicing
- Heuristic:Roboflow Rf detr Layer Wise LR Decay
- Heuristic:Mage ai Mage ai Sorted Data Bookmark Strategy
- Heuristic:ClickHouse ClickHouse Jemalloc Production Requirement
- Heuristic:Getgauge Taiko Element Actionability Checks
Environments
- Environment:Deepset ai Haystack HuggingFace Model Environment
- Environment:OpenHands OpenHands Third Party Runtime Credentials
- Environment:Datajuicer Data juicer LLM API Credentials Environment
- Environment:Kubeflow Kubeflow Python KFP SDK Environment
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:NVIDIA DALI CMake Build Environment
- Environment:Lm sys FastChat Python Core Dependencies
- Environment:Microsoft BIPIA Python CUDA GPU Environment
- Environment:Recommenders team Recommenders Spark Environment
- Environment:Google deepmind Dm control GLFW Desktop Rendering