Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Datahub project Datahub Protobuf Schema Ingestion
- Workflow:Arize ai Phoenix Prompt Management Pipeline
- Workflow:Ggml org Ggml GPT2 Text Generation
- Workflow:ArroyoSystems Arroyo SQL Pipeline Lifecycle
- Workflow:Datahub project Datahub CLI Metadata Ingestion
- Workflow:Openai Evals Running an eval set
- Workflow:Scikit learn contrib Imbalanced learn SMOTE Resampling Pipeline
- Workflow:Alibaba ROLL DPO Training Pipeline
- Workflow:Microsoft Playwright Network mocking and interception
- Workflow:Cohere ai Cohere python Model Finetuning
Principles
- Principle:Tensorflow Serving Version Label Routing
- Principle:Snorkel team Snorkel Labeling Function Analysis
- Principle:OpenRLHF OpenRLHF Optimizer and Scheduler Setup
- Principle:Speechbrain Speechbrain Conventional Enhancement Training
- Principle:Googleapis Python genai Image Editing
- Principle:Turboderp org Exllamav2 Image Embedding Extraction
- Principle:Zai org CogVideo Caption Output
- Principle:Allenai Open instruct DPO Loss Dispatch
- Principle:NVIDIA DALI Spatial Augmentation Detection
- Principle:Marker Inc Korea AutoRAG Configuration Loading
Implementations
- Implementation:Tensorflow Serving Json Tensor Test
- Implementation:SeldonIO Seldon core Seldon Pipeline CRD Explainer
- Implementation:Datajuicer Data juicer ImageDetectionYoloMapper
- Implementation:MarketSquare Robotframework browser Data Types
- Implementation:Huggingface Datatrove DummyInferenceServer
- Implementation:BerriAI Litellm JSON Validation Rule
- Implementation:Triton inference server Server L0 Memory Growth Test
- Implementation:CARLA simulator Carla Client Set Replayer Time Factor
- Implementation:FlagOpen FlagEmbedding Matryoshka Mistral Model Inference
- Implementation:OpenGVLab InternVL CLIPVisionTower
Heuristics
- Heuristic:ARISE Initiative Robomimic Checkpoint Selection Strategy
- Heuristic:Deepseek ai Janus CFG Weight Tuning
- Heuristic:Speechbrain Speechbrain GAN Dual Optimizer Pattern
- Heuristic:BerriAI Litellm Retry Backoff Strategy
- Heuristic:PacktPublishing LLM Engineers Handbook Dataset Generation Quality Filters
- Heuristic:Bentoml BentoML Warning Deprecated Server Module
- Heuristic:HKUDS AI Trader Linear Retry Backoff
- Heuristic:Facebookresearch Habitat lab VER Tuning Guidelines
- Heuristic:NVIDIA NeMo Curator Deduplication Blocksize Tuning
- Heuristic:Avhz RustQuant Discretization Scheme Selection
Environments
- Environment:Hiyouga LLaMA Factory FP8 Training Environment
- Environment:Deepspeedai DeepSpeed NVMe Environment
- Environment:Huggingface Alignment handbook BitsAndBytes CUDA
- Environment:ArroyoSystems Arroyo Kubernetes Deployment
- Environment:Huggingface Datasets Search Dependencies
- Environment:Dotnet Machinelearning Dotnet SDK And Runtime
- Environment:Neuml Txtai GPU Accelerator Environment
- Environment:Hiyouga LLaMA Factory Core Python GPU Environment
- Environment:Heibaiying BigData Notes HBase Environment
- Environment:Mlc ai Mlc llm Metal macOS iOS Environment