Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:OpenRLHF OpenRLHF DPO Training
- Workflow:VainF Torch Pruning LLM Structural Pruning
- Workflow:DataTalksClub Data engineering zoomcamp Kafka Stream Processing
- Workflow:Sktime Pytorch forecasting DeepAR Probabilistic Forecasting
- Workflow:Deepspeedai DeepSpeed AutoTP Training
- Workflow:CARLA simulator Carla Autonomous Navigation
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
- Workflow:LLMBook zh LLMBook zh github io LoRA Finetuning
- Workflow:MarketSquare Robotframework browser Browser Test Authoring
- Workflow:Togethercomputer Together python Chat Completion
Principles
- Principle:Webdriverio Webdriverio XPath Processing
- Principle:Groq Groq python Transcription Result Extraction
- Principle:Neuml Txtai Model Integration
- Principle:Scikit learn Scikit learn Clustering
- Principle:Facebookresearch Audiocraft Audio Token Decoding
- Principle:Datajuicer Data juicer Statistics Key Definition
- Principle:Mistralai Client python SDK Installation
- Principle:Huggingface Optimum Export Validation
- Principle:TobikoData Sqlmesh Production Promotion
- Principle:Vllm project Vllm Structured Output Configuration
Implementations
- Implementation:Openai Openai python Websocket Connection Options
- Implementation:Google deepmind Dm control MuJoCo Profiling Wrapper
- Implementation:Apache Paimon Lint Python Script
- Implementation:Apache Druid DoctorDialog
- Implementation:Huggingface Diffusers DiffusionPipeline From Pretrained
- Implementation:Scikit learn Scikit learn RandomForestClassifier Init
- Implementation:Onnx Onnx Net Drawer
- Implementation:Ucbepic Docetl Docker Compose Launch
- Implementation:FlagOpen FlagEmbedding BGE M3 Modeling
- Implementation:Roboflow Rf detr RFDETR Predict
Heuristics
- Heuristic:Puppeteer Puppeteer Timeout Hierarchy
- Heuristic:Google deepmind Dm control Tolerance Reward Tuning
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:Vespa engine Vespa Config Polling Timeout Tuning
- Heuristic:Nautechsystems Nautilus trader Cache Buffer Interval Tuning
- Heuristic:Lakeraai Pint benchmark Chunking Stride 25 Percent Overlap
- Heuristic:Onnx Onnx Opset Version Selection
- Heuristic:Huggingface Datatrove Thundering Herd Prevention
- Heuristic:Eventual Inc Daft Delta Lake S3 Locking
- Heuristic:Sktime Pytorch forecasting Early Stopping Patience
Environments
- Environment:EvolvingLMMs Lab Lmms eval Server Mode Environment
- Environment:Huggingface Alignment handbook Python Transformers
- Environment:Vibrantlabsai Ragas LLM Provider Credentials
- Environment:Sgl project Sglang Performance Dashboard
- Environment:Langgenius Dify Docker Deployment Environment
- Environment:Protectai Llm guard Python Runtime Dependencies
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:Triton inference server Server TRT LLM Deployment
- Environment:Mbzuai oryx Awesome LLM Post training Python Matplotlib
- Environment:Kubeflow Pipelines Kubernetes Cluster