Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:Haifengl Smile Nearest Neighbor Search
- Workflow:Huggingface Trl Reward Model Training
- Workflow:Vibrantlabsai Ragas Custom Metric Creation
- Workflow:Huggingface Trl Direct Preference Optimization
- Workflow:Apache Spark Release Process
- Workflow:Anthropics Anthropic sdk python Basic Message Conversation
- Workflow:Apache Druid SQL Based Data Ingestion
- Workflow:AUTOMATIC1111 Stable diffusion webui Text to image generation
- Workflow:Online ml River Online Clustering
Principles
- Principle:Microsoft Autogen Result Aggregation
- Principle:PeterL1n BackgroundMattingV2 Pretrained weight transfer
- Principle:Turboderp org Exllamav2 Model Weight Loading
- Principle:Onnx Onnx External Data Loading
- Principle:Mlc ai Mlc llm On Device Deployment
- Principle:Bitsandbytes foundation Bitsandbytes 4bit Quantization Lookup Tables
- Principle:Cleanlab Cleanlab Issue Summarization
- Principle:AUTOMATIC1111 Stable diffusion webui Weight merging execution
- Principle:Huggingface Peft AdaLoRA Rank Allocation
- Principle:Apache Druid Source Connection
Implementations
- Implementation:Pyro ppl Pyro TorchDistributionMixin
- Implementation:Lance format Lance JNI Fragment
- Implementation:PacktPublishing LLM Engineers Handbook HfApi Model Info
- Implementation:Langchain ai Langchain Pyproject Toml Configuration
- Implementation:Infiniflow Ragflow Agent Constants
- Implementation:Infiniflow Ragflow FilesHooks
- Implementation:Nautechsystems Nautilus trader ParquetDataCatalog Write Data
- Implementation:Google deepmind Dm control Blender Scene
- Implementation:Tensorflow Tfjs Wrappers Test
- Implementation:Infiniflow Ragflow AddedSourceCard Component
Heuristics
- Heuristic:OpenBMB UltraFeedback API Retry Strategy
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Hiyouga LLaMA Factory Mixed Precision Training Tips
- Heuristic:Astronomer Astronomer cosmos Deprecation Migration Paths
- Heuristic:Getgauge Taiko Implicit Wait Tuning
- Heuristic:Datahub project Datahub Validation Across All APIs
- Heuristic:Cypress io Cypress V8 Snapshot Memory
- Heuristic:Wandb Weave Payload Size Limits
- Heuristic:OpenBMB UltraFeedback Principle Distribution Tuning
- Heuristic:Risingwavelabs Risingwave Source Backoff Strategy
Environments
- Environment:Dotnet Machinelearning Native Build Toolchain
- Environment:Snorkel team Snorkel PyTorch
- Environment:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Python Environment
- Environment:Recommenders team Recommenders Python Core Dependencies
- Environment:Marker Inc Korea AutoRAG Vector Database Backends
- Environment:Onnx Onnx Python Runtime Environment
- Environment:Langchain ai Langchain Anthropic API Credentials
- Environment:Bigscience workshop Petals Python Transformers
- Environment:Microsoft DeepSpeedExamples CIFAR10 Training Environment
- Environment:DistrictDataLabs Yellowbrick Optional NLP Dependencies