Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft Semantic kernel Process Orchestration
- Workflow:Intel Ipex llm QLoRA Finetuning
- Workflow:NVIDIA NeMo Curator Video Curation Pipeline
- Workflow:Protectai Llm guard PII Anonymization Deanonymization
- Workflow:PrefectHQ Prefect AI Data Analyst Agent
- Workflow:Openai Openai agents python Tool Integrated Agent
- Workflow:Arize ai Phoenix LLM Evaluation Pipeline
- Workflow:Cleanlab Cleanlab Datalab Dataset Audit
- Workflow:Openai Openai python Chat Completion
- Workflow:CarperAI Trlx RLHF Dialogue Alignment
Principles
- Principle:Kserve Kserve PD Scheduler Routing
- Principle:Hpcaitech ColossalAI GRPO Reward Configuration
- Principle:Tensorflow Serving Model Training
- Principle:Hiyouga LLaMA Factory Proximal Policy Optimization
- Principle:Ggml org Llama cpp Logging System
- Principle:Datajuicer Data juicer Operator Dependency Management
- Principle:Huggingface Datasets Audio Feature Handling
- Principle:Scikit learn contrib Imbalanced learn Value Difference Metric
- Principle:Neuml Txtai Benchmark Evaluation
- Principle:SqueezeAILab ETS ILP Node Selection
Implementations
- Implementation:Duckdb Duckdb Interpreted Benchmark
- Implementation:Apache Druid SqlKeywords
- Implementation:Eventual Inc Daft DataFrame Write Iceberg
- Implementation:NVIDIA TransformerEngine NVFP4 Storage
- Implementation:Langchain ai Langchain BaseRateLimiter Acquire
- Implementation:EvolvingLMMs Lab Lmms eval Open ASR Utils
- Implementation:Mlc ai Mlc llm Medusa Model
- Implementation:DistrictDataLabs Yellowbrick ClassificationScoreVisualizer
- Implementation:Openai Openai node Beta Realtime Resource
- Implementation:Nautechsystems Nautilus trader Pyproject Configuration
Heuristics
- Heuristic:NVIDIA NeMo Curator Video Frame Sampling Strategy
- Heuristic:Togethercomputer Together python Repetition Penalty Conflict
- Heuristic:Trailofbits Fickling Injection Mode Selection
- Heuristic:Confident ai Deepeval Async Concurrency Tuning
- Heuristic:Alibaba MNN NC4HW4 Data Layout
- Heuristic:Deepseek ai Janus CFG Weight Tuning
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:Ggml org Ggml Gradient Accumulation Batch Sizing
- Heuristic:CARLA simulator Carla PID Controller Tuning
- Heuristic:Mlc ai Web llm Service Worker Keep Alive
Environments
- Environment:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Python Environment
- Environment:Volcengine Verl Python Core Dependencies
- Environment:Togethercomputer Together python API Credentials
- Environment:Zai org CogVideo Diffusers Finetuning Environment
- Environment:Mbzuai oryx Awesome LLM Post training Python Pandas
- Environment:Kubeflow Pipelines Kubernetes Cluster
- Environment:Langchain ai Langchain LangSmith Tracing Config
- Environment:CarperAI Trlx NeMo Megatron
- Environment:Scikit learn contrib Imbalanced learn Python Scikit learn
- Environment:Mlfoundations Open flamingo PyTorch CUDA Distributed