Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:LaurentMazare Tch rs Pretrained Image Classification
- Workflow:SeleniumHQ Selenium Chrome DevTools Protocol Integration
- Workflow:Puppeteer Puppeteer Cross Browser Automation
- Workflow:Ray project Ray Cross Language Invocation
- Workflow:Snorkel team Snorkel Data Augmentation
- Workflow:Online ml River Drift Adaptive Classification
- Workflow:Avdvg InjectGuard Vector Similarity Detection Pipeline
- Workflow:Snorkel team Snorkel Slice Aware Training
- Workflow:Guardrails ai Guardrails LLM Output Validation
Principles
- Principle:ARISE Initiative Robosuite Composite Object Construction
- Principle:Tensorflow Serving Version Policy Configuration
- Principle:Haosulab ManiSkill Agent Controller Architecture
- Principle:AUTOMATIC1111 Stable diffusion webui Hypernetwork dataset preparation
- Principle:MarketSquare Robotframework browser Installation Path Selection
- Principle:Haosulab ManiSkill Robot Agent Definition
- Principle:Microsoft Playwright Verify Expectations with Agent
- Principle:FMInference FlexLLMGen Dataset Loading And Serialization
- Principle:LLMBook zh LLMBook zh github io Reward Modeling
- Principle:Huggingface Diffusers DreamBooth Export
Implementations
- Implementation:Datahub project Datahub SchemaTronDataHubType
- Implementation:Triton inference server Server Tritonclient Infer
- Implementation:Teamcapybara Capybara Spec Server
- Implementation:Pyro ppl Pyro ProvenanceTensor
- Implementation:Evidentlyai Evidently Legacy Semantic Similarity Feature
- Implementation:Openai Evals MMLU Eval Config
- Implementation:Datahub project Datahub WriteToDataSourceV2Visitor
- Implementation:SeleniumHQ Selenium Closure Base
- Implementation:Bentoml BentoML IO Descriptor Multipart
- Implementation:Apache Paimon Ray Init
Heuristics
- Heuristic:Microsoft Autogen Name Uniqueness Constraints
- Heuristic:Eventual Inc Daft Delta Lake S3 Locking
- Heuristic:FMInference FlexLLMGen OOM Memory Management
- Heuristic:Zai org CogVideo Memory Optimization Strategies
- Heuristic:Speechbrain Speechbrain Gradient Clipping Strategy
- Heuristic:Scikit learn Scikit learn Feature Scaling Numerical Stability
- Heuristic:Apache Airflow Scheduler Performance Tuning
- Heuristic:Openai Whisper Log Probability Threshold
- Heuristic:DataTalksClub Data engineering zoomcamp GCS Upload Timeout Workaround
- Heuristic:Snorkel team Snorkel DataParallel Default Behavior
Environments
- Environment:Ollama Ollama Go Runtime
- Environment:OWASP Www project top 10 for large language model applications Pre Commit Hooks Environment
- Environment:Mlfoundations Open flamingo HuggingFace Open CLIP Dependencies
- Environment:Alibaba ROLL Python Runtime Environment
- Environment:Isaac sim IsaacGymEnvs Python CUDA Runtime
- Environment:Kubeflow Pipelines Python SDK
- Environment:Tensorflow Serving Kubernetes Deployment Environment
- Environment:Intel Ipex llm CPU Finetuning Environment
- Environment:CARLA simulator Carla Python API Runtime
- Environment:Google research Deduplicate text datasets Python TFDS Environment