Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Isaac sim IsaacGymEnvs Factory Assembly Training
- Workflow:Langchain ai Langchain Adding Partner Integration
- Workflow:NVIDIA NeMo Aligner RLHF PPO Training
- Workflow:Openai Openai node Structured Output Parsing
- Workflow:OpenHands OpenHands GitHub Webhook Event Processing
- Workflow:Nightwatchjs Nightwatch E2E Test Authoring
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:Openai Openai python Chat Completion
- Workflow:Google research Deduplicate text datasets Suffix array querying
- Workflow:Openai CLIP Prompt engineered classification
Principles
- Principle:Pyro ppl Pyro Neural Module Registration
- Principle:Huggingface Transformers Device Mesh Topology
- Principle:Mlfoundations Open flamingo Optimizer And Scheduler Configuration
- Principle:Guardrails ai Guardrails Guard Validator Composition
- Principle:Pyro ppl Pyro Autoregressive Networks
- Principle:Mlflow Mlflow Database Schema Management
- Principle:Interpretml Interpret Global Explanation Generation
- Principle:Elevenlabs Elevenlabs python Voice Cloning
- Principle:VainF Torch Pruning Magnitude Importance
- Principle:Ggml org Llama cpp Partial JSON Healing
Implementations
- Implementation:DataTalksClub Data engineering zoomcamp Java Ride Data Model
- Implementation:Microsoft Playwright FfExecutionContext
- Implementation:Microsoft DeepSpeedExamples VisProjection
- Implementation:Datahub project Datahub RequiresMutable
- Implementation:Dotnet Machinelearning FastTree LambdaMART Derivatives
- Implementation:Sgl project Sglang CI Permissions Config
- Implementation:Mit han lab Llm awq VILA15 Demo
- Implementation:Apache Flink SplitFetcherTask
- Implementation:Risingwavelabs Risingwave SourceHandler Interface
- Implementation:Deepspeedai DeepSpeed XPU Adagrad
Heuristics
- Heuristic:Run llama Llama index Finetuning Warmup Steps
- Heuristic:NVIDIA NeMo Curator Semantic Dedup Cluster Sizing
- Heuristic:Mlfoundations Open flamingo KV Cache Classification Optimization
- Heuristic:Microsoft Playwright Test Stability Practices
- Heuristic:Microsoft Onnxruntime ORTModule Wrapping Order
- Heuristic:Apache Dolphinscheduler Datasource Cache Expiry
- Heuristic:Langchain ai Langgraph Stream Mode Selection
- Heuristic:Volcengine Verl Sequence Length Balancing
- Heuristic:Bitsandbytes foundation Bitsandbytes Blocksize Platform Defaults
- Heuristic:Romsto Speculative Decoding Ngram Order Selection
Environments
- Environment:CARLA simulator Carla Python API Runtime
- Environment:Microsoft Agent framework Core Package Dependencies
- Environment:Alibaba MNN HuggingFace Ecosystem Environment
- Environment:Farama Foundation Gymnasium Video Recording Dependencies
- Environment:AUTOMATIC1111 Stable diffusion webui GPU Compute Backend
- Environment:FlagOpen FlagEmbedding Finetuning Environment
- Environment:Huggingface Datatrove Inference GPU Environment
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:Apache Hudi Flink Runtime Environment
- Environment:OWASP Www project top 10 for large language model applications Pydantic Invoice Agent Runtime