Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Danijar Dreamerv3 Train And Evaluate
- Workflow:Allenai Open instruct GRPO Reinforcement Learning
- Workflow:Isaac sim IsaacGymEnvs Domain Randomization Training
- Workflow:Wandb Weave SDK Release
- Workflow:Scikit learn Scikit learn Ensemble Model Building
- Workflow:Speechbrain Speechbrain Speaker Embedding Training
- Workflow:Mlfoundations Open flamingo Data Preparation
- Workflow:Triton inference server Server Model Performance Tuning
- Workflow:Langgenius Dify RAG Pipeline Development
- Workflow:Ray project Ray Remote Task Execution
Principles
- Principle:ArroyoSystems Arroyo UDF Runtime Execution
- Principle:Huggingface Datasets Language Code Registry
- Principle:Ggml org Llama cpp Tokenization
- Principle:Huggingface Alignment handbook QLoRA Quantized Finetuning
- Principle:Avhz RustQuant Order Management
- Principle:ARISE Initiative Robosuite USD Scene Export
- Principle:Tencent Ncnn Model Merging
- Principle:Openclaw Openclaw Gateway Server Startup
- Principle:Pytorch Serve Hardware Metrics Collection
- Principle:Datajuicer Data juicer Operator Type Selection
Implementations
- Implementation:Mage ai Mage ai Twitter Ads Transform
- Implementation:Online ml River Stream Shuffle
- Implementation:Microsoft Playwright APIRequestContext Fetch
- Implementation:Datahub project Datahub SparkPathUtils
- Implementation:Confident ai Deepeval Observe Decorator
- Implementation:Anthropics Anthropic sdk python Messages Create With Tools
- Implementation:Tensorflow Serving Caching Manager Test
- Implementation:DataTalksClub Data engineering zoomcamp Redpanda CSV Consumer
- Implementation:Microsoft Onnxruntime TrainingUtil
- Implementation:Rapidsai Cuml SMO Solver
Heuristics
- Heuristic:Datahub project Datahub Gradle Formatting Over Direct Tools
- Heuristic:Promptfoo Promptfoo Retry With Jitter
- Heuristic:NVIDIA DALI Warning Deprecated C API V1 Functions
- Heuristic:Huggingface Transformers Dataloader Pin Memory NonBlocking
- Heuristic:Teamcapybara Capybara Frozen Time Detection
- Heuristic:NVIDIA DALI Thread Affinity Optimization
- Heuristic:HKUDS AI Trader Market Type Auto Detection
- Heuristic:Protectai Modelscan Graceful Scanner Degradation
- Heuristic:Apache Dolphinscheduler HikariCP Pool Tuning
- Heuristic:Sdv dev SDV Gaussian KDE Incompatibility
Environments
- Environment:Spcl Graph of thoughts Python 3 8 Runtime
- Environment:Cohere ai Cohere python Cohere API Credentials
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:Mit han lab Llm awq Flash Attention Environment
- Environment:Obss Sahi Python Pycocotools
- Environment:FMInference FlexLLMGen NVMe Disk
- Environment:FlagOpen FlagEmbedding GPU Accelerator Environment
- Environment:Snorkel team Snorkel Dask Distributed
- Environment:Kserve Kserve Istio Service Mesh
- Environment:TobikoData Sqlmesh GitHub CICD Runner