Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Alibaba ROLL Supervised Finetuning Pipeline
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:Elevenlabs Elevenlabs python Realtime TTS Streaming
- Workflow:Google deepmind Dm control Manipulation Task Setup
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:Deepset ai Haystack RAG Pipeline
- Workflow:Google deepmind Dm control Locomotion Task Setup
- Workflow:Facebookresearch Habitat lab PointNav PPO Training
- Workflow:CrewAIInc CrewAI Flow Based Orchestration
- Workflow:Protectai Modelscan Custom Scanner Plugin
Principles
- Principle:Nightwatchjs Nightwatch Page Commands
- Principle:Anthropics Anthropic sdk python Message Request Construction
- Principle:Triton inference server Server Server Utilities
- Principle:Truera Trulens LangGraph Agent Wrapping
- Principle:Apache Airflow Provider Release Process
- Principle:Fede1024 Rust rdkafka Custom Client Context
- Principle:DataExpert io Data engineer handbook Experiment User Assignment
- Principle:FMInference FlexLLMGen Execution Environment Initialization
- Principle:Predibase Lorax Continuous Batching Inference
- Principle:Online ml River Time Series Evaluation
Implementations
- Implementation:Apache Airflow Lifecycle Listener Spec
- Implementation:Tensorflow Serving Tfrt Multi Inference Test
- Implementation:InternLM Lmdeploy Gemm KernelImplSm90
- Implementation:Openai CLIP Transform
- Implementation:Lance format Lance LegacyBlobEncoding
- Implementation:Scikit learn Scikit learn GaussianMixture
- Implementation:Neuml Txtai Llama Vectors
- Implementation:Openclaw Openclaw StartGatewayServer
- Implementation:Mage ai Mage ai Destination Emit State
- Implementation:Apache Dolphinscheduler FailoverCoordinator WorkerFailover
Heuristics
- Heuristic:NVIDIA NeMo Aligner PPO Critic Warmup Tip
- Heuristic:Fede1024 Rust rdkafka Multi Version Dependency Hazard
- Heuristic:Neuml Txtai Thread Safety Constraints
- Heuristic:Mlc ai Mlc llm Engine Mode Selection
- Heuristic:Haosulab ManiSkill Rendering Memory Optimization
- Heuristic:Ollama Ollama Download Retry Strategy
- Heuristic:Cleanlab Cleanlab Confident Threshold Heuristic
- Heuristic:Google research Deduplicate text datasets Ulimit File Descriptors For Merge
- Heuristic:Princeton nlp Tree of thought llm API Request Batching
- Heuristic:Google research Deduplicate text datasets HACKSIZE Overlap Buffer
Environments
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:Microsoft Agent framework Python 3 10 Runtime
- Environment:Facebookresearch Audiocraft Python PyTorch CUDA Environment
- Environment:Alibaba ROLL ROCm GPU Environment
- Environment:Deepspeedai DeepSpeed CUDA GPU Environment
- Environment:CarperAI Trlx DeepSpeed Multi GPU
- Environment:Speechbrain Speechbrain HuggingFace Transformers
- Environment:Mage ai Mage ai Python 3 9 Runtime
- Environment:Apache Spark JDK Build Environment