Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DataTalksClub Data engineering zoomcamp Spark Batch Processing
- Workflow:Fastai Fastbook Tabular Modeling
- Workflow:Unstructured IO Unstructured Chunking And Embedding
- Workflow:NVIDIA DALI Image Classification Training PyTorch
- Workflow:Openai Evals Creating a model graded eval
- Workflow:EvolvingLMMs Lab Lmms eval Custom Model Integration
- Workflow:Datajuicer Data juicer Dataset Quality Analysis
- Workflow:Apache Druid Batch Data Ingestion
- Workflow:Ggml org Llama cpp Speculative Decoding
- Workflow:NVIDIA DALI Custom Operator Development
Principles
- Principle:Sktime Pytorch forecasting Decomposition Linear
- Principle:FlagOpen FlagEmbedding Package Installation
- Principle:Scikit learn Scikit learn Gaussian Process
- Principle:Huggingface Datasets Arrow File Reading
- Principle:Datajuicer Data juicer Statistics Computation
- Principle:Tensorflow Serving Model Metadata Query
- Principle:Pola rs Polars Timezone Handling
- Principle:CrewAIInc CrewAI Specialist Agent Definition
- Principle:Recommenders team Recommenders NCF Prediction
- Principle:Microsoft Autogen Round Robin Orchestration
Implementations
- Implementation:CARLA simulator Carla MapData
- Implementation:Cohere ai Cohere python Generation Model
- Implementation:Apache Paimon MemorySliceInput
- Implementation:BerriAI Litellm LLM Guard
- Implementation:Cleanlab Cleanlab Segmentation Get Label Quality Scores
- Implementation:Ggml org Ggml Hexagon backend
- Implementation:ArroyoSystems Arroyo Proto Schema Converter
- Implementation:Microsoft Semantic kernel IEmbeddingGenerator GenerateAsync
- Implementation:Ollama Ollama MLXRunner Client
- Implementation:NVIDIA TransformerEngine TE LayerNormMLP
Heuristics
- Heuristic:Google deepmind Dm control Prop Settling Physics Tuning
- Heuristic:Tencent Ncnn Letterbox Vs Direct Resize
- Heuristic:Datahub project Datahub Gradle Task Only
- Heuristic:Avhz RustQuant MC Parallel Path Threshold
- Heuristic:Fede1024 Rust rdkafka Producer Flush Before Drop
- Heuristic:Predibase Lorax Warning Deprecated BitsAndBytes 8bit
- Heuristic:Liu00222 Open Prompt Injection BPE Retokenization Parameters
- Heuristic:Arize ai Phoenix Queue Clear Race Condition
- Heuristic:Eventual Inc Daft Execution Config Tuning
- Heuristic:Apache Flink False Positive Availability Optimization
Environments
- Environment:Kubeflow Pipelines KFP Backend Deployment
- Environment:Isaac sim IsaacGymEnvs IsaacGym Preview 4
- Environment:Volcengine Verl Ray Distributed Environment
- Environment:Google deepmind Mujoco MJX JAX Environment
- Environment:Getgauge Taiko Docker Container
- Environment:Deepspeedai DeepSpeed CPU Environment
- Environment:Vllm project Vllm CUDA GPU Runtime
- Environment:Openai Openai agents python Memory Extensions Dependencies
- Environment:PacktPublishing LLM Engineers Handbook Docker MongoDB Qdrant Infrastructure
- Environment:AnswerDotAI RAGatouille Python ColBERT Dependencies