Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Kubeflow Pipelines Pipeline Authoring and Compilation
- Workflow:Trailofbits Fickling PyTorch Format Identification
- Workflow:Ggml org Llama cpp HF to GGUF Model Conversion
- Workflow:Mlflow Mlflow LLM Tracing
- Workflow:Shiyu coder Kronos Batch Prediction
- Workflow:VainF Torch Pruning LLM Structural Pruning
- Workflow:Triton inference server Server Model Performance Tuning
- Workflow:Scikit learn contrib Imbalanced learn Balanced Deep Learning Training
- Workflow:Dagster io Dagster RAG Pipeline
- Workflow:BerriAI Litellm SDK Completion
Principles
- Principle:Interpretml Interpret Model Finalization After Merge
- Principle:ArroyoSystems Arroyo Local Cluster Initialization
- Principle:Speechbrain Speechbrain Data Preparation For CTC ASR
- Principle:Huggingface Datasets Scalar Value Types
- Principle:Bentoml BentoML Bento Artifact Management
- Principle:Huggingface Datatrove URL Filtering
- Principle:Langgenius Dify Environment Synchronization
- Principle:ARISE Initiative Robosuite Robot Simulation Abstraction
- Principle:Webdriverio Webdriverio HookLifecycle
- Principle:Speechbrain Speechbrain Speech To Unit Translation
Implementations
- Implementation:FlowiseAI Flowise AgentflowStickyNote
- Implementation:Speechbrain Speechbrain Whisper HFTransformersInterface
- Implementation:Lance format Lance Java ReadOptions
- Implementation:SeldonIO Seldon core Seldon Pipeline Infer
- Implementation:Googleapis Python genai Client Init
- Implementation:Apache Spark WriteAheadLog
- Implementation:Tensorflow Serving MNIST Input Data
- Implementation:Cohere ai Cohere python V2Client Chat Stream
- Implementation:Evidentlyai Evidently Legacy Words Feature
- Implementation:CrewAIInc CrewAI File Read Tool
Heuristics
- Heuristic:DistrictDataLabs Yellowbrick Model Fitted State Detection
- Heuristic:Apache Paimon Vector Index Configuration Tips
- Heuristic:Recommenders team Recommenders SAR Cold Start Items
- Heuristic:Arize ai Phoenix Warning Deprecated VertexAIModel
- Heuristic:Snorkel team Snorkel Minimum Three LFs
- Heuristic:Astronomer Astronomer cosmos Dbt Invocation Mode Selection
- Heuristic:Apache Flink False Positive Availability Optimization
- Heuristic:Axolotl ai cloud Axolotl Memory Optimization Tips
- Heuristic:OpenGVLab InternVL Multi GPU ViT Device Mapping
- Heuristic:Axolotl ai cloud Axolotl Gradient Checkpointing Reentrant Rules
Environments
- Environment:Apache Druid Integration Test Docker
- Environment:Microsoft Autogen Extension Optional Dependencies
- Environment:Huggingface Peft Optional Quantization Backends
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Bigscience workshop Petals Python Transformers
- Environment:MarketSquare Robotframework browser CI GitHub Actions
- Environment:Huggingface Diffusers Quantization Environment
- Environment:Guardrails ai Guardrails OpenTelemetry Tracing
- Environment:Deepset ai Haystack Python Runtime Environment
- Environment:Lm sys FastChat API Keys And Credentials