Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Langgenius Dify Workflow Builder and Execution
- Workflow:Datahub project Datahub Protobuf Schema Ingestion
- Workflow:Astronomer Astronomer cosmos TaskGroup dbt integration
- Workflow:NVIDIA TransformerEngine Accelerate HF Llama With TE
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:Openai Openai agents python Tool Integrated Agent
- Workflow:Sktime Pytorch forecasting DeepAR Probabilistic Forecasting
- Workflow:Microsoft Onnxruntime On Device Training
- Workflow:Datahub project Datahub CLI Metadata Ingestion
- Workflow:Langfuse Langfuse Otel ingestion pipeline
Principles
- Principle:Dotnet Machinelearning SSA Model Fitting
- Principle:Apache Spark YARN Web Proxy Security
- Principle:Scikit learn contrib Imbalanced learn Under Sampling Base Abstraction
- Principle:Microsoft Agent framework Declarative Tool Binding
- Principle:Lucidrains X transformers Belief State Training
- Principle:Duckdb Duckdb Auxiliary Code Generation
- Principle:Huggingface Diffusers Lazy Import Management
- Principle:Open compass VLMEvalKit API Model Adapter Pattern
- Principle:Apache Druid Visualization Module Selection
- Principle:Dotnet Machinelearning Binary Classification Training
Implementations
- Implementation:Vespa engine Vespa Embedder Embed
- Implementation:Apache Flink CompactorOperator NotifyCheckpointComplete
- Implementation:TA Lib Ta lib python Stream Function API
- Implementation:AUTOMATIC1111 Stable diffusion webui File Hashing
- Implementation:Mage ai Mage ai Source Write Records
- Implementation:Alibaba MNN FlatBuffers Reflection Header
- Implementation:Mlflow Mlflow Trace Decorator
- Implementation:Datajuicer Data juicer NlpaugEnMapper
- Implementation:Arize ai Phoenix Pyproject Config
- Implementation:Apache Druid SchemaColumnList
Heuristics
- Heuristic:Apache Dolphinscheduler JDBC Security Blocklist
- Heuristic:OpenGVLab InternVL Gradient Checkpointing Memory
- Heuristic:Volcengine Verl Sequence Length Balancing
- Heuristic:Isaac sim IsaacGymEnvs Determinism Performance Tradeoff
- Heuristic:ContextualAI HALOs FSDP Sampling Workaround
- Heuristic:Duckdb Duckdb Version Sync Across Files
- Heuristic:NVIDIA NeMo Aligner PPO Critic Warmup Tip
- Heuristic:Langchain ai Langgraph Retry Policy Configuration
- Heuristic:ARISE Initiative Robomimic Data Worker Tuning By Modality
- Heuristic:Langgenius Dify API Token Single Flight Caching
Environments
- Environment:ArroyoSystems Arroyo Kubernetes Deployment
- Environment:Iamhankai Forest of Thought OpenAI API Credentials
- Environment:OpenBMB UltraFeedback vLLM Multi GPU Environment
- Environment:Cohere ai Cohere python AWS Integration Dependencies
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Iterative Dvc Git SCM Environment
- Environment:Deepspeedai DeepSpeed XPU Environment
- Environment:Vllm project Vllm Distributed
- Environment:Mlflow Mlflow OpenAI LLM Integration Environment
- Environment:Ggml org Llama cpp Python Conversion Environment