Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ray project Ray Actor Lifecycle Management
- Workflow:Infiniflow Ragflow Document Processing Pipeline
- Workflow:Farama Foundation Gymnasium Vectorized Environment Training
- Workflow:Huggingface Datasets Dataset Loading and Exploration
- Workflow:Webdriverio Webdriverio Cucumber BDD Testing
- Workflow:LLMBook zh LLMBook zh github io LLM Pretraining
- Workflow:Deepseek ai Janus Multimodal Understanding
- Workflow:Puppeteer Puppeteer PDF Generation
- Workflow:Vllm project Vllm Structured Output Generation
- Workflow:Microsoft Playwright Network mocking and interception
Principles
- Principle:Guardrails ai Guardrails Stream Chunk Processing
- Principle:Promptfoo Promptfoo CLI Utilities
- Principle:Langfuse Langfuse LLM Execution for Experiments
- Principle:NVIDIA TransformerEngine HF Decoder Layer Replacement
- Principle:LaurentMazare Tch rs Vocabulary Management
- Principle:Haosulab ManiSkill Episode Initialization
- Principle:Nightwatchjs Nightwatch Extension Path Configuration
- Principle:Tensorflow Tfjs Fine Tuning
- Principle:Google deepmind Mujoco Simulation State Initialization
- Principle:Langgenius Dify InputValidation
Implementations
- Implementation:Facebookresearch Habitat lab SingleAgentAccessMgr
- Implementation:TobikoData Sqlmesh TasksOverview
- Implementation:Scikit learn Scikit learn SparsePCA
- Implementation:Nautechsystems Nautilus trader TradingNode Init
- Implementation:Alibaba MNN Protobuf Message CC
- Implementation:Vllm project Vllm Torch Bindings
- Implementation:Microsoft Playwright Page AddInitScript
- Implementation:CARLA simulator Carla MeshFactory Interface
- Implementation:CrewAIInc CrewAI Flow State Model
- Implementation:Hiyouga LLaMA Factory MoE Config
Heuristics
- Heuristic:Fastai Fastbook Discriminative Learning Rates
- Heuristic:Mlflow Mlflow Batch Logging Size Limits
- Heuristic:FlagOpen FlagEmbedding Dynamic Batch Size Reduction
- Heuristic:OpenRLHF OpenRLHF Gradient Checkpointing Memory Tip
- Heuristic:Sktime Pytorch forecasting Gradient Clipping Value
- Heuristic:Nautechsystems Nautilus trader Strategy On Start Initialization
- Heuristic:Predibase Lorax Flash Attention Backend Selection
- Heuristic:Arize ai Phoenix Notebook Event Loop Patching
- Heuristic:Huggingface Datasets Warning Deprecated Pandas Builder
- Heuristic:ContextualAI HALOs LoRA Merge At Save
Environments
- Environment:Diagram of thought Diagram of thought Python Graph Libraries
- Environment:Neuml Txtai Python Core Environment
- Environment:Sgl project Sglang ROCm
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:Lance format Lance SIMD And Platform Requirements
- Environment:Ggml org Ggml CUDA GPU Environment
- Environment:Mlc ai Mlc llm CUDA GPU Environment
- Environment:Datajuicer Data juicer Python Runtime Environment
- Environment:Microsoft Onnxruntime Windows Build Environment