Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Volcengine Verl Data Preprocessing For RL
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit LLM Int8 Inference
- Workflow:Mlflow Mlflow Prompt Management
- Workflow:Dotnet Machinelearning Binary Classification Pipeline
- Workflow:Openclaw Openclaw Docker Deployment
- Workflow:Apache Dolphinscheduler Workflow Failover Recovery
- Workflow:Google research Deduplicate text datasets Wiki40B TFDS deduplication
- Workflow:Pola rs Polars DataFrame Aggregation and Grouping
- Workflow:Helicone Helicone Local Development Setup
- Workflow:ClickHouse ClickHouse Building From Source
Principles
- Principle:Triton inference server Server Inference Request
- Principle:FlowiseAI Flowise Node Parameter Configuration
- Principle:Datahub project Datahub Client Authentication
- Principle:Nightwatchjs Nightwatch Component Test Authoring
- Principle:Apache Paimon Lance Distributed Analytics
- Principle:Explodinggradients Ragas Document Loading
- Principle:Hiyouga LLaMA Factory Dataset Format Conversion
- Principle:CARLA simulator Carla CMake Build Configuration
- Principle:Danijar Dreamerv3 Logging And Reporting
- Principle:PacktPublishing LLM Engineers Handbook HuggingFace Dataset Publishing
Implementations
- Implementation:Alibaba MNN Express Utils
- Implementation:Apache Spark Build Api Docs
- Implementation:Treeverse LakeFS Commit Operation
- Implementation:Protectai Llm guard Relevance
- Implementation:Hiyouga LLaMA Factory Hparams Parser
- Implementation:Open compass VLMEvalKit SArena Metrics
- Implementation:SeleniumHQ Selenium Closure ErrorHandler
- Implementation:Arize ai Phoenix PrecisionRecallFScore
- Implementation:Google deepmind Mujoco mjx benchmark
- Implementation:Ggml org Llama cpp Memory Recurrent
Heuristics
- Heuristic:Duckdb Duckdb Memory Management Rules
- Heuristic:Cohere ai Cohere python Tokenizer Cache With TTL
- Heuristic:Fede1024 Rust rdkafka Transaction Error Recovery
- Heuristic:Gretelai Gretel synthetics GPU Memory Allow Growth
- Heuristic:Sail sg LongSpec Triton Block Size Tuning
- Heuristic:Deepspeedai DeepSpeed ZeRO Pipeline Incompatibility
- Heuristic:Groq Groq python Retry Backoff Strategy
- Heuristic:ThreeSR Awesome Inference Time Scaling Date Parsing Fallback Tip
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:Deepset ai Haystack Document Splitting Defaults
Environments
- Environment:Openai Openai node Node 20 Runtime
- Environment:Eventual Inc Daft Ray Distributed Runner
- Environment:Langfuse Langfuse ClickHouse Analytics
- Environment:Sgl project Sglang CUDA SM100
- Environment:NVIDIA TransformerEngine CUDA Toolkit Requirements
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:OpenRLHF OpenRLHF CUDA GPU Environment
- Environment:CrewAIInc CrewAI Python Runtime Environment
- Environment:Apache Airflow Development Contributor Environment
- Environment:Microsoft Agent framework Python 3 10 Runtime