Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Langgenius Dify Knowledge Base Management
- Workflow:Langgenius Dify RAG Pipeline Development
- Workflow:Unstructured IO Unstructured Document Partitioning
- Workflow:Fede1024 Rust rdkafka Produce Consume Roundtrip
- Workflow:Alibaba MNN Model Compression
- Workflow:Interpretml Interpret Model Explanation And Visualization
- Workflow:OpenBMB UltraFeedback Dataset Construction
- Workflow:Lm sys FastChat Vicuna SFT Finetuning
- Workflow:Apache Shardingsphere Cluster Mode Initialization
- Workflow:ClickHouse ClickHouse Building From Source
Principles
- Principle:Marker Inc Korea AutoRAG API Based Passage Reranking
- Principle:Anthropics Anthropic sdk python Streaming Structured Output
- Principle:Apache Hudi Demo Environment Cleanup
- Principle:Microsoft Playwright Select Browser and Configure Context
- Principle:Avhz RustQuant Python Integration
- Principle:Triton inference server Server Component Model Preparation
- Principle:Mit han lab Llm awq Vision Transformer Encoding
- Principle:Onnx Onnx External Data Saving
- Principle:Kubeflow Kubeflow Multi User Configuration
- Principle:Recommenders team Recommenders News Recommendation Evaluation
Implementations
- Implementation:TobikoData Sqlmesh UseLocalStorage
- Implementation:Infiniflow Ragflow FileUploader Component
- Implementation:Online ml River Metrics RollingROCAUC
- Implementation:TobikoData Sqlmesh Model PlanAction
- Implementation:Run llama Llama index PairwiseComparisonEvaluator
- Implementation:SeleniumHQ Selenium Closure Testing MockClock
- Implementation:Arize ai Phoenix Evaluation DataFrame Schema
- Implementation:MaterializeInc Materialize ResolvedImage Fingerprint
- Implementation:NVIDIA NeMo Curator RayActorPoolExecutor
- Implementation:Apache Beam DirectRunner DefaultTransformOverrides
Heuristics
- Heuristic:Intel Ipex llm NF4 Quantization Best Practice
- Heuristic:Vespa engine Vespa Log Level Inheritance Polling
- Heuristic:CrewAIInc CrewAI RAG Search Defaults
- Heuristic:OWASP Www project top 10 for large language model applications SHA Pinning For GitHub Actions
- Heuristic:NVIDIA DALI Distributed Sharding Strategy
- Heuristic:Fede1024 Rust rdkafka Transaction Error Recovery
- Heuristic:Apache Airflow DAG Top Level Code Avoidance
- Heuristic:ClickHouse ClickHouse Debug Build Tips
- Heuristic:Openai Openai python Warning Deprecated Legacy Response
- Heuristic:OpenHands OpenHands Fail Open Rate Limiting
Environments
- Environment:Google deepmind Dm control OSMesa Software Rendering
- Environment:Deepset ai Haystack Python Runtime Environment
- Environment:Kubeflow Pipelines KFP Backend Deployment
- Environment:BerriAI Litellm Redis Cache Backend
- Environment:Kserve Kserve Istio Service Mesh
- Environment:Microsoft DeepSpeedExamples CIFAR10 Training Environment
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:ArroyoSystems Arroyo Python UDF Runtime
- Environment:Intel Ipex llm RAG LlamaIndex Environment
- Environment:NVIDIA NeMo Aligner NeMo Framework GPU Environment