Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Vespa engine Vespa Linguistics text processing pipeline
- Workflow:Vibrantlabsai Ragas Custom Metric Creation
- Workflow:Langfuse Langfuse Prompt management lifecycle
- Workflow:Microsoft BIPIA Attack Success Rate Evaluation
- Workflow:ContextualAI HALOs Online Iterative Alignment
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
- Workflow:Intel Ipex llm RAG With LangChain
- Workflow:Apache Flink File Sink Pipeline
- Workflow:Dotnet Machinelearning ONNX Model Scoring
- Workflow:Apache Shardingsphere Shadow Database Routing
Principles
- Principle:ArroyoSystems Arroyo UDF Authoring
- Principle:Huggingface Datasets WebDataset Building
- Principle:Vllm project Vllm Draft Model Acquisition
- Principle:Togethercomputer Together python Batch Result Retrieval
- Principle:Microsoft Agent framework Human in the Loop Request
- Principle:Apache Kafka Release Branch Preparation
- Principle:Spcl Graph of thoughts Thought Scoring
- Principle:Triton inference server Server Jetson Edge Deployment
- Principle:Google deepmind Mujoco Engine Utilities
- Principle:Allenai Open instruct Actor Coordination
Implementations
- Implementation:Risingwavelabs Risingwave MySqlOffsetContext
- Implementation:Online ml River Tree HoeffdingTree
- Implementation:NVIDIA DALI C API V2 Pipeline Tests
- Implementation:Tencent Ncnn C API
- Implementation:TobikoData Sqlmesh Models Page
- Implementation:Kornia Kornia Lovasz Hinge Loss
- Implementation:Openai Openai node Pagination
- Implementation:VainF Torch Pruning GroupTaylorImportance
- Implementation:Alibaba ROLL Comparison GPT4 Data Zh
- Implementation:Protectai Llm guard API Endpoints
Heuristics
- Heuristic:Duckdb Duckdb PR Submission Strategy
- Heuristic:AUTOMATIC1111 Stable diffusion webui GTX 16 Series FP16 Workaround
- Heuristic:Eric mitchell Direct preference optimization FSDP Batch Size Per GPU
- Heuristic:Ggml org Ggml Sampling Parameter Defaults
- Heuristic:Datahub project Datahub Warning Deprecated Spark Lineage Legacy
- Heuristic:Interpretml Interpret EBM Hyperparameter Tuning Guide
- Heuristic:Openai CLIP Linear Probe Regularization C
- Heuristic:Deepset ai Haystack Logging Auto Detection
- Heuristic:Pola rs Polars GPU Aggregation Join Speedup
- Heuristic:Huggingface Open r1 Code Execution Timeout Strategy
Environments
- Environment:DataExpert io Data engineer handbook PostgreSQL Docker Environment
- Environment:Spotify Luigi SQLAlchemy Database
- Environment:Vllm project Vllm CUDA Hopper
- Environment:Protectai Llm guard ONNX Runtime Acceleration
- Environment:CrewAIInc CrewAI LLM Provider Credentials
- Environment:Infiniflow Ragflow Docker Infrastructure
- Environment:Spotify Luigi Tornado Web Server
- Environment:Lucidrains X transformers Python Environment
- Environment:Gretelai Gretel synthetics Python Base Environment
- Environment:Online ml River Build Toolchain