Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Helicone Helicone Integrate Provider To Gateway
- Workflow:Sdv dev SDV Single table synthesis
- Workflow:Ggml org Llama cpp Text Generation
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:Apache Hudi Flink Table Clustering
- Workflow:Mlflow Mlflow Model Logging and Registry
- Workflow:Deepset ai Haystack Document Preprocessing Pipeline
- Workflow:Nautechsystems Nautilus trader Data loading and cataloging
- Workflow:Dotnet Machinelearning Time Series Forecasting
- Workflow:Pola rs Polars DataFrame Aggregation and Grouping
Principles
- Principle:Triton inference server Server Model Lifecycle Testing
- Principle:Online ml River ADWIN Drift Detection
- Principle:Webdriverio Webdriverio CapabilityNormalization
- Principle:Tensorflow Serving Type Safe Erasure
- Principle:AUTOMATIC1111 Stable diffusion webui Fair Task Scheduling
- Principle:Roboflow Rf detr Model Initialization
- Principle:Openclaw Openclaw Daemon Installation
- Principle:OpenGVLab InternVL Model Inference Loading
- Principle:DistrictDataLabs Yellowbrick Residual Analysis
- Principle:Datahub project Datahub Stack Lifecycle Management
Implementations
- Implementation:Risingwavelabs Risingwave JDBCSinkFactory
- Implementation:Infiniflow Ragflow DatasetOverviewHooks
- Implementation:Apache Hudi Azure Pipelines CI Configuration
- Implementation:Datahub project Datahub Actions CLI Run
- Implementation:Apache Druid Sampler Streaming Schema
- Implementation:Speechbrain Speechbrain Train SLURP Direct Wav2Vec
- Implementation:Langchain ai Langchain BaseChatModel Subclass
- Implementation:NVIDIA TransformerEngine InferenceParams
- Implementation:Ggml org Ggml Cpu impl
- Implementation:Online ml River Datasets Index
Heuristics
- Heuristic:SeldonIO Seldon core Over Commit Memory Tip
- Heuristic:CARLA simulator Carla PID Controller Tuning
- Heuristic:InternLM Lmdeploy KV Cache Memory Tuning
- Heuristic:OWASP Www project top 10 for large language model applications Deliberately Insecure Code Isolation
- Heuristic:Tensorflow Serving Warning Deprecated CreateTfrtSavedModel Raw
- Heuristic:Huggingface Alignment handbook Sequence Packing Strategy
- Heuristic:Alibaba ROLL GPU Memory Offload Strategy
- Heuristic:Guardrails ai Guardrails Async Vs Sync Validation Mode
- Heuristic:Facebookresearch Habitat lab Force Single Threaded PyTorch
- Heuristic:Intel Ipex llm Llama Padding Token Workaround
Environments
- Environment:Eventual Inc Daft Ray Distributed Runner
- Environment:Sgl project Sglang Prometheus
- Environment:Lance format Lance SIMD And Platform Requirements
- Environment:Kserve Kserve VLLM Runtime
- Environment:Intel Ipex llm XPU Inference Environment
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:ARISE Initiative Robomimic HuggingFace Hub Dependencies
- Environment:Testtimescaling Testtimescaling github io Semantic Scholar API
- Environment:Puppeteer Puppeteer Node 18 Runtime
- Environment:Nautechsystems Nautilus trader Databento API Credentials