Jump to content

Connect Leeroopedia MCP: Equip your AI agents to search best practices, build plans, verify code, diagnose failures, and look up hyperparameter defaults.

Principle:LMCache LMCache Cluster Management

From Leeroopedia


Knowledge Sources
Domains Cluster_Management, Monitoring
Last Updated 2026-02-09 00:00 GMT

Overview

A REST API-based cluster management pattern that provides operations for looking up, moving, pinning, clearing, and compressing KV cache entries across instances.

Description

Cluster Management exposes the controller's KV operations as REST API endpoints. External tools or automated scripts can query cache state (lookup), migrate cache between instances (move), prevent eviction (pin), free cache (clear), or compress cache (compress). Operations are dispatched to workers via the LMCacheClusterExecutor.

Usage

Use the controller's REST API endpoints for cluster-level cache management. The API is available at the controller's --port.

Theoretical Basis

Operations follow a request-dispatch-aggregate pattern:

  1. Client sends REST request to controller
  2. KVController validates and routes to LMCacheClusterExecutor
  3. Executor dispatches ZMQ messages to target workers
  4. Workers execute the operation and return results
  5. Executor aggregates results and returns to client

Related Pages

Implemented By

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment