Principle:Ray project Ray Application Deployment
| Knowledge Sources | |
|---|---|
| Domains | Model_Serving, Deployment_Orchestration |
| Last Updated | 2026-02-13 17:00 GMT |
Overview
A mechanism for deploying a configured application DAG to the Serve control plane. The control plane then starts the replica actors and sets up request routing to them.
Description
Application Deployment takes a configured Application DAG (built with .bind()) and submits it to the Serve controller. The controller extracts the deployment specifications from the DAG, creates the replica actors, configures routing, and waits for all replicas to reach the RUNNING state. The result is a DeploymentHandle that can be used to send requests to the application.
Usage
Use this after configuring deployments and composing them into an application with .bind(). Deploy the application once, then send requests through the returned handle.
Theoretical Basis
Application deployment implements a desired-state reconciliation pattern. The client declares the desired state (the deployment configuration), and the controller reconciles the actual state to match it by creating, updating, or removing replica actors.
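The reconciliation pattern can be illustrated with a toy controller. This is a sketch of the general idea only, not Ray Serve's implementation: `ToyController`, its `reconcile` method, and the string-based replica IDs are all hypothetical, and desired state is reduced to a replica count per deployment.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToyController:
    """Hypothetical stand-in for a controller; not Ray Serve's actual code."""

    # Actual state: deployment name -> IDs of running replicas.
    replicas: Dict[str, List[str]] = field(default_factory=dict)
    _next_id: int = 0

    def reconcile(self, desired: Dict[str, int]) -> List[str]:
        """Move actual state toward `desired` (name -> replica count).

        Returns the list of actions taken, in order.
        """
        actions: List[str] = []

        # Remove deployments that are no longer declared.
        for name in list(self.replicas):
            if name not in desired:
                for rid in self.replicas.pop(name):
                    actions.append(f"stop {rid}")

        # Scale each declared deployment up or down to its target count.
        for name, target in desired.items():
            running = self.replicas.setdefault(name, [])
            while len(running) < target:
                rid = f"{name}#{self._next_id}"
                self._next_id += 1
                running.append(rid)
                actions.append(f"start {rid}")
            while len(running) > target:
                actions.append(f"stop {running.pop()}")

        return actions


controller = ToyController()
first = controller.reconcile({"Greeter": 2})                # creates two replicas
second = controller.reconcile({"Greeter": 2})               # already converged
third = controller.reconcile({"Greeter": 1, "Router": 1})   # scale down, add new
fourth = controller.reconcile({})                           # remove everything
```

Calling `reconcile` repeatedly with the same desired state takes no further action, which is the property that lets a client redeploy the same configuration safely.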