ray.serve.schema.DeploymentSchema#

class ray.serve.schema.DeploymentSchema(*, name: str, num_replicas: PositiveInt | str | None = DEFAULT.VALUE, route_prefix: str | None = DEFAULT.VALUE, max_concurrent_queries: ConstrainedIntValue = DEFAULT.VALUE, max_ongoing_requests: ConstrainedIntValue = DEFAULT.VALUE, max_queued_requests: StrictInt = DEFAULT.VALUE, user_config: Dict | None = DEFAULT.VALUE, autoscaling_config: Dict | None = DEFAULT.VALUE, graceful_shutdown_wait_loop_s: ConstrainedFloatValue = DEFAULT.VALUE, graceful_shutdown_timeout_s: ConstrainedFloatValue = DEFAULT.VALUE, health_check_period_s: ConstrainedFloatValue = DEFAULT.VALUE, health_check_timeout_s: ConstrainedFloatValue = DEFAULT.VALUE, ray_actor_options: RayActorOptionsSchema = DEFAULT.VALUE, placement_group_bundles: List[Dict[str, float]] = DEFAULT.VALUE, placement_group_strategy: str = DEFAULT.VALUE, max_replicas_per_node: int = DEFAULT.VALUE, logging_config: LoggingConfig = DEFAULT.VALUE)[source]#

Bases: BaseModel

Specifies options for one deployment within a Serve application. For each deployment this can optionally be included in ServeApplicationSchema to override deployment options specified in code.

Methods

__init__

Create a new model by parsing and validating input data from keyword arguments.

construct

Creates a new model setting __dict__ and __fields_set__ from trusted or pre-validated data.

copy

Duplicate a model, optionally choose which fields to include, exclude and change.

deployment_schema_route_prefix_format

The route_prefix 1.

dict

Generate a dictionary representation of the model, optionally specifying which fields to include or exclude.

json

Generate a JSON representation of the model, include and exclude arguments as per dict().

update_forward_refs

Try to update ForwardRefs on fields based on this Model, globalns and localns.

Attributes

name

num_replicas

route_prefix

max_concurrent_queries

max_ongoing_requests

max_queued_requests

user_config

autoscaling_config

graceful_shutdown_wait_loop_s

graceful_shutdown_timeout_s

health_check_period_s

health_check_timeout_s

ray_actor_options

placement_group_bundles

placement_group_strategy

max_replicas_per_node

logging_config