AWS::SageMaker::EndpointConfig AsyncInferenceClientConfig - AWS CloudFormation (original) (raw)

Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON

{
  "MaxConcurrentInvocationsPerInstance" : Integer
}

Properties

MaxConcurrentInvocationsPerInstance

The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker will choose an optimal value for you.

Required: No

Type: Integer

Update requires: Replacement

AWS::SageMaker::EndpointConfig

AsyncInferenceConfig

Did this page help you? - Yes

Thanks for letting us know we're doing a good job!

If you've got a moment, please tell us what we did right so we can do more of it.

Did this page help you? - No

Thanks for letting us know this page needs work. We're sorry we let you down.

If you've got a moment, please tell us how we can make the documentation better.