AWS::SageMaker::EndpointConfig AsyncInferenceClientConfig - AWS CloudFormation (original) (raw)
Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{
"MaxConcurrentInvocationsPerInstance" : Integer
}
Properties
MaxConcurrentInvocationsPerInstance
The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker will choose an optimal value for you.
Required: No
Type: Integer
Update requires: Replacement
AWS::SageMaker::EndpointConfig
AsyncInferenceConfig
Did this page help you? - Yes
Thanks for letting us know we're doing a good job!
If you've got a moment, please tell us what we did right so we can do more of it.
Did this page help you? - No
Thanks for letting us know this page needs work. We're sorry we let you down.
If you've got a moment, please tell us how we can make the documentation better.