Serverless Inference — sagemaker 2.247.0 documentation

This module contains classes related to Amazon SageMaker Serverless Inference.

This module contains code related to the ServerlessInferenceConfig class.

This code configures a serverless inference endpoint. Use it when deploying a model to an endpoint.

class sagemaker.serverless.serverless_inference_config.ServerlessInferenceConfig(memory_size_in_mb=2048, max_concurrency=5, provisioned_concurrency=None)

Bases: object

Configuration object passed in when deploying models to Amazon SageMaker Endpoints.

This object specifies the configuration for a serverless endpoint. Use it when creating a serverless endpoint and running serverless inference.
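As a rough illustration of what this configuration carries, the sketch below builds the kind of serverless settings dictionary that an endpoint configuration request would contain. It does not use the SDK: `build_serverless_config` and `ALLOWED_MEMORY_SIZES_MB` are hypothetical names introduced here. The allowed memory sizes and the rule that provisioned concurrency may not exceed maximum concurrency are assumptions based on the service limits at the time of writing, not something this page states.

```python
from typing import Optional

# Assumption: memory sizes (in MB) accepted by serverless endpoints,
# in 1 GB steps. Check the current service limits before relying on this.
ALLOWED_MEMORY_SIZES_MB = (1024, 2048, 3072, 4096, 5120, 6144)


def build_serverless_config(
    memory_size_in_mb: int = 2048,
    max_concurrency: int = 5,
    provisioned_concurrency: Optional[int] = None,
) -> dict:
    """Hypothetical helper mirroring the constructor's three parameters.

    Returns a dict shaped like the serverless section of an endpoint
    configuration request. Defaults match the class signature above.
    """
    if memory_size_in_mb not in ALLOWED_MEMORY_SIZES_MB:
        raise ValueError(
            f"memory_size_in_mb must be one of {ALLOWED_MEMORY_SIZES_MB}"
        )
    if max_concurrency < 1:
        raise ValueError("max_concurrency must be at least 1")

    config = {
        "MemorySizeInMB": memory_size_in_mb,
        "MaxConcurrency": max_concurrency,
    }

    # Provisioned concurrency is optional; include it only when set.
    if provisioned_concurrency is not None:
        if not 1 <= provisioned_concurrency <= max_concurrency:
            raise ValueError(
                "provisioned_concurrency must be between 1 and max_concurrency"
            )
        config["ProvisionedConcurrency"] = provisioned_concurrency

    return config


# With the SDK installed, the equivalent object would be created roughly as
# follows, where `model` is a hypothetical, already-constructed Model:
#   from sagemaker.serverless import ServerlessInferenceConfig
#   config = ServerlessInferenceConfig(memory_size_in_mb=4096, max_concurrency=10)
#   predictor = model.deploy(serverless_inference_config=config)
```

Validating the values up front is a design choice of this sketch. It surfaces a bad memory size or concurrency setting before any deployment call is made.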

Initialize a ServerlessInferenceConfig object for serverless inference configuration.

Parameters: