
AWS Neuron Deep Learning Containers

AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia instances using the AWS Neuron SDK. For more documentation, refer to the Neuron Containers Overview.
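As a rough illustration of how one of these images might be pulled and started, the sketch below only builds and prints the Docker command lines. It does not run them. The image URI is taken from the pytorch-inference-neuronx table in this README. The helper name `docker_run_command` is hypothetical. Passing `--device=/dev/neuronN` and running `neuron-ls` (from aws-neuronx-tools) as a smoke test are assumptions about a typical Neuron host setup; the number of devices to expose depends on the instance size, which this README does not specify.

```python
import shlex

# Image URI copied from the pytorch-inference-neuronx table in this README.
IMAGE = (
    "public.ecr.aws/neuron/pytorch-inference-neuronx:"
    "2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04"
)


def docker_run_command(image_uri, neuron_devices=1, command=("neuron-ls",)):
    """Build (but do not execute) a `docker run` argument list.

    Assumption: the host exposes Neuron devices as /dev/neuron0, /dev/neuron1, ...
    and `neuron_devices` of them should be passed through to the container.
    """
    args = ["docker", "run", "--rm", "-it"]
    for index in range(neuron_devices):
        args.append(f"--device=/dev/neuron{index}")
    args.append(image_uri)
    args.extend(command)
    return args


if __name__ == "__main__":
    # Print the commands so they can be reviewed before running them by hand.
    print(shlex.join(["docker", "pull", IMAGE]))
    print(shlex.join(docker_run_command(IMAGE, neuron_devices=1)))
```

Returning an argument list rather than a single string keeps the command safe to pass to `subprocess.run` later without shell quoting issues.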

Containers

pytorch-inference-neuron

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
| --- | --- | --- | --- | --- | --- | --- |
| PyTorch 1.13.1 | aws-neuronx-tools, torch-neuron | Neuron 2.20.2 | inf1 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuron:1.13.1-neuron-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |

pytorch-inference-neuronx

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
| --- | --- | --- | --- | --- | --- | --- |
| PyTorch 2.6.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.23.0 | trn1, trn2, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04 | torchserve 0.11.0 |
| PyTorch 2.5.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.22.0 | trn1, trn2, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.5.1-neuronx-py310-sdk2.22.0-ubuntu22.04 | torchserve 0.11.0 |
| PyTorch 2.1.2 | aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx | Neuron 2.20.2 | trn1, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.1.2-neuronx-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |
| PyTorch 1.13.1 | aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx | Neuron 2.20.2 | trn1, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:1.13.1-neuronx-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |
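The ECR Public URLs listed in this README appear to share one tag layout: framework version, the `neuron` or `neuronx` suffix of the repository name, the Python tag, the SDK version, and the Ubuntu release. The sketch below composes a URL from those fields. The pattern is inferred only from the tags shown here, and the function name `image_uri` is hypothetical, so treat the result as a guess to check against the tables rather than a guarantee that a given combination is published.

```python
REGISTRY = "public.ecr.aws/neuron"


def image_uri(repository, framework_version, sdk_version, ubuntu_version,
              python_tag="py310"):
    """Compose an image URI following the tag layout seen in this README.

    Assumption: the tag is
    <framework_version>-<neuron|neuronx>-<python_tag>-sdk<sdk_version>-ubuntu<ubuntu_version>,
    where the neuron/neuronx part is the last hyphen-separated piece of the
    repository name.
    """
    flavor = repository.rsplit("-", 1)[-1]  # "neuron" or "neuronx"
    tag = (
        f"{framework_version}-{flavor}-{python_tag}"
        f"-sdk{sdk_version}-ubuntu{ubuntu_version}"
    )
    return f"{REGISTRY}/{repository}:{tag}"


if __name__ == "__main__":
    print(image_uri("pytorch-inference-neuronx", "2.6.0", "2.23.0", "22.04"))
    print(image_uri("pytorch-inference-neuron", "1.13.1", "2.20.2", "20.04"))
```

Note that the Ubuntu release changes with the SDK version in the listed rows (20.04 for SDK 2.20.2, 22.04 for SDK 2.22.0 and later), so it is passed explicitly instead of being derived.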

pytorch-training-neuronx

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL |
| --- | --- | --- | --- | --- | --- |
| PyTorch 2.6.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.23.0 | trn1, trn2, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04 |
| PyTorch 2.5.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.22.0 | trn1, trn2, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.5.1-neuronx-py310-sdk2.22.0-ubuntu22.04 |
| PyTorch 2.1.2 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.20.2 | trn1, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.1.2-neuronx-py310-sdk2.20.2-ubuntu20.04 |
| PyTorch 1.13.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.20.2 | trn1, inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:1.13.1-neuronx-py310-sdk2.20.2-ubuntu20.04 |
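Because the supported instance types differ between rows, it can help to filter the list by instance family before choosing an image. The sketch below hard-codes the four pytorch-training-neuronx rows from this README and returns those that list a given family. The `Image` class and `images_for` function are hypothetical names used only for illustration, and the data is a snapshot of this README, not a live query against the registry.

```python
from dataclasses import dataclass

REPO = "public.ecr.aws/neuron/pytorch-training-neuronx"


@dataclass(frozen=True)
class Image:
    framework_version: str
    sdk_version: str
    instance_families: tuple
    uri: str


# Rows copied from the pytorch-training-neuronx table in this README.
TRAINING_IMAGES = (
    Image("2.6.0", "2.23.0", ("trn1", "trn2", "inf2"),
          f"{REPO}:2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04"),
    Image("2.5.1", "2.22.0", ("trn1", "trn2", "inf2"),
          f"{REPO}:2.5.1-neuronx-py310-sdk2.22.0-ubuntu22.04"),
    Image("2.1.2", "2.20.2", ("trn1", "inf2"),
          f"{REPO}:2.1.2-neuronx-py310-sdk2.20.2-ubuntu20.04"),
    Image("1.13.1", "2.20.2", ("trn1", "inf2"),
          f"{REPO}:1.13.1-neuronx-py310-sdk2.20.2-ubuntu20.04"),
)


def images_for(family, images=TRAINING_IMAGES):
    """Return the images whose supported instance types include `family`."""
    return [image for image in images if family in image.instance_families]


if __name__ == "__main__":
    for image in images_for("trn2"):
        print(image.framework_version, image.uri)
```

For example, filtering on `"trn2"` keeps only the PyTorch 2.6.0 and 2.5.1 rows, since the two older images list only trn1 and inf2.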

jax-training-neuronx

| Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
| --- | --- | --- | --- | --- | --- | --- |
| JAX 0.5 | jax-neuronx, libneuronxla | Neuron 2.23.0 | trn1, trn2, inf2 | 3.10 (py310) | public.ecr.aws/neuron/jax-training-neuronx:0.5-neuronx-py310-sdk2.23.0-ubuntu22.04 | jaxlib 0.5 |

Security

See SECURITY for more information.

License

This project is licensed under the Apache-2.0 License.