/deep-learning-containers

AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia instances using AWS Neuron SDK.

Primary LanguagePythonApache License 2.0Apache-2.0

AWS Neuron Deep Learning Containers

AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia instances using AWS Neuron SDK. For more documentation, please refer to Neuron Containers Overview.

Containers

pytorch-inference-neuron

Framework Neuron Packages Neuron SDK Version Supported EC2 Instance Types Python Version Options ECR Public URL Other Packages
PyTorch 1.13.1 aws-neuronx-tools, torch-neuron Neuron 2.19.1 inf1 3.10 (py310) public.ecr.aws/neuron/pytorch-inference-neuron:1.13.1-neuron-py310-sdk2.19.1-ubuntu20.04 torchserve 0.11.0

pytorch-inference-neuronx

Framework Neuron Packages Neuron SDK Version Supported EC2 Instance Types Python Version Options ECR Public URL Other Packages
PyTorch 2.1.2 aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx Neuron 2.19.1 trn1,inf2 3.10 (py310) public.ecr.aws/neuron/pytorch-inference-neuronx:2.1.2-neuronx-py310-sdk2.19.1-ubuntu20.04 torchserve 0.11.0
PyTorch 1.13.1 aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx Neuron 2.19.1 trn1,inf2 3.10 (py310) public.ecr.aws/neuron/pytorch-inference-neuronx:1.13.1-neuronx-py310-sdk2.19.1-ubuntu20.04 torchserve 0.11.0

pytorch-training-neuronx

Framework Neuron Packages Neuron SDK Version Supported EC2 Instance Types Python Version Options ECR Public URL
PyTorch 2.1.2 aws-neuronx-tools, neuronx_distributed, torch-neuronx Neuron 2.19.1 trn1,inf2 3.10 (py310) public.ecr.aws/neuron/pytorch-training-neuronx:2.1.2-neuronx-py310-sdk2.19.1-ubuntu20.04
PyTorch 1.13.1 aws-neuronx-tools, neuronx_distributed, torch-neuronx Neuron 2.19.1 trn1,inf2 3.10 (py310) public.ecr.aws/neuron/pytorch-training-neuronx:1.13.1-neuronx-py310-sdk2.19.1-ubuntu20.04

Security

See SECURITY for more information.

License

This project is licensed under the Apache-2.0 License.