hyperpod
There are 3 repositories under the hyperpod topic.
aws-samples/awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
aws-samples/aws-do-hyperpod
Create and manage Amazon SageMaker HyperPod clusters and run distributed model training
aws-samples/playground-persistent-cluster
Experimental scripts for Amazon SageMaker HyperPod