aws-samples/aws-do-eks
Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference examples.
ShellMIT-0
Issues
- 0
Configuration creates two security groups with `kubernetes.io/cluster/k8s-gpu-cluster` tag
#24 opened by fozziethebeat - 0
Update Distributed training with Amazon EKS and Torch Distributed Elastic blog
#22 opened by HUNG-rushb - 0
MPIJob EFA example doesn't apply
#19 opened by kwohlfahrt - 0
Cluster Auto-scaler forbidden error
#17 opened by chailatent - 2
Use Finch instead of docker on OSX
#13 opened by perifaws