GoogleCloudPlatform/container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
GoApache-2.0
Issues
- 3
- 0
`nvidia-device-plugin` failed to run on GPU nodes created by Node Auto-Provisioning
#407 opened by hongchaodeng - 28
Nvidia Driver Public Bucket returning 403 - breaking ALL driver installation
#356 opened by bigbitbus - 0
- 3
nvidia-driver-installer fails to install drivers for G2 instance type with L4
#302 opened by christidis - 1
- 2
is there a solution to make all gpu deveices visible for a pod which not requests `nvidia.com/gpu`
#239 opened by tingweiwu - 4
OutOfnvidia.com/gpu when node is restarted
#100 opened by driosalido - 0
- 2
Request to provide Dockerfile source code for Nvidia driver installation on COS
#204 opened by Loquats - 3
Driver upgrade is not possible
#200 opened by adityapatadia - 0
- 1
- 0
- 0
- 21
- 8
- 12
nvidia-gpu-device-plugin gets OOM killed
#202 opened by omesser - 1
when restart kubelet, the gpu-device-plugin will Restart and Re-register to the new kubelet, leads to pods that depend on the gpu-device-plugin restarting as well.
#199 opened by Soledao - 3
- 1
- 23
Downloading driver fails on a K8S 1.18 GKE Cluster
#177 opened by sbrunk - 2
- 1
- 0
Installation on ubuntu 18.04 LTS VM on google cloud fails on "Unable to locate package linux-headers-5.3.0-1029-gcp"
#140 opened by Svendegroote91 - 1
Is it okay to change Nvidia Driver version other than that given by Daemonset mentioned in GPU for Kubernetes Cluster Docs
#134 opened by limbuu - 0
- 1
Can't install NVIDIA Drivers using DaemonSet
#137 opened by MartinaRuocco - 4
- 0
nvidia-driver-installer failed to build the driver
#130 opened by ensonic - 2
init container error on v1.15.4-gke.15
#127 opened by itayvallach - 1
Init container erroring on 1.13.10-gke.0 Clusters
#125 opened by chainlink - 1
Kernel Download Fails for Pod Running on Ubuntu 18.04
#101 opened by KingJ - 11
Using single GPU with multiple containers
#123 opened by ndesh26 - 5
- 0
- 0
Rename this repo to kubernetes-engine-*
#108 opened by ahmetb - 0
Update driver version: how?
#107 opened by thomas-riccardi - 2
Support for CUDA 10.0
#106 opened by cwbeitel - 6
- 1
Pod Unschedulable
#104 opened by AjayZinngg - 1
- 7
- 4
device plugin to emit metrics?
#78 opened by lsjostro - 3
when asked for 1 gpu device, gpu is available as /dev/nvidia1 not /dev/nvidia0
#72 opened by gurvindersingh - 1
- 16
nvidia-gpu daemonset using hostNetworking
#64 opened by kodieGlosser - 1
DevicePlugin Pod keeps terminating
#65 opened by afritzler - 2
- 7