Failed to start gpuManager stat /dev/nvidia*: no such file or directory
mattshma opened this issue · 0 comments
mattshma commented
启动 kubelet 时 hang 住了,查看其 log,有如下报错信息:
Failed to start gpuManager stat /dev/nvidiactl: no such file or directory
Failed to start gpuManager stat /dev/nvidia-uvm: no such file or directory
执行如下命令解决了问题:
$ sudo nvidia-modprobe -u -c=0
$ sudo systemctl restart kubelet.service