mattshma/bigdata

Failed to start gpuManager stat /dev/nvidia*: no such file or directory

mattshma opened this issue · 0 comments

kubelet hung on startup. Its log showed the following errors:

Failed to start gpuManager stat /dev/nvidiactl: no such file or directory
Failed to start gpuManager stat /dev/nvidia-uvm: no such file or directory

Running the following commands resolved the issue:

$ sudo nvidia-modprobe -u -c=0
$ sudo systemctl restart kubelet.service
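
For context, `nvidia-modprobe -u` loads the NVIDIA Unified Memory kernel module and creates its device file, and `-c=0` creates the device file for minor number 0. A minimal sketch for checking whether the device nodes named in the log exist before restarting kubelet is shown below. The helper name `missing_nodes` is hypothetical and not part of any NVIDIA or Kubernetes tooling; it only tests whether the given paths exist.

```shell
#!/bin/sh
# missing_nodes (hypothetical helper): print every path passed as an
# argument that does not exist, one per line. Prints nothing if all exist.
missing_nodes() {
    for node in "$@"; do
        [ -e "$node" ] || printf '%s\n' "$node"
    done
}
```

One possible use, assuming the two paths from the kubelet log above are the ones of interest:

    $ missing_nodes /dev/nvidiactl /dev/nvidia-uvm

If this prints nothing, both nodes are present. If it prints either path, running the `nvidia-modprobe` command above before restarting kubelet is the step that worked in this case.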