NVIDIA/nvidia-container-toolkit

nvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup:

Opened this issue · 0 comments

24.09.06 11:52:32.282     nuctl.platform.docker (W) Failed to run container {"err": "stdout:\n6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5\ndocker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.\n\nstderr:\n", "errVerbose": "\nError - exit status 125\n    /nuclio/pkg/cmdrunner/shellrunner.go:114\n\nCall stack:\nstdout:\n6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5\ndocker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.\n\nstderr:\n\n    /nuclio/pkg/cmdrunner/shellrunner.go:114\nstdout:\n6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5\ndocker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.\n\nstderr:\n", "stdout": "6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5\ndocker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.\n", "stderr": ""}
24.09.06 11:52:32.282                     nuctl (W) Failed to create a function; setting the function status {"err": "Failed to run a Docker container", "errVerbose": "\nError - exit status 125\n    /nuclio/pkg/cmdrunner/shellrunner.go:114\n\nCall stack:\nstdout:\n6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5\ndocker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'\nnvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.\n\nstderr:\n\n    /nuclio/pkg/cmdrunner/shellrunner.go:114\nFailed to run a Docker container\n    /nuclio/pkg/platform/local/platform.go:905\nFailed to run a Docker container"}

Error - exit status 125
    /nuclio/pkg/cmdrunner/shellrunner.go:114

Call stack:
stdout:
6abac6357c78f5337d482bbec8d339f5fe47d6736ac84d502d7cd822a1a761c5
docker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.

stderr:

    /nuclio/pkg/cmdrunner/shellrunner.go:114
Failed to run a Docker container
    /nuclio/pkg/platform/local/platform.go:905
Failed to deploy function
    ...//nuclio/pkg/platform/abstract/platform.go:227
(base) ubuntu@cvat:~/cvat-han/serverless$ nvidia-docker run --rm hello-world
nvidia-docker: command not found
(base) ubuntu@cvat:~/cvat-han/serverless$ nvidia-docker -v
nvidia-docker: command not found
(base) ubuntu@cvat:~/cvat-han/serverless$ nvidia-smi
Fri Sep  6 12:09:40 2024  

请问出现以下问题该怎么解决
错误类型: docker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed
具体错误: nvidia-container-cli: mount error: failed to add device rules: unable to find any existing device filters attached to the cgroup: bpf_prog_query(BPF_CGROUP_DEVICE) failed: operation not permitted: unknown.