Issues
Shared GPU access of MPS type doesn't work.
#1105 opened by allyrr - 4
Unable To Use The GPU Node Pool On Azure AKS
#906 opened by sello4354 - 2
How to exclude some specific GPUs?
#935 opened by manhtukhang - 4
Enable hostNetwork = true
#962 opened by jihuiyang-x - 1
Is CVE-2024-0132 relevant to the Driver Plugin?
#967 opened by ryanpxx - 1
Does time-slicing or MPS GPU-sharing support a mode for processes to exclusively use GPU DRAM?
#966 opened by so2bin - 1
MPS control daemon wrong pod selector
#982 opened by radepajic - 8
Resources are not split when using “time slicing” with the NVIDIA device plugin for Kubernetes
#990 opened by y-shida-tg - 1
MPS Functionality Not Working Correctly in k8s-device-plugin Versions v0.15 to v0.17
#1094 opened by haitwang-cloud - 1
Using GPU to train models in a K8s Pod with the K8s Device Plugin and PyTorch framework, the training time is 6% longer compared to running on bare metal.
#1101 opened by lyon-v - 3
gpu pod Pending
#852 opened by imenselmi - 4
Containers that use cuda images in k8s do not have gpu resources, but the process id can be seen using nvidia-smi
#954 opened by ZYWNB666 - 5
general protection fault, probably for non-canonical address 0x25b5f6bb1a24827e: 0000 [#1] SMP NOPTI
#941 opened by zsksy123 - 3
Inconsistent GPU Resource Allocation with MIG and Non-MIG Profiles in Kubernetes
#1078 opened by haitwang-cloud - 1
Wrong family type detected
#943 opened by Madfish5415 - 0
Security Context Misconfiguration with vGPU Nodes in NVIDIA Device Plugin Helm Chart
#854 opened by sbathgate - 3
Jetson Devices
#984 opened by sam-cts - 4
Docker image tag v0.9.0-ubuntu20.04
#833 opened by yuliyan-valchev-ft - 3
Add gpu uuids to node labels
#1015 opened by xiongzubiao - 2
NVIDIA Device Plugin Only Exposes One GPU Out of Two GPUs Installed on Single Node
#1020 opened by amir-bialek - 4
Is there any way in the meantime to request more than 1 replica from each GPU in my node?
#929 opened by wei1793786487 - 1
Question about the DeviceSpec/Mount in the AllocateResponse
#1041 opened by gaure - 0
Support for Registering GPU Resources by Model Name (e.g., nvidia.com/A100)
#1024 opened by antonaleks - 1
Enabling MPS fails on K3S
#983 opened by santurini - 1
k8s pod: after running for a while, the GPU cannot be found in the pod. Failed to initialize NVML: Unknown Error
#981 opened by bilbilmyc - 0
Support automatic discovery of MIG devices
#992 opened by DrAuYueng - 1
`nvml init failed: ERROR_LIBRARY_NOT_FOUND` error after upgrading from `0.15.1` to `0.16.x`
#856 opened by andy108369 - 1
Security Vulnerability: Red Hat Enterprise Linux 8.10 - openldap Remote Denial of Service Vulnerability - RHSA-2024:4264
#845 opened by anjaniprayaga - 1
Helm Chart v0.16.1 not available
#848 opened by uvic-rcs - 1
Documentation for GFD
#844 opened by chipzoller - 5
README section for MPS should state `spec.hostIPC: true` is required in a Pod
#843 opened by chipzoller - 2
[k0s] `libnvidia-ml.so.1` missing in the pod
#826 opened by EKami - 0