utkuozdemir/nvidia_gpu_exporter

MIG instances support

Jeff-liaojinfu opened this issue · 1 comments

After the gpu mig function is enabled, Gpu-util cannot obtain the GPU usage. want to add metrics to obtain the usage of the entire gpu and MIG instances

+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.86.10 Driver Version: 535.86.10 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA A800-SXM4-80GB Off | 00000000:18:00.0 Off | On |
| N/A 33C P0 87W / 400W | 36537MiB / 81920MiB | N/A Default |
| | | Enabled |

same issue, Can we add the MIG support