NVIDIA/gds-nvidia-fs

NVMe Driver not registered with nvidia-fs

KANIOYH opened this issue · 2 comments

Hello, I am trying to use gds-nvidia-fs.
When I run the sample code under cuda/gds/samples, I see errors in cufile.log.
Here is the log:

#cufile_sample_001
 27-06-2023 16:20:37:235 [pid=41294 tid=41294] ERROR  cufio-obj:101 error allocating nvfs handle, size: 131072
 27-06-2023 16:20:37:235 [pid=41294 tid=41294] ERROR  cufio:1185 cuFileBufRegister error, object allocation failed
 27-06-2023 16:20:37:235 [pid=41294 tid=41294] ERROR  cufio:1236 cuFileBufRegister error cufile success
 27-06-2023 16:20:37:236 [pid=41294 tid=41294] ERROR  cufio:1275 cuFileBufDeregister error, object for device pointer is not registered
 27-06-2023 16:20:37:236 [pid=41294 tid=41294] ERROR  cufio:1323 cuFileBufDeregister error: device pointer lookup failure
#cufile_sample_002
 27-06-2023 16:20:53:48 [pid=41337 tid=41337] ERROR  cufio-fs:204 NVMe Driver not registered with nvidia-fs!!!
 27-06-2023 16:20:53:48 [pid=41337 tid=41337] ERROR  cufio-fs:204 NVMe Driver not registered with nvidia-fs!!!
 27-06-2023 16:20:53:48 [pid=41337 tid=41337] NOTICE  cufio-fs:408 dumping volume attributes: DEVNAME:/dev/nvme1n1,ID_FS_TYPE:ext4,ID_FS_USAGE:filesystem,UDEV_PCI_BRIDGE:0000:80:00.0,device/transport:pcie,fsid:e23f580faebd273f0x,numa_node:1,queue/logical_block_size:4096,wwid:nvme.8086-50484d42373432333030425232383043474e-494e54454c2053534450454431443238304741-00000001,
 27-06-2023 16:20:53:48 [pid=41337 tid=41337] NOTICE  cufio:1036 cuFileHandleRegister GDS not supported or disabled by config, using cuFile posix read/write with compat mode enabled
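
To make it easier to see which sample run produced which errors, the log above can be grouped by process ID. The following is a minimal sketch, not part of the GDS tooling: the line layout (date, time, `[pid=… tid=…]`, level, `source:line`, message) is inferred only from the excerpt above, and the names `LINE_RE`, `LogEntry`, `parse_line`, and `errors_by_pid` are hypothetical helpers introduced for illustration.

```python
import re
from typing import Dict, Iterable, List, NamedTuple, Optional

# Assumed layout, inferred from the excerpt above (not an official format spec):
#   DD-MM-YYYY HH:MM:SS:ms [pid=N tid=N] LEVEL  source:line message
LINE_RE = re.compile(
    r"^\s*(?P<date>\d{2}-\d{2}-\d{4}) (?P<time>\d{2}:\d{2}:\d{2}:\d+) "
    r"\[pid=(?P<pid>\d+) tid=(?P<tid>\d+)\] "
    r"(?P<level>[A-Z]+)\s+(?P<source>[\w-]+):(?P<line>\d+) (?P<message>.*)$"
)


class LogEntry(NamedTuple):
    pid: int
    level: str
    source: str  # e.g. "cufio-fs:204"
    message: str


def parse_line(text: str) -> Optional[LogEntry]:
    """Return a LogEntry, or None if the line does not match the assumed layout."""
    m = LINE_RE.match(text)
    if m is None:
        return None
    return LogEntry(
        pid=int(m["pid"]),
        level=m["level"],
        source=f'{m["source"]}:{m["line"]}',
        message=m["message"],
    )


def errors_by_pid(lines: Iterable[str]) -> Dict[int, List[LogEntry]]:
    """Collect ERROR entries, keyed by the process that logged them."""
    grouped: Dict[int, List[LogEntry]] = {}
    for raw in lines:
        entry = parse_line(raw)
        if entry is not None and entry.level == "ERROR":
            grouped.setdefault(entry.pid, []).append(entry)
    return grouped


# A few lines copied from the log above, plus one non-log marker line.
SAMPLE = [
    " 27-06-2023 16:20:37:235 [pid=41294 tid=41294] ERROR  cufio-obj:101 error allocating nvfs handle, size: 131072",
    " 27-06-2023 16:20:53:48 [pid=41337 tid=41337] ERROR  cufio-fs:204 NVMe Driver not registered with nvidia-fs!!!",
    " 27-06-2023 16:20:53:48 [pid=41337 tid=41337] NOTICE  cufio:1036 cuFileHandleRegister GDS not supported or disabled by config, using cuFile posix read/write with compat mode enabled",
    "#cufile_sample_002",
]

if __name__ == "__main__":
    for pid, entries in errors_by_pid(SAMPLE).items():
        for e in entries:
            print(pid, e.source, e.message)
```

To run it against a real file, pass the lines of cufile.log to `errors_by_pid` in place of `SAMPLE`. Lines that do not match the assumed layout, such as the `#cufile_sample_002` markers, are skipped.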

Run output:

xxx@xxx:/usr/local/cuda-11.7/gds/samples$ sudo ./cufile_sample_001 /OPTANE/test_file 0
opening file /OPTANE/test_file
registering device memory of size :131072
writing from device memory
written bytes :131072
deregistering device memory
buffer deregister failed:device pointer lookup failure

xxx@xxx:/usr/local/cuda-11.7/gds/samples$ sudo ./cufile_sample_002 /OPTANE/test_file 0
opening file /OPTANE/test_file
allocating device memory of size :1048576 gpu id: 0
writing from gpuid: 0
written bytes :1048576

How do I register the NVMe driver with nvidia-fs?
Thanks.

@KANIOYH Would this be helpful to you?
#4 (comment)

thanks!