Mellanox/k8s-rdma-shared-dev-plugin

Unable to find the Infiniband/RoCE device when using the Ubuntu 18.04 image

thincal opened this issue · 6 comments

Created a container with the ubuntu 18.04 image and using the rdma-shared device plugin, inside the container when running ib_write_bw it reports bellow error, but with ubuntu 20.04/22.04 it works well. so what's the reason behind causing this issue ? appreciated for any information.

Did not detect devices 
If device exists, check if driver is up
Unable to find the Infiniband/RoCE device

@adrianchiris Hi, do you have any info with this issue ? thanks.

i believe its related to the perftest version being used in the workload container vs the RDMA api exposed by the kernel running on the node.

what is the OS of the k8s worker node ?

i dont think its related to rdma shared device plugin

what is the OS of the k8s worker node ?

Ubuntu 22.04.2 LTS

i believe its related to the perftest version being used in the workload container vs the RDMA api exposed by the kernel running on the node.

so that is the reason.