Does DRA support multiple GPUs across worker nodes?
thj08 opened this issue · 4 comments
I want the master node to be able to allocate available GPUs across different worker nodes. Does DRA support multiple GPUs across worker nodes?
So far DRA enables resources in a per-Node scenario, given its interaction with the Kubelet. What you are asking for is multi-node DRA, if I understand correctly.
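To illustrate what "per-Node" means in practice, here is a toy sketch in Python. It is not DRA's actual allocation code; the function `pick_node`, the node names, and the GPU counts are all hypothetical. It only models the constraint that a request has to be satisfied by a single node, so GPUs on different nodes cannot be pooled.

```python
from typing import Dict, Optional


def pick_node(free_gpus: Dict[str, int], requested: int) -> Optional[str]:
    """Return a node that can satisfy the whole request by itself, else None.

    Hypothetical helper: it models per-node allocation only and never
    sums GPUs across nodes.
    """
    for node, free in sorted(free_gpus.items()):
        if free >= requested:
            return node
    return None


# Hypothetical cluster: three workers with one free GPU each.
cluster = {"worker-1": 1, "worker-2": 1, "worker-3": 1}

one_gpu = pick_node(cluster, 1)     # fits on a single node
three_gpus = pick_node(cluster, 3)  # no single node has three GPUs

print(one_gpu)
print(three_gpus)
```

Under this model a request for one GPU can be placed, while a request for three GPUs finds no node, even though the cluster has three GPUs in total.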
Thanks for the reply.
In my actual setup, the cluster has 1 master node and 3 worker nodes, and every worker node has one GPU. Is it possible to run a container on one worker node that uses all 3 GPUs?
By the way, I also tried the DRA demo project, which uses kind for a local cluster. Does DRA support remote clusters?
I think you are talking about DRA and CXL integration. This has been discussed as one of the use cases, but as far as I understand it is not currently implemented.