how to enable multi-gpu training in each node
Dav-Jay opened this issue · 2 comments
Dav-Jay commented
Thanks for the excellent work!
When I tried the provided train_inception.sh
for CIFAR10 on two nodes, I found only one GPU is invoked in each node (GPU 0), is there a way to use multiple gpus in each node? In original Caffe, I can use --gpu=0,1,2,3 which seems not work in geeps.
AIROBOTAI commented
Seems current geeps does not support multi-gpu in each node?
cuihenggang commented
Unfortunately, the current GeePS implementation does not support multi-gpu in each node. Feel free to implement that and create a pull request :)