open-mmlab/mmdetection

Training GLIP on a single server with 8 RTX 3090 GPUs took 4 days, but training on two servers with 16 GPUs took 9 days. Is this normal?

CLL112 opened this issue · 1 comment

Could a slow network connection between the two servers be causing this, or is this speed expected?
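
Multi-node training that is slower than single-node usually points to the gradient all-reduce being bottlenecked by the inter-node link: RTX 3090 servers generally have no InfiniBand between them, so NCCL falls back to plain Ethernet, and every training step has to push the full gradient volume over that link. One way to check is to time a raw all-reduce across both servers, independent of the training code. Below is a minimal sketch assuming PyTorch's `torch.distributed` with the NCCL backend (what MMDetection's distributed training uses); the script name and master address are placeholders.

```python
# bench_allreduce.py -- rough all-reduce bandwidth probe (assumed name).
# Launch on both servers with torchrun so it mirrors the 2-node setup, e.g.:
#   torchrun --nnodes=2 --nproc_per_node=8 --node_rank=<0|1> \
#            --master_addr=<server-0-ip> --master_port=29500 bench_allreduce.py
import os
import time

import torch
import torch.distributed as dist


def main():
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # 256 MiB of fp32 -- on the order of one model's gradients per step.
    tensor = torch.ones(64 * 1024 * 1024, device="cuda")
    nbytes = tensor.numel() * tensor.element_size()

    # Warm up so NCCL's ring setup is not counted in the timing.
    for _ in range(5):
        dist.all_reduce(tensor)
    torch.cuda.synchronize()

    iters = 20
    start = time.perf_counter()
    for _ in range(iters):
        dist.all_reduce(tensor)
    torch.cuda.synchronize()
    elapsed = time.perf_counter() - start

    if dist.get_rank() == 0:
        # Ring all-reduce bus bandwidth: 2 * (n - 1) / n * bytes / time.
        n = dist.get_world_size()
        bus_bw = 2 * (n - 1) / n * nbytes * iters / elapsed / 1e9
        print(f"all_reduce {nbytes / 2**20:.0f} MiB x {iters}: "
              f"{elapsed / iters * 1e3:.1f} ms/iter, ~{bus_bw:.1f} GB/s bus bandwidth")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

If the probe reports bandwidth in the ~1 GB/s range (i.e., 10 GbE or slower), inter-node gradient synchronization will dominate each step, and the 16-GPU run being slower than the 8-GPU run is plausible rather than a bug. Running with `NCCL_DEBUG=INFO` prints which interface and transport NCCL picked, and `NCCL_SOCKET_IFNAME` can pin it to a specific NIC.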