Changed the code to use multiple GPUs, but training does not speed up.
Opened this issue · 2 comments
I used DataParallel to start multi-GPU training and increased the batch size in config.yaml, but training did not speed up.
I've also tried multiple GPUs and had the same issue as you.
I found that the biggest overhead is in the GE2ELoss part, especially computing the cosine similarity matrix and calculating the loss.
Just copy and paste this code into your utils.py.
I don't know why the author hasn't merged this code yet, but it is much faster than the original code.
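The linked code isn't shown here, but the general fix is to replace per-utterance Python loops with batched tensor operations when building the GE2E similarity matrix. The sketch below is a hypothetical vectorized version (not the author's exact code): it computes each utterance's cosine similarity to every speaker centroid in one `einsum`, using leave-one-out centroids for the utterance's own speaker, as the GE2E paper specifies.

```python
import torch
import torch.nn.functional as F

def get_cossim(embeddings):
    """Vectorized GE2E similarity matrix (illustrative sketch).

    embeddings: tensor of shape (N, M, D) -- N speakers, M utterances each,
    D-dimensional embeddings. Returns a (N, M, N) cosine similarity matrix.
    """
    N, M, D = embeddings.shape
    centroids = embeddings.mean(dim=1)                     # (N, D)
    # Leave-one-out centroid for each utterance of its own speaker:
    # (sum of the speaker's utterances minus this one) / (M - 1)
    loo = (embeddings.sum(dim=1, keepdim=True) - embeddings) / (M - 1)  # (N, M, D)
    # Cosine similarity of every utterance to every full centroid
    e = F.normalize(embeddings, dim=2)                     # (N, M, D)
    c = F.normalize(centroids, dim=1)                      # (N, D)
    sim = torch.einsum('nmd,kd->nmk', e, c)                # (N, M, N)
    # For an utterance's own speaker, use the leave-one-out centroid instead
    loo_sim = F.cosine_similarity(embeddings, loo, dim=2)  # (N, M)
    idx = torch.arange(N)
    sim[idx, :, idx] = loo_sim
    return sim
```

The key point is that there are no Python loops over speakers or utterances, so the whole matrix is built from a handful of batched kernels, which is what makes it faster on GPU than the loop-based original.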
Thank you very much, I will try the code from the link.