mlpc-ucsd/CoaT

About AMP and batch size

Closed this issue · 2 comments

Hi,
I'm very impressed by your excellent work! Thanks for sharing your code.

I have questions about the training protocol.

In your paper,

"We train all models with a global batch size of 2048 with the NVIDIA Automatic Mixed Precision(AMP) enabled."

but the training script specifies a batch size of 256, not 2048.

I have two questions about this.

  1. Can I reproduce the reported accuracy with the command in this repo (batch size = 256 instead of 2048)?

  2. Does this repo use AMP?

Thanks in advance :)

Hi @youngwanLEE, thank you for your interest in our work!

  1. Regarding the batch size: 256 is the batch size per GPU. The default training command uses 8 GPUs, so the global batch size is 256 × 8 = 2048 (see the sketch below). You should be able to reproduce accuracy close to our reported results using the command provided in this repo.

  2. Yes, AMP is enabled by default.
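
For illustration, here is a minimal sketch of how the per-GPU batch size and mixed precision typically fit together in a PyTorch training step. The values and the use of `torch.cuda.amp` here are assumptions for demonstration, not an excerpt from this repo's training script:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical values mirroring the paper's setting.
batch_size_per_gpu = 256
num_gpus = 8
global_batch_size = batch_size_per_gpu * num_gpus  # 256 * 8 = 2048

# Minimal AMP training step using torch.cuda.amp; whether this repo's script
# uses exactly this API is an assumption.
model = nn.Linear(224, 1000).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scaler = torch.cuda.amp.GradScaler()

x = torch.randn(batch_size_per_gpu, 224, device="cuda")
target = torch.randint(0, 1000, (batch_size_per_gpu,), device="cuda")

optimizer.zero_grad()
with torch.cuda.amp.autocast():      # forward pass runs in mixed precision
    loss = F.cross_entropy(model(x), target)
scaler.scale(loss).backward()        # scale the loss to avoid fp16 underflow
scaler.step(optimizer)
scaler.update()
```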

@xwjabc Thanks for your quick reply :)