s12 model Reproduction experiment

Question

Closed 2 years ago · 1 comment

I trained the s12 model on only four cards with a batch size of 240 per card, and the final top-1 accuracy is 76. Without eight cards, how can the accuracy reach 80? All other parameters are left at their defaults. Also, besides speeding up training, --apex-amp seems to affect the accuracy considerably.

Answer 1 · 2022-12-14T06:14:03.000Z

Hi @starsky68 ,

Thanks for your interest. Please refer to the train.py file in the metaformer repo, where I added --grad-accum-steps so that you can train with a larger effective batch size on fewer cards.
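As a rough illustration of how gradient accumulation makes up for missing cards, the arithmetic can be sketched as follows. This is only a sketch: the helper names are hypothetical, and the eight-card target assumes the same 240 samples per card as in the question, which the thread does not confirm.

```python
import math


def effective_batch_size(per_card_batch, num_cards, grad_accum_steps=1):
    """Samples contributing to one optimizer update."""
    return per_card_batch * num_cards * grad_accum_steps


def accum_steps_needed(target_batch, per_card_batch, num_cards):
    """Smallest accumulation count that reaches at least target_batch."""
    return math.ceil(target_batch / (per_card_batch * num_cards))


# Setup from the question: four cards, 240 samples per card.
four_card_batch = effective_batch_size(240, 4)

# Assumed target: eight cards at the same per-card batch size (hypothetical).
eight_card_batch = effective_batch_size(240, 8)

# Accumulation steps the four-card run would need to match that target.
steps = accum_steps_needed(eight_card_batch, 240, 4)
matched_batch = effective_batch_size(240, 4, steps)
```

Under these assumptions, the value of `steps` is what would be passed to --grad-accum-steps so that each optimizer update on four cards sees as many samples as one update on eight cards.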