Res2Net/Res2Net-PretrainedModels

Training hyperparameters

wangbin2018 opened this issue · 6 comments

Hello, where can I find the training hyperparameters?

gasvn commented

You can refer to this repo to reproduce our results. https://github.com/pytorch/examples/blob/master/imagenet/main.py
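For reference, the linked `pytorch/examples` ImageNet script uses (at the time of writing; double-check against the current version of that repo) SGD with lr 0.1, momentum 0.9, weight decay 1e-4, a total batch size of 256, and the learning rate divided by 10 every 30 epochs. The step-decay part can be sketched as:

```python
def step_decay_lr(base_lr: float, epoch: int,
                  step: int = 30, gamma: float = 0.1) -> float:
    """Learning rate after step decay: base_lr * gamma ** (epoch // step).

    Mirrors the adjust_learning_rate logic in pytorch/examples/imagenet
    (assumed defaults: base_lr=0.1, decay by 10x every 30 epochs).
    """
    return base_lr * gamma ** (epoch // step)

# Example: epochs 0-29 train at 0.1, epochs 30-59 at 0.01, epochs 60+ at 0.001.
```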

I trained res2net-v1b for about 200 epochs but only got 78.37% top-1 (lr = 0.2, batch size 128 per GPU on 8 GPUs); the other hyperparameters are the same. That is about 2% lower — how should I adjust?
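One possible factor (a common heuristic, not something the authors state they used) is the linear scaling rule: scale the base learning rate proportionally to the total batch size relative to a reference batch of 256. Under that rule, 128 per GPU on 8 GPUs (total batch 1024) would call for a larger learning rate than 0.2:

```python
def linear_scaled_lr(base_lr: float, total_batch: int,
                     base_batch: int = 256) -> float:
    """Linear scaling rule: lr grows proportionally with total batch size.

    Hypothetical helper; base_lr=0.1 at batch 256 is an assumed reference.
    """
    return base_lr * total_batch / base_batch

# With total batch 128 * 8 = 1024, a base lr of 0.1 at batch 256
# scales to 0.4 rather than the 0.2 used above.
```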

Excuse me, may I ask one more question? Your paper says you use a mini-batch of 256 on 4 Titan Xp GPUs. Is 256 the total batch size (i.e., 64 per GPU) or the per-GPU batch size? And are the hyperparameters of res2net_v1b the same as for res2net?

gasvn commented

The v1b uses a trick we designed that is similar to mixup. Sorry, I cannot share it since we are still working on it. Using mixup and a cosine learning-rate schedule for 300 epochs can get a similar result. If you are going to compare your method with Res2Net, you can just refer to the results in the paper.
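The suggested recipe (mixup plus cosine learning-rate decay over 300 epochs) can be sketched as below. This is a minimal stdlib illustration, not the authors' code; `alpha = 0.2` is an assumed mixup hyperparameter:

```python
import math
import random

def cosine_lr(base_lr: float, epoch: int, total_epochs: int = 300) -> float:
    """Cosine annealing from base_lr at epoch 0 down to 0 at total_epochs."""
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * epoch / total_epochs))

def mixup_pair(x1, x2, alpha: float = 0.2):
    """Mix two input vectors with lam ~ Beta(alpha, alpha), as in the mixup paper.

    The training loss is mixed the same way:
    lam * loss(pred, y1) + (1 - lam) * loss(pred, y2).
    """
    lam = random.betavariate(alpha, alpha)
    mixed = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    return mixed, lam
```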

Thank you very much! So does the common version of Res2Net also use the trick? And do you train the network with fp16? A batch size of 256 per GPU can lead to OOM. I'm very interested in your work~

gasvn commented

The res2net_v1b is designed for better downstream-task performance, not for comparison. I didn't use fp16, and no OOM occurs on GPUs with 11 GB of memory.