question about the needed iteration number
Opened this issue · 4 comments
Dear author,
Your work is really interesting and insightful.
When reproduce your experiment results, I found that iteration number of each experiment is set to 2k.
However, I guess the needed iteration number for each dataset (mnist, cifar...) varies and 2k iteration is a little bit time consuming.
Could you please share your opinion on it.
Also it would be better if you could share the GPU hours of each experiment.
Thanks a lot!
Thank you for your interest in our work! You are right, on many datasets, we do not need 2k iterations to achieve the desired performance. Generally speaking, the best results we claim in the paper are usually produced within 1.2k iterations. The main consideration of the default setting of 2k iterations is to ensure that a better extreme point is found. If you are more concerned about distillation efficiency than performance, you can choose to stop early : )
As for GPU hours, it may be different on different devices. And welcome to try our enhanced version DREAM+!
Thanks for your prompt reply, it helps me a lot!
You can add CUDA_VISIBLE_DEVICES=N to the running command to make it.