Some question about the result
Zora137 opened this issue · 4 comments
hello,sorry to bother you many times
this is the result I run your code by this command without pretain
python train.py --data_path path/to/your/data --model_name mytrain --num_epochs 30 --batch_size 12 --lr 0.0001 5e-6 31 0.0001 1e-5 31
It cannot reach the resulin your papert without pretain:
is this normal? or did dosomething wrong?
expect your answer,thank you so much
Hi, the results are too bad. As stated in our paper:
For models trained from scratch an initial learning rate of 5e−4 with a cosine learning rate schedule [26] is adopted, and the training epoch is set to 35.
Could you please try using a larger initial learning rate?
Hi, the results are too bad. As stated in our paper:
For models trained from scratch an initial learning rate of 5e−4 with a cosine learning rate schedule [26] is adopted, and the training epoch is set to 35.
Could you please try using a larger initial learning rate?
hi nice work !!!
can you show me your args file ,I do many times the ruseltalways this
Hi, the results are too bad. As stated in our paper:
For models trained from scratch an initial learning rate of 5e−4 with a cosine learning rate schedule [26] is adopted, and the training epoch is set to 35.
Could you please try using a larger initial learning rate?
hi nice work !!! can you show me your args file ,I do many times the ruseltalways this
Hi, you can try setting the learning rate to --lr 0.0001 5e-6 16 0.0001 1e-5 16
. drop_path
can be set to 0.3
. But this might cause your training not converging. Please make sure you are using the same dependencies as we used. #58
Also, please check the results of each epoch, not only the last epoch. The best result should be achieved at an earlier epoch.