About the pre-training problems
Closed this issue · 1 comments
imzhangsheng commented
As mentioned in the README.md, the first stage of training the pre-trained model requires fixed ViT, and the second stage is training ViT in an end-to-end manner. My question is whether to load the model trained in the first stage during the second stage of training?
cshizhe commented
The model trained in the first stage should be loaded in the second training stage.