About the pre-training problems

Question

About the pre-training problems

Closed this issue a year ago · 1 comments

As mentioned in the README.md, the first stage of training the pre-trained model requires fixed ViT, and the second stage is training ViT in an end-to-end manner. My question is whether to load the model trained in the first stage during the second stage of training?

Answer 1 · 2023-09-12T12:26:05.000Z

The model trained in the first stage should be loaded in the second training stage.