About token bank selection module
Closed this issue · 1 comment
wenqi-wang20 commented
Hi, thank you for your great work. I was trying to reproduce the pre-training and fine-tuning process. However, I have several questions:
- Do the pre-training model and the fine-tuning model share the same architecture? I see only one model, in `blip2_3dvqa_t5_elm.py`. But I think there is no temporal data in the pre-training dataset. Is there still a token selection module?
- Is the `blip2_3dvqa_t5_elm` model for pre-training only? Could you please point out where the proposed token bank module is? I am sorry, but there are too many files and I cannot find it on my own.
Thank you very much!
zhouyunsong commented
Hello, thank you for your interest.
- The model architecture for pre-training and fine-tuning is the same, except that the latter has a token bank. The token bank was not published in the previous version of the code; the code has now been updated to include it.
- `advqa_t5_elm.yaml` is for pre-training only. You can check `advqa_t5_lr.yaml` for the token bank.