About token bank selection module

Question

Closed this issue 7 days ago · 1 comments

Hi, thank you for your great work. I was trying to reproduce the pre-training and fine-tuneing process. However, I have several questions:

Are the pre-training model and fine-tuning model with the same arch? I saw there is only one model in blip2_3dvqa_t5_elm.py. But I think there is no temporal data in pre-training dataset? Is there still a token selection module?
Is the blip2_3dvqa_t5_elm model for only pretraining? Could you please point out that where is the proposed token bank module? I am so sorry that there are too many files and I can not find them by my own.

Thank you very much!

Answer 1 · 2025-01-02T12:01:01.000Z

Hello, thank you for your interest.

The model arch for pre-training and fine-tuning is the same, except that the latter will have a token bank, which was not published in the previous version of the code and has been updated.
advqa_t5_elm.yaml is for pre-training only, you can check advqa_t5_lr.yaml for token bank.