Does the training of EVA involve masking text (caption) tokens?
Closed this issue · 1 comment
leyangjin commented
I am new to this area. I just want to check whether training the EVA model involves masking text (caption) tokens, or whether it only involves masking image patches.
Thank you so much for your help.
Quan-Sun commented
@leyangjin Only image patches are masked; text (caption) tokens are not.
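For readers new to the idea, here is a minimal sketch of what "masking image patches" can look like. It is not taken from the EVA codebase: the function names, the uniform random sampling, the 0.4 mask ratio, and the `"[MASK]"` placeholder are all assumptions made for illustration. The mask is computed over image patches only, so nothing in this step touches text.

```python
import random


def make_patch_mask(num_patches, mask_ratio, seed=None):
    """Return a list of booleans; True marks an image patch to be masked.

    Hypothetical helper: uniform random sampling is an assumption here,
    not necessarily the strategy used by EVA.
    """
    rng = random.Random(seed)
    num_masked = int(num_patches * mask_ratio)
    masked_idx = set(rng.sample(range(num_patches), num_masked))
    return [i in masked_idx for i in range(num_patches)]


def apply_patch_mask(patches, patch_mask, mask_token="[MASK]"):
    """Replace masked patches with a placeholder; keep visible patches as-is."""
    return [mask_token if masked else patch
            for patch, masked in zip(patches, patch_mask)]


# Toy example: 10 "patches" represented by strings.
patches = [f"patch_{i}" for i in range(10)]
patch_mask = make_patch_mask(len(patches), mask_ratio=0.4, seed=0)
masked_patches = apply_patch_mask(patches, patch_mask)
```

In a real model the patches would be embedding vectors and the placeholder a learned mask embedding, but the bookkeeping is the same: a boolean mask over patch positions decides which ones the model has to reconstruct.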