WendellGul/AGAH

Modify model structure

Opened this issue · 1 comments

CQYIO commented

hi.
Have you considered modifying the feature extraction structure of images and text.
Do you think you can use VIT(Visioni transformer) to replace it.

Hello, is there any improvement in the effect of this alternative method?