Modify model structure
Opened this issue · 1 comments
CQYIO commented
hi.
Have you considered modifying the feature extraction structure of images and text.
Do you think you can use VIT(Visioni transformer) to replace it.
Daydaylight commented
Hello, is there any improvement in the effect of this alternative method?