/WCA

[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models

Primary Language: Python
License: MIT
