[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
Primary Language: Python | License: MIT