/multimodal-patch-embeddings

A ViT model that allows comparing text embeddings directly with image patch embeddings

Primary LanguagePythonMIT LicenseMIT

Watchers

No one’s watching this repository yet.