/rosita

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers