Project for the Vision and Language course at UVa.
We explore different models for image-text pair generation based on the WikiArt and MET datasets.