/P2M_Image_Captioning

An implementation of the ViT-GPT2 architecture for image captioning: a Vision Transformer (ViT) encoder reads the image and a GPT-2 decoder generates the caption. The code lets you train your own model on a configurable amount of data for a chosen number of epochs.
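As a rough illustration of how a ViT encoder and a GPT-2 decoder fit together, here is a minimal sketch that assumes PyTorch and the Hugging Face `transformers` library. This repository's own code may be organized differently. The helper `build_tiny_vit_gpt2`, the tiny layer sizes, the random tensors standing in for images and captions, and the `num_samples` / `num_epochs` knobs are all hypothetical, chosen only so the example runs quickly without downloading weights.

```python
import torch
from transformers import (
    GPT2Config,
    ViTConfig,
    VisionEncoderDecoderConfig,
    VisionEncoderDecoderModel,
)


def build_tiny_vit_gpt2(vocab_size=100):
    """Build a small, randomly initialised ViT encoder + GPT-2 decoder (illustrative sizes)."""
    encoder_cfg = ViTConfig(
        image_size=32, patch_size=8, hidden_size=32,
        num_hidden_layers=1, num_attention_heads=2, intermediate_size=64,
    )
    decoder_cfg = GPT2Config(
        vocab_size=vocab_size, n_positions=16, n_embd=32,
        n_layer=1, n_head=2, bos_token_id=0, eos_token_id=1,
    )
    # Marks the GPT-2 config as a decoder with cross-attention over the ViT outputs.
    config = VisionEncoderDecoderConfig.from_encoder_decoder_configs(encoder_cfg, decoder_cfg)
    # Needed so the model can derive decoder inputs by shifting the labels right.
    config.decoder_start_token_id = 0
    config.pad_token_id = 0
    return VisionEncoderDecoderModel(config=config)


torch.manual_seed(0)
model = build_tiny_vit_gpt2()

# Stand-in data (assumption): 8 random "images" and 8 random 6-token "captions".
images = torch.randn(8, 3, 32, 32)
captions = torch.randint(2, 100, (8, 6))

# Hypothetical knobs for "amount of data" and "number of epochs".
num_samples, num_epochs = 4, 2

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
losses = []
model.train()
for epoch in range(num_epochs):
    # One full-batch step per epoch on the chosen subset.
    out = model(pixel_values=images[:num_samples], labels=captions[:num_samples])
    optimizer.zero_grad()
    out.loss.backward()
    optimizer.step()
    losses.append(out.loss.item())
```

In a real run, the random tensors would be replaced by preprocessed images and tokenized captions, and the tiny configurations by pretrained ViT and GPT-2 checkpoints.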

Primary Language: Python
