Redcof/vit-gpt2-image-captioning
A Image to Text Captioning deep learning model with Vision Transformer (ViT) + Generative Pretrained Transformer 2(GPT2)
A Image to Text Captioning deep learning model with Vision Transformer (ViT) + Generative Pretrained Transformer 2(GPT2)