shreydan/VisionGPT2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

Jupyter Notebook

1 Issue
28 Stargazers
2 Watchers

Stargazers

1nyourlife
90r
alptekinnege
bil-ash
Chanakan5591 (Thailand)
chansky6
dx-dtran
HakeoungLee (The University of Texas at Austin)
harrychih (Baltimore, MD)
javjimb (Berlin)
lzzzx666 (UNIVERSITY OF ILLINOIS AT URBANA - CHAMPAIGN)
pathquester
peternasser99 (egypt)
shreydan (India)
SkAndMl (Nanyang Technological University)
Sushmitha1703 (chennai)
TamakoRan (中央民族大学)
twardoch (Fontlab Ltd.)
William-HTP
zz242msu
