/image-captioning

Transformer & CNN Image Captioning model in PyTorch.

Primary LanguagePythonMIT LicenseMIT

Experiment Tables

name num boxes note
Jun14_19-47-55_x1000c2s5b0n1 50 directly change from ResNet to DINO
Jun15_10-19-11_x1000c2s0b0n1 50 same as the first but with feature cache
Jun15_10-47-09_x1000c2s2b0n0 100 2nd + 50 boxes