/ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Primary LanguageJupyter NotebookMIT LicenseMIT

Watchers