/visual-transformer-paper-replication-with-pytorch

This repository contains code for implementing the Visual Transformer (ViT) model introduced in the research paper "An Image is worth 16x16 words". The model in implemented with pytorch.

Primary LanguageJupyter Notebook

Watchers