
This repository contains a (non-exhaustive) overview of follow-up works based on the original Vision Transformer (ViT) by Google. Feel free to open a PR to add more papers!


New pre-training objectives:

New pre-training tricks and techniques:

Architectural changes:

Investigations of the inner workings (cf. BERTology):

Applying ViT to other domains besides image classification: