/VisionTransformer

Implementation of 'An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale' ICLR 2020 and a documentation about its historical and technical background.

Primary LanguagePython

Watchers