/pytorch-vit

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Primary LanguagePythonMIT LicenseMIT

Watchers