awoo_vit

Prerequisites
- Python 3.6
- PyTorch 1.10.0

References
- vit-pytorch
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
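
The prerequisites named in this README (Python 3.6, PyTorch 1.10.0) can be checked before running anything. The snippet below is a minimal sketch, not part of the repository: the helper names `parse_version` and `check_environment` are hypothetical, and it assumes the listed versions are minimums rather than exact pins, which the README does not state.

```python
import re
import sys
from importlib import util

# Versions taken from the prerequisites list; treated here as minimums (an assumption).
REQUIRED_PYTHON = (3, 6)
REQUIRED_TORCH = (1, 10, 0)


def parse_version(text):
    """Turn a version string such as '1.10.0+cu113' into a tuple like (1, 10, 0)."""
    core = text.split("+")[0]  # drop a local build suffix such as '+cu113'
    numbers = []
    for piece in core.split(".")[:3]:
        match = re.match(r"\d+", piece)  # keep only the leading digits of each part
        numbers.append(int(match.group()) if match else 0)
    return tuple(numbers)


def check_environment():
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info[:2] < REQUIRED_PYTHON:
        problems.append("Python %d.%d or newer is expected" % REQUIRED_PYTHON)
    if util.find_spec("torch") is None:
        problems.append("PyTorch is not installed")
    else:
        import torch

        found = parse_version(str(torch.__version__))
        if found < REQUIRED_TORCH:
            problems.append("PyTorch %d.%d.%d or newer is expected" % REQUIRED_TORCH)
    return problems


if __name__ == "__main__":
    issues = check_environment()
    for issue in issues:
        print("warning:", issue)
    if not issues:
        print("environment looks compatible")
```

If the project turns out to need exactly these versions, the `<` comparisons can be replaced with `!=`.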