VQ-VAE implementation using Vision Transformers for both the encoder and decoder
Primary LanguagePython