/VQ-ViT-Bert

🏞🤖 Masked token prediction for bidirectional vision transformers based on vector quantized latent representations

Primary LanguagePythonMIT LicenseMIT

Stargazers