/VisionTransformers

Vision Transformers (ViTs) are state-of-the-art methods for image classification and object detection. An image is divided into patches, which are combined with positional encodings and fed into a transformer encoder.
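The patching step can be illustrated with a small, dependency-free sketch. It is not taken from this repository: the function names `split_into_patches`, `sinusoidal_positions`, and `add_positions` are hypothetical, the fixed sine/cosine encoding is an assumption (many ViT variants learn their positional embeddings instead), and the linear projection that usually precedes the encoder is left out for brevity.

```python
import math


def split_into_patches(image, patch_size):
    """Split an H x W x C image (nested lists) into flattened, non-overlapping patches.

    Patches are returned in row-major order; each patch is a flat list of
    patch_size * patch_size * C values.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = []
            for row in range(top, top + patch_size):
                for col in range(left, left + patch_size):
                    patch.extend(image[row][col])
            patches.append(patch)
    return patches


def sinusoidal_positions(num_positions, dim):
    """Fixed sine/cosine positional encodings (an assumption, not the repo's method)."""
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(dim):
            angle = pos / (10000 ** (2 * (i // 2) / dim))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table


def add_positions(patches):
    """Add a positional encoding to every patch vector, element-wise."""
    positions = sinusoidal_positions(len(patches), len(patches[0]))
    return [[v + p for v, p in zip(patch, pos)] for patch, pos in zip(patches, positions)]


# Toy 4 x 4 RGB image whose pixel values encode their own (row, col) coordinates.
image = [[[float(r), float(c), 0.0] for c in range(4)] for r in range(4)]
patches = split_into_patches(image, patch_size=2)
tokens = add_positions(patches)
print(len(tokens), len(tokens[0]))  # 4 patches, each of length 2 * 2 * 3 = 12
```

The resulting `tokens` list is the sequence a transformer encoder would consume, one vector per patch. In practice this would be done with a tensor library rather than nested lists; the plain-Python version is only meant to make the indexing explicit.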

Primary language: Python. License: Apache License 2.0 (Apache-2.0).
