A beginner's attempt to understand and implement the Vision Transformer paper.
Primary Language: Jupyter Notebook