Visualizations of interesting patterns on GPT-2 and BERT positional encodings.
Primary LanguageJupyter Notebook