Tool for visualizing attention in the Transformer model (BERT, GPT-2, XLNet, and RoBERTa)
Primary LanguageJupyter NotebookApache License 2.0Apache-2.0