/Transformer-vs-bahdanau-attention

Comparison between Bahdanau attention in seq2seq models and Transformers in the translation task

Primary LanguageJupyter Notebook

Watchers