/Attention-is-All-You-Need

Implementation of the Transformer model from scratch.

Primary LanguageJupyter Notebook