A PyTorch implementation of the Transformer model in "Attention is All You Need".
Primary LanguagePythonMIT LicenseMIT