A rough implementation of transformer architecture from "Attention is All You Need" paper.
Primary LanguagePythonApache License 2.0Apache-2.0