SamLynnEvans/Transformer

Computational Error on PositionalEncoder()

jacobastern opened this issue · 1 comment

I read your blog post on Towards Data Science about this model, and I think there may be a computational error on line 27 of `Transformer/Embed.py`. In the paper, and in other implementations like this one, the odd dimensions should be `PE(pos, 2i+1) = math.cos(pos / (10000 ** ((2 * i)/d_model)))`, not `math.cos(pos / (10000 ** ((2 * (i + 1))/d_model)))` as the code currently stands. As written, each sin/cos pair uses two different frequencies instead of sharing one.
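For reference, here is a minimal sketch of the corrected loop (the `d_model` / `max_seq_len` values and the standalone structure here are placeholders for illustration, not a verbatim patch of `Embed.py`):

```python
import math
import torch

d_model, max_seq_len = 512, 80  # placeholder sizes, not taken from the repo

pe = torch.zeros(max_seq_len, d_model)
for pos in range(max_seq_len):
    for i in range(0, d_model, 2):
        # Each sin/cos pair must share the same frequency, so both
        # terms use the same exponent (2 * i) / d_model from the paper.
        pe[pos, i] = math.sin(pos / (10000 ** ((2 * i) / d_model)))
        # Fixed: previously 2 * (i + 1), which shifted the cos frequency.
        pe[pos, i + 1] = math.cos(pos / (10000 ** ((2 * i) / d_model)))
```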

I think @jastern33 is correct. This is the output produced by this repo's code:
[Image: positional encoding values produced by this repo's code]

And this is the corresponding output from http://nlp.seas.harvard.edu/2018/04/03/attention.html:
[Image: positional encoding values from The Annotated Transformer]
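For anyone comparing the two, here is a hedged sketch of the same encoding computed in vectorized form, in the spirit of the Harvard post (a sketch of the formula, not a verbatim copy of that implementation; all sizes are placeholders):

```python
import math
import torch

d_model, max_len = 512, 80  # placeholder sizes

position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)  # (max_len, 1)
# div_term[i] = 10000 ** (-(2i) / d_model), one frequency per sin/cos pair
div_term = torch.exp(torch.arange(0, d_model, 2, dtype=torch.float)
                     * -(math.log(10000.0) / d_model))
pe = torch.zeros(max_len, d_model)
pe[:, 0::2] = torch.sin(position * div_term)  # even dimensions
pe[:, 1::2] = torch.cos(position * div_term)  # odd dims reuse the same frequency
```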