epsilon / np.sqrt(embedding_dim)

Question

glcLucky opened this issue 4 years ago · 1 comment

I think this formula may be wrong. Should it be changed to np.sqrt(epsilon / embedding_dim)?
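
To make the difference concrete, here is a minimal sketch comparing the two expressions. The values of epsilon and embedding_dim and the helper names current_scale and proposed_scale are hypothetical, chosen only for illustration; they do not come from the repository. The L2-norm comparison assumes the expression is used as a per-component magnitude of a vector of length embedding_dim, which this thread does not confirm.

```python
import numpy as np


def current_scale(epsilon, embedding_dim):
    # Expression as it appears in the issue title.
    return epsilon / np.sqrt(embedding_dim)


def proposed_scale(epsilon, embedding_dim):
    # Expression proposed in the question.
    return np.sqrt(epsilon / embedding_dim)


# Hypothetical values, for illustration only.
epsilon, embedding_dim = 0.01, 768

a = current_scale(epsilon, embedding_dim)
b = proposed_scale(epsilon, embedding_dim)

# Algebraically, sqrt(epsilon / d) = sqrt(epsilon) / sqrt(d), so the two
# expressions differ by a factor of 1 / sqrt(epsilon) and agree only when
# epsilon is 0 or 1.
ratio = b / a

# Assumption: the value is the magnitude of each of the embedding_dim
# components of a vector. Then the current form gives an L2 norm of epsilon,
# while the proposed form gives an L2 norm of sqrt(epsilon).
norm_current = np.linalg.norm(np.full(embedding_dim, a))
norm_proposed = np.linalg.norm(np.full(embedding_dim, b))

print(ratio, norm_current, norm_proposed)
```

If the intent is to bound the overall L2 norm by epsilon, the current form matches that reading; if epsilon is meant as a squared quantity (a variance-like budget), the proposed form would match instead. Which one applies depends on how the surrounding code uses the result.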

Answer 1 · 2021-07-14T09:55:25.000Z

Why do you say so?