How to get the softmax distribution?

Question

How to get the softmax distribution?

WJMacro opened this issue 3 years ago · 1 comments

Hi, Fomicheva
I'm trying to reproduce your result. But I found something tricky. In your paper you've calculated the softmax entropy at each decode step. But since the decoder uses beam search to find a better decode sequence, we got beam_size prob distributions at each decode step. How did you track the best sequence and extract its prob distribution in beam search?
Could you please share your code?

Answer 1 · 2021-05-28T10:50:36.000Z

Hi WJMacro,
We compute the entropy in a force decoding regime. That is to say, we re-score the already generated translations. So search is not an issue. I hope this helps.