facebookresearch/mlqe

How to get the softmax distribution?

WJMacro opened this issue · 1 comments

Hi, Fomicheva
I'm trying to reproduce your result. But I found something tricky. In your paper you've calculated the softmax entropy at each decode step. But since the decoder uses beam search to find a better decode sequence, we got beam_size prob distributions at each decode step. How did you track the best sequence and extract its prob distribution in beam search?
Could you please share your code?

Hi WJMacro,
We compute the entropy in a force decoding regime. That is to say, we re-score the already generated translations. So search is not an issue. I hope this helps.