LindgeW opened this issue 4 years ago · 0 comments
If not, how to add the src-side and tgt-side attention prob distributions together?