jadore801120/attention-is-all-you-need-pytorch

Why is masking performed again during the inference decoder stage?

Akshay1-6180 opened this issue · 0 comments

Why is masking performed again in the decoder during inference?