About diffloss
Closed this issue · 2 comments
RohollahHS commented
Hi,
In this part
Lines 232 to 238 in fe470ac
self.diffloss
? You then use the mask to compute the loss only for masked tokens.
Thanks
LTH14 commented
Both implementations are ok
RohollahHS commented
Thanks for the prompt response.