The description and the behavior don't match
yongchanghao opened this issue · 2 comments
yongchanghao commented
In the following code, the comment says weight A is init as usual (kaiming init as in other places in the code) and B as zeros. However, the behavior is quite the opposite:
Lines 58 to 60 in 3f5c193
Is this intended? Thanks.
edwardjhu commented
I fixed the comment. Thanks!
yongchanghao commented
Thank you. May I know the reason why the matrix A is set to 0 here, unlike other parts (and the paper) where B is 0?