Difference from original paper: regularization term of TOP1 Loss

Question

bilzard opened this issue 2 years ago · 1 comment

Hi.
Thank you for sharing this code.

I found a difference from the original paper in the regularization term of the TOP1 loss.
According to the original paper, the regularization term should be calculated over the negative samples only¹.
However, this repo calculates it over all samples, including the positive one².
The difference may not be significant, but I wanted to point it out.
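
To make the difference concrete, here is a minimal pure-Python sketch. It is not the code from this repo. It assumes the TOP1 loss for one positive item i with sampled negatives j is the mean over j of sigmoid(r_j - r_i) + sigmoid(r_j^2), and that negatives come from the other items in the mini-batch, so `scores[i][i]` is the positive score. The function name `top1_loss` and the flag `negatives_only` are made up for illustration.

```python
import math


def sigmoid(x):
    # Plain logistic function; fine for the small scores used here.
    return 1.0 / (1.0 + math.exp(-x))


def top1_loss(scores, negatives_only=True):
    """TOP1 loss sketch for a square score matrix (at least 2x2).

    scores[i][j] is the score of item j for example i. The diagonal entry
    scores[i][i] is assumed to be the positive; every other entry in the
    row is treated as a negative sample.

    negatives_only=True  -> regularize negative scores only
    negatives_only=False -> regularize every score, positive included
    """
    n = len(scores)
    total = 0.0
    for i in range(n):
        pos = scores[i][i]
        rank_terms = []
        reg_terms = []
        for j in range(n):
            is_negative = j != i
            if is_negative:
                # Ranking term: penalize negatives scored above the positive.
                rank_terms.append(sigmoid(scores[i][j] - pos))
            if is_negative or not negatives_only:
                # Regularization term: push the selected scores toward zero.
                reg_terms.append(sigmoid(scores[i][j] ** 2))
        total += sum(rank_terms) / len(rank_terms)
        total += sum(reg_terms) / len(reg_terms)
    return total / n


scores = [[3.0, 0.0], [0.0, 3.0]]
loss_negatives_only = top1_loss(scores, negatives_only=True)
loss_all_samples = top1_loss(scores, negatives_only=False)
```

With these scores the all-samples variant gives the larger loss, because it also penalizes the positive score for being far from zero. How much this matters during training probably depends on the number of negatives per positive.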

Answer 1 · 2023-11-17T05:50:02.000Z

@bilzard Thank you for your comments!

Footnotes