a mistake in rotary embbeding
Closed this issue · 1 comments
chenht2021 commented
fd873630 commented
I'm testing with a toy dataset.
The previous code was not trained.
Changing to this code seems to be training well from the first epoch.
Awesome! Thank you so much.