About SS operation.
biwanqing opened this issue · 2 comments
biwanqing commented
Hello, thanks for your very enlightening work. Could you please explain the rationale for fine-tuning the SS factor instead of the weights and bias?Are there any experimental phenomena or theoretical derivations that support this?
yaoyao-liu commented
Thanks for your interest in our work!
The experiments in Table 1 of our paper show updating SS performs better in few-shot learning. Our paper is based on empirical results, and there are no theoretical derivations.
If you have any further questions, please do not hesitate to contact me.
biwanqing commented
Thanks for your prompt reply. I am interested in this work and will continue to follow it.