Question about weight value of key and query in SA Layer
dogyoonlee opened this issue · 1 comments
dogyoonlee commented
MenghaoGuo commented
Hi,
thank you for your attention to PCT.
In experiment, we find it can make the entire network converge better. Moreover, we observe that the weights of q_conv kernel and k_conv kernel are different after training.