The short-range/long-range mlp seem like sampling the four direction in different channel features.
lixcli opened this issue · 1 comments
lixcli commented
Hi,
I think that some logic of the code perhaps not match the paper's method.
As the paper's state, the sampling step use a way similar to[1], which mean it will increase the channel numbers by 4 times for short-range branch and long-range branch in the mlp part, if we sample the four direction element in the sample feature. However, The mlp layer's input dimension is equal to the output dimension( both 1024, code link).
It means that it sample the four direction in different channel features.
Thank's for your work and your code-sharing!:)
ref:
[1]Shift: A zero flop, zero parameter alternative to spatial convolutions.
Xingrun-Xing commented