Sense-GVT/BCDNet

The short-range/long-range mlp seem like sampling the four direction in different channel features.

lixcli opened this issue · 1 comments

Hi,
I think that some logic of the code perhaps not match the paper's method.
As the paper's state, the sampling step use a way similar to[1], which mean it will increase the channel numbers by 4 times for short-range branch and long-range branch in the mlp part, if we sample the four direction element in the sample feature. However, The mlp layer's input dimension is equal to the output dimension( both 1024, code link).
It means that it sample the four direction in different channel features.

Thank's for your work and your code-sharing!:)

ref:
[1]Shift: A zero flop, zero parameter alternative to spatial convolutions.

Please refer to line 69 and 94 in the code, we sample 1/4 channels from each direction. So that, the input and output channels are the same.