Valid padding in CapNets

Question

Valid padding in CapNets

phongnhhn92 opened this issue 7 years ago · 3 comments

Hello sir,
I am following the Capsule Network paper and your implementation.
I have a quick question about the valid padding in the conv2 you used to get output for the Primary Caps. So as I understand, after the 1st conv layer, the size of output is (batchsize,20,20,256). So if the conv2 has 256, 9x9 kernel, stride 2 then the formula output should be (20-9+2*p)/2+1 = 6. However, mathematically, the formula above can not be solved so I would like to ask how did exactly padding (valid) works in this situation to have the output is (batchsize,6,6,256).
Thanks !

Answer 1 · 2017-12-04T07:04:54.000Z

Hi — could you respond to the CapsLayer repo as we’ll be developing the library there :)

Answer 2 · 2017-12-04T07:08:48.000Z

U mean I should post this question in that repo ?

Answer 3 · 2017-12-04T08:00:41.000Z

@phongnhhn92 I think that input of conv1 layer is MNIST image, its size is 28*28, so the output is (28-9+0*2)/1+1=20, and there are 256 neurons, so output of conv1 is (batchsize,20,20,256). The conv2 layer you said is PrimaryCaps? The vec_len of capsule is 8, so there are 8 conv layers stacked in PrimaryCaps. The output here is (20-9+2*p)/2+1=6, there are 32 neurons, each neuron calculates 8 conv, concat them will get (batchsize,6,6,32*8)