Opened this issue 4 years ago · 0 comments
I read that the source code is used: “Conv2d(576, 4, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))”
But how do these four channels represent k*k dense position offsets?