JialeCao001/D2Det

Does anyone know how a fully convolutional network produces position-sensitive local offsets?

Opened this issue · 0 comments

[Image: WeChat screenshot 20210414210109]

In the source code, I see that this layer is used:
Conv2d(576, 4, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))

But how do these four output channels represent k*k dense position-sensitive offsets?
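
To make the question concrete, here is a minimal sketch of my current guess, not the repository's actual code path. It assumes the layer is applied to a k x k RoI feature map, so the four channels would be evaluated at every one of the k*k positions, giving one 4-vector (possibly left/top/right/bottom offsets, if I read the paper correctly) per position. The values `k = 7`, `num_rois = 2`, and the names `reg_conv`, `roi_feats`, `dense_offsets` are hypothetical, chosen only for illustration.

```python
import torch
import torch.nn as nn

k = 7          # assumed RoI grid size (k x k); illustrative value only
num_rois = 2   # hypothetical number of RoIs in the batch

# The layer quoted above: 576 input channels -> 4 output channels.
reg_conv = nn.Conv2d(576, 4, kernel_size=3, stride=1, padding=1)

# Hypothetical RoI-aligned features: one k x k grid of 576-d vectors per RoI.
roi_feats = torch.randn(num_rois, 576, k, k)

# A 3x3 conv with stride 1 and padding 1 preserves the spatial size,
# so the output holds 4 values at each of the k x k positions.
offsets = reg_conv(roi_feats)  # shape: (num_rois, 4, k, k)

# Rearrange so that each of the k*k positions carries its own 4-vector.
dense_offsets = offsets.permute(0, 2, 3, 1).reshape(num_rois, k * k, 4)

print(offsets.shape)
print(dense_offsets.shape)
```

If this reading is right, the k*k factor comes from the spatial dimensions of the output rather than from the channel count: position (i, j) of the grid maps to row i * k + j of `dense_offsets`. Can someone confirm whether this is what the code does?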