tornadomeet/ResNet

Why does ResNeXt use the original residual block instead of the pre-activation residual block?

Itsanewday opened this issue · 0 comments

In the Identity Mappings paper, the authors claim that pre-activation is better than post-activation, but I find that in ResNeXt the residual blocks are the original post-activation residual blocks, i.e. conv->bn->relu->conv->bn->sum->relu instead of bn->relu->conv->bn->relu->conv->sum. Can you help me? Thanks a lot.
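
For reference, the two orderings in question can be written out as a minimal sketch. The function names and the `depth` parameter are hypothetical, chosen only for illustration; the code just lists operation names in order and does not build a real network or reproduce this repo's implementation.

```python
def post_activation_block(depth=2):
    """Original ordering: each conv is followed by bn, with relu between
    conv stages; the final relu comes AFTER the sum with the shortcut."""
    ops = []
    for i in range(depth):
        ops += ["conv", "bn"]
        if i < depth - 1:
            ops.append("relu")  # relu between conv stages only
    ops += ["sum", "relu"]      # activation applied after the addition
    return ops


def pre_activation_block(depth=2):
    """Pre-activation ordering: bn and relu come BEFORE each conv,
    and nothing is applied after the sum."""
    ops = []
    for _ in range(depth):
        ops += ["bn", "relu", "conv"]
    ops.append("sum")           # block ends at the addition
    return ops


if __name__ == "__main__":
    print("post-activation:", "->".join(post_activation_block()))
    print("pre-activation: ", "->".join(pre_activation_block()))
```

In the pre-activation ordering nothing follows the sum, so the shortcut path stays an identity from block to block; in the original ordering a ReLU is applied after the sum.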