I am curious about why the flops and parms of TAdaConvNeXt-T is one half of the ResNet50's, to my knowledge they should be similar.

Question

I am curious about why the flops and parms of TAdaConvNeXt-T is one half of the ResNet50's, to my knowledge they should be similar.

leexinhao opened this issue 2 years ago · 2 comments

          I am curious about why the flops and parms of TAdaConvNeXt-T is one half of the ResNet50's, to my knowledge they should be similar.

Originally posted by @leexinhao in #11 (comment)

Could you give me the implement of TAdaConvNeXt？

Answer 1 · 2023-04-11T09:27:29.000Z

Sorry for the delayed reply.

The reason for the reduced computation for TAdaConvNeXt is that we used a tublet embedding stem in TAdaConvNeXt model, which reduces the temporal resolution by a factor of 2 in the first downsample layer of ConvNeXt :). In contrast, the TAda2D stem employs only spatial convolutions with no temporal downsampling.

Answer 2 · 2023-04-11T09:34:08.000Z

Sorry for the delayed reply.

The reason for the reduced computation for TAdaConvNeXt is that we used a tublet embedding stem in TAdaConvNeXt model, which reduces the temporal resolution by a factor of 2 in the first downsample layer of ConvNeXt :). In contrast, the TAda2D stem employs only spatial convolutions with no temporal downsampling.

Thanks for your reply!