d-li14/octconv.pytorch

Computation time around 3 times longer with OctConv

niam06 opened this issue · 3 comments

Thanks for your implementation of OctConv!

I see that the OctConv paper reports reduced GFLOPs usage as well as reduced computation time per image. However, running your implementation on ImageNet dataset, it seems the computation time per image actually increases by a multiple of 3! This happens both during training and during validation. I am comparing PyTorch's ResNet50 vs your OctResNet-50 with default alpha parameters.

duplicate of #4

Thanks!

Continuing on my experiments on ImageNet, I tried testing OctConv Resnet50 on CPU, and it seems that it has runtime around 25% higher to PyTorch's vanilla ResNet50 (~0.79 seconds with ResNet50 vs. ~1.1 seconds with OctConvResNet50 for each 160 images). I am using default alpha_in and alpha_out at 0.5 each. Any thoughts on this? Thanks!