When I train with ResNet-50 as the backbone, each iteration takes about 0.8s (batch size=4, gpu=4), and inference also takes about 0.8s per image, which is 10x slower than the 0.08s reported in the paper. Is this normal? The GPU is a 2080 Ti.
train log:
inference log: