Speed of RFBNet and SSD on COCO and VOC: have you considered the NMS time?

Question

tjulyz opened this issue 6 years ago · 4 comments

Hi,
Thank you for sharing your code!
When I test your code on VOC and SSD, I cannot reproduce the speed you reported.
My configuration is Titan XP, PyTorch 0.4, CUDA 9.0, and I get 20+ ms for VOC and 30+ ms for COCO, unlike your results (80+ fps, and 15 ms for COCO). When testing the speed, I use conf_thresh 0.01 and gpunms. I have also tried PyTorch 0.3 and still cannot reproduce the speed. The NMS time for me is ~20 ms for COCO and ~10 ms for VOC. So did you include the NMS time when reporting the fps?
Any advice on getting the correct speed?
Thank you very much!

Answer 1 · 2019-03-03T08:07:43.000Z

@tjulyz In my test, I get ~9 ms inference time plus ~3 ms CPU NMS time with PyTorch 0.3.1 and CUDA 9.0 on VOC 300. For COCO 300, the GPU NMS time increases to ~6 ms. These measurements all ignore the first 50 samples, when the GPU is still warming up. For your configuration, I guess the bottleneck mainly comes from the NMS. You can try the GPU NMS version, use a newer version of numpy and cython, or use a more powerful CPU.
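
For anyone comparing numbers, here is a minimal sketch of how the measurement described above could be set up: each stage (inference, NMS) is timed separately and the first samples are skipped as warm-up. The function `time_stages` and its parameters are hypothetical names for illustration, not part of this repository. On a GPU, kernels run asynchronously, so the `sync` argument would typically be `torch.cuda.synchronize`; without it, the time of one stage can leak into the next.

```python
import time


def time_stages(stages, inputs, warmup=50, sync=None):
    """Return mean seconds per sample for each named stage, skipping warm-up samples.

    stages: list of (name, fn) pairs applied in order; each fn receives the
            output of the previous stage (hypothetical interface).
    sync:   optional zero-argument callable run before every clock read,
            e.g. torch.cuda.synchronize when the stages run on a GPU.
    """
    totals = {name: 0.0 for name, _ in stages}
    counted = 0
    for i, x in enumerate(inputs):
        for name, fn in stages:
            if sync is not None:
                sync()  # make sure earlier work has finished before starting the clock
            start = time.perf_counter()
            x = fn(x)
            if sync is not None:
                sync()  # make sure this stage has finished before stopping the clock
            elapsed = time.perf_counter() - start
            if i >= warmup:
                totals[name] += elapsed
        if i >= warmup:
            counted += 1
    if counted == 0:
        raise ValueError("need more inputs than warm-up samples")
    return {name: total / counted for name, total in totals.items()}
```

With a real model this might be called as `time_stages([("inference", net), ("nms", postprocess)], images, warmup=50, sync=torch.cuda.synchronize)`, where `net`, `postprocess`, and `images` stand in for your own objects. Reporting the two stages separately makes it clear whether an fps figure includes NMS.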

Answer 2 · 2019-03-04T06:06:17.000Z

Great, thank you very much!

Answer 3 · 2019-03-05T06:47:34.000Z

I ran into the same problem. My configuration is Titan XP, PyTorch 0.3.1, CUDA 9.0, and I am using the GPU version of NMS. For COCO 300, the inference time is ~6 ms/img while the NMS time is ~12 ms/img, which looks strange. Could you please provide some more suggestions? Many thanks!
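
One thing that may be worth checking when NMS takes longer than inference is how many candidate boxes actually reach NMS. With a low score threshold such as the conf_thresh 0.01 mentioned in the question, many low-scoring boxes pass the filter, and greedy NMS does more work as that count grows. The sketch below is a plain-Python greedy NMS written only to illustrate this; it is not the `gpu_nms` used in this repository, and the names `greedy_nms`, `iou`, and the default `iou_thresh=0.45` are assumptions for the example.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, conf_thresh=0.01, iou_thresh=0.45):
    """Return (kept indices, number of candidates that passed conf_thresh)."""
    # Only boxes scoring above conf_thresh are considered at all.
    candidates = [i for i, s in enumerate(scores) if s > conf_thresh]
    candidates.sort(key=lambda i: scores[i], reverse=True)
    keep = []
    for i in candidates:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[k]) <= iou_thresh for k in keep):
            keep.append(i)
    return keep, len(candidates)


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30), (0, 0, 10, 10)]
scores = [0.9, 0.8, 0.7, 0.005]
keep, n_candidates = greedy_nms(boxes, scores, conf_thresh=0.01)
print(keep, n_candidates)
```

Logging the candidate count per image (and per class, if NMS is run class by class) alongside the NMS time should show whether the extra time on COCO simply tracks the larger number of boxes being processed.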

Answer 4 · 2019-07-11T02:31:58.000Z

I also ran into the same problem: gpu_nms takes much more time than model inference. I am very worried about this.