Gorilla-Lab-SCUT/frustum-convnet

inference time of f-convnet

WWW2323 opened this issue · 1 comment

Hi, could you tell me how you measured the 0.47 s inference time of F-ConvNet on the KITTI benchmark? Does it include the 2D detection time, the frustum extraction time, and the F-ConvNet model time? Did you use a batch size larger than 1 to run inference on all frustums of one image at once? When I run inference on a batch of frustums (batch size: 32), F-ConvNet takes only 0.03 s. Is there something wrong with my code?

@WWW2323 Yes. The reported inference time includes the 2D RGB detection, which accounts for most of the time. For F-ConvNet, we measure the time for 32 frustums, including both the first stage and the refinement stage. Note that we assume one image contains 32 2D bboxes on average. In practice, most images have fewer than 32 2D bboxes if you have a good 2D detector. The reported time is therefore conservative, and there is still a lot of room for optimization.
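
For anyone who wants to reproduce this kind of breakdown, here is a minimal sketch of per-stage timing. It is not code from this repository: `time_stages`, the stage names, and the `sync` hook are hypothetical, and the stages are stand-in stubs that only sleep. With a PyTorch model on GPU you would probably pass `torch.cuda.synchronize` as `sync`, because CUDA kernels launch asynchronously and the clock can otherwise be read before the work finishes, which is one possible reason a measurement looks too small.

```python
import time
from typing import Callable, Dict, Optional


def time_stages(
    stages: Dict[str, Callable[[], object]],
    sync: Optional[Callable[[], None]] = None,
    warmup: int = 2,
    repeats: int = 10,
) -> Dict[str, float]:
    """Return mean wall-clock seconds per stage, plus their sum under 'total'."""
    results: Dict[str, float] = {}
    for name, fn in stages.items():
        # Warm-up runs are excluded from the measurement.
        for _ in range(warmup):
            fn()
        # Hypothetical hook: wait for pending device work before starting the clock.
        if sync is not None:
            sync()
        start = time.perf_counter()
        for _ in range(repeats):
            fn()
        # Wait again so the measured interval covers all launched work.
        if sync is not None:
            sync()
        results[name] = (time.perf_counter() - start) / repeats
    results["total"] = sum(results.values())
    return results


# Stand-in stubs (assumptions): replace with the real 2D detector,
# frustum extraction, and F-ConvNet forward pass on a batch of frustums.
example_stages = {
    "detection_2d": lambda: time.sleep(0.003),
    "frustum_extraction": lambda: time.sleep(0.001),
    "f_convnet_32_frustums": lambda: time.sleep(0.002),
}

timings = time_stages(example_stages, warmup=1, repeats=3)
for stage, seconds in timings.items():
    print(f"{stage}: {seconds:.4f} s")
```

Reporting the stages separately makes it easy to see whether a number such as 0.03 s covers only the F-ConvNet forward pass or the whole pipeline.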