KITTI ground truth labels provide poor quality

Question

KITTI ground truth labels provide poor quality

Closed this issue 4 months ago · 4 comments

Hope you are doing fine!

I have been using several detectors (pv_rcnn_plusplus and voxel_rcnn models trained on waymo,nuscenes and lyft) in order to label KITTI frames. Then, I noticed that, there are several frames that has missing ground truth boxes even though our detectors were able to label them correctly which made me curious if there is something I am doing wrong or KITTI ground truth is really bad?

Kind regards,
Gorkem

Answer 1 · 2024-05-05T11:28:40.000Z

Yeah that's correct. From memory, KITTI ground truth labels do not label objects that are smaller than 25 pixel height. This tends to exclude far range objects that lidar can pick up which is why kitti lidar ground truth is missing labels.

Waymo and nuScenes on the other hand, will label any objects with at least one lidar point in them regardless of image pixel size. That's why detectors trained on them can detect objects with less points.

Answer 2 · 2024-05-05T11:54:44.000Z

Hi @darrenjkt, thanks a lot for your reply!

Then, for the quantitative evaluation purposes; is it a good approach to discard these object somehow? Because this results in a lot of false positives which significantly affects the results.

Exactly, using the waymo dataset I was able to get reasonable results both quantitative and qualitative...

Best,

Answer 3 · 2024-05-05T12:09:00.000Z

Not sure if the openpcdet kitti evaluation already does some filtering but you could dig into their code first.

If not, you could filter the predictions based on their 3d bounding box pixel height after projecting into the image frame.

Make sure you re-run this modified evaluation on other algorithms for fair comparison.

Answer 4 · 2024-05-05T21:53:11.000Z

Alright, thanks for your help!