/DAIR-V2X

Primary LanguagePythonApache License 2.0Apache-2.0

DAIR-V2X and OpenDAIRV2X: Towards General and Real-World Cooperative Autonomous Driving



teaser

Table of Contents:

  1. Highlights
  2. News
  3. Dataset Download
  4. Getting Started
  5. Major Features
  6. Benchmark
  7. Citation
  8. Contaction

Highlights

  • DAIR-V2X: The first real-world dataset for research on vehicle-to-everything autonomous driving. It comprises a total of 71,254 frames of image data and 71,254 frames of point cloud data.
  • OpenDAIR-V2X: An open-sourced framework for supporting the research on vehicle-to-everything autonomous driving.

News

  • [2023.03] 🔥 Our new dataset "V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting" has been accepted by CVPR2023. Congratulations! We will release the dataset sooner. Please follow DAIR-V2X-Seq for the latest news.
  • [2023.03] 🔥 We have released training code for our FFNET, and our OpenDAIRV2X now supports evaluating FFNET.
  • [2022.11] We have held the first VIC3D Object Detection challenge.
  • [2022.07] We have released the OpenDAIRV2X codebase v1.0.0. The current version can faciliate the researchers to use the DAIR-V2X dataset and reproduce the benchmarks.
  • [2022.03] Our Paper "DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection" has been accepted by CVPR2022. Arxiv version could be seen here.
  • [2022.02] DAIR-V2X dataset is availale here. It can be unlimitedly downloaded within mainland China.

Dataset Download

Getting Started

Please refer to getting_started.md for Installation, Evaluation, Benchmark and Training etc for VIC3D.

Major Features

  • Support Train/Evaluation for VIC3D

    It will directly support model training and evaluation for VIC3D. Now the model inference and model training are mainly based on MMDetection3D, which is not quite convenient to carry the VICAD research.

  • Evaluation (Model inference is based on MMDetection3D)

  • Training based on MMDetection3D

  • Direct Evaluation with DAIR-V2X Framework

  • Direct Training with DAIR-V2X Framework

  • Support different fusion methods for VIC3D

    It will directly support different fusion methods including early fusion/feature fusion/late fusion. Now it supports early fusion and late fusion.

  • Early Fusion

  • Early Fusion

  • Late Fusion

  • Support multi-modality/single-modality detectors for VIC3D

    It will directly support different modaility detectors including image-modality detector, pointcloud-modality detector and image-pointcloud fusion detector. Now it supports image-modality detector ImvoxelNet, pointcloud-modality detector PointPillars.

  • Image-modality

  • Pointcloud-modality

  • Multi-modality

  • Support Cooperation-view/single-view detectors for VIC3D

    It directly supports different view's detectors for VIC3D, including infrastructure-view detector, vehicle-view detector, vehicle-infrastrucure cooperation-view detector.

  • Infrastructure-view

  • Vehicle-view

  • Cooperation-view

Benchmark

You can find more benchmark in SV3D-Veh, SV3D-Inf, and VIC3D. We provide part of the VIC3D Benchmark in following table.

Modality Fusion Model Dataset AP-3D (IoU=0.5) AP-BEV (IoU=0.5) AB
Overall 0-30m 30-50m 50-100m Overall 0-30m 30-50m 50-100m
Image VehOnly ImvoxelNet VIC-Sync 9.13 19.06 5.23 0.41 10.96 21.93 7.28 0.78 0
Late-Fusion ImvoxelNet VIC-Sync 18.77 33.47 9.43 8.62 24.85 39.49 14.68 14.96 309.38
Pointcloud VehOnly PointPillars VIC-Sync 48.06 47.62 63.51 44.37 52.24 30.55 66.03 48.36 0
Early Fusion PointPillars VIC-Sync 62.61 64.82 68.68 56.57 68.91 68.92 73.64 65.66 1382275.75
Late-Fusion PointPillars VIC-Sync 56.06 55.69 68.44 53.60 62.06 61.52 72.53 60.57 478.61
Late-Fusion PointPillars VIC-Async-2 52.43 51.13 67.09 49.86 58.10 57.23 70.86 55.78 478.01
TCLF PointPillars VIC-Async-2 53.37 52.41 67.33 50.87 59.17 58.25 71.20 57.43 897.91

Citation

If you find this project useful in your research, please consider cite:

@inproceedings{dair-v2x,
  title={Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection},
  author={Yu, Haibao and Luo, Yizhen and Shu, Mao and Huo, Yiyi and Yang, Zebang and Shi, Yifeng and Guo, Zhenglong and Li, Hanyu and Hu, Xing and Yuan, Jirui and others},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={21361--21370},
  year={2022}
}

Contaction

If any questions and suggenstations, please email to dair@air.tsinghua.edu.cn.

Related Resources

Awesome