Awesome-Aut-3D-Detection-Methods

note

This repository is created to collect excellent works on 3D object detection for autonomous driving tasks. We will update the latest papers as soon as possible.

keywords

inputs

monocular: monocular   stereo: stereo  lidar: point cloud  
image+lidar: image+lidar fusion  

datasets

experiments on datasets: kitti: KITTI   nuse: NuScenes   waymo: Waymo   ATG4D: ATG4D   lyft: lyft 

code

framework : Tensorflow: TensorFlow   PyTorch: PyTorch  

2017

  • [CVPR] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. [tensorflow][pytorch] [lidar] 🔥⭐
  • [CVPR] Multi-View 3D Object Detection Network for Autonomous Driving. [tensorflow] [image+lidar] [kitti]:fire: :star:
  • [ICRA] Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks. [code_matlab] [lidar] [kitti]:star:
  • [IROS] 3D fully convolutional network for vehicle detection in point cloud. [tensorflow] [lidar] [kitti]:fire: :star:

2018

  • [CVPR] PIXOR: Real-time 3D Object Detection from Point Clouds. [pytorch] [lidar] [kitti][ATG4D]
  • [CVPR] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. [tensorflow] [lidar] [kitti]:fire::fire::fire: :star:
  • [CVPR] PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation. [code] [image+lidar] [kitti]
  • [CVPR] Frustum PointNets for 3D Object Detection from RGB-D Data. [tensorflow] [image+lidar] [kitti] 🔥 ⭐
  • [ECCV] Deep Continuous Fusion for Multi-Sensor 3D Object Detection. [image+lidar] [kitti] [ATG4D]
  • [ECCVW] YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud. [ monocular] [kitti]
  • [ICRA] End-to-end Learning of Multi-sensor 3D Tracking by Detection. [image+lidar] [kitti]
  • [ICRA] Robust Real-Time 3D Person Detection for Indoor and Outdoor Applications. [lidar] [kitti]
  • [ICRA] A General Pipeline for 3D Detection of Vehicles.[lidar] [kitti]
  • [IROS] Joint 3D Proposal Generation and Object Detection from View Aggregation. [lidar] [kitti]:star:
  • [IROS] Edge and Corner Detection for Unorganized 3D Point Clouds with Application to Robotic Welding. [lidar] [kitti]
  • [SENSORS] SECOND: Sparsely Embedded Convolutional Detection. [pytorch][lidar] [kitti] 🔥🔥🔥🔥
  • [arXiv] IPOD: Intensive Point-based Object Detector for Point Cloud. [image+lidar] [kitti]
  • [arXiv] Complex-YOLO: Real-time 3D Object Detection on Point Clouds. [pytorch] [lidar] [kitti] 🔥

2019

  • [CVPR] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving. [code] [stereo][kitti]
  • [CVPR] Stereo R-CNN based 3D Object Detection for Autonomous Driving. [code] [stereo][kitti]
  • [CVPR] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud. [pytorch] [lidar] [kitti]:fire:
  • [CVPR] PointPillars: Fast Encoders for Object Detection from Point Clouds. [pytorch] [lidar] [kitti]:fire:
  • [CVPR] LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving.[lidar] [kitti][ATG4D]
  • [CVPRW] Attentional PointNet for 3D-Object Detection in Point Clouds. [pytorch] [lidar] [kitti]
  • [ICCV] Fast Point R-CNN. [lidar] [kitti]
  • [ICCV] STD: Sparse-to-Dense 3D Object Detector for Point Cloud.[pytorch] [lidar] [kitti]
  • [ICCV] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection.[pytorch] [monocular] [kitti]
  • [ICCVW] Range Adaptation for 3D Object Detection in LiDAR. [lidar] [kitti]
  • [ICCVW] Multi-View Reprojection Architecture for Orientation Estimation. [monocular] [kitti]
  • [NeurIPS] Point-Voxel CNN for Efficient 3D Deep Learning. [lidar] [kitti]
  • [ICMLW] LiDAR Sensor modeling and Data augmentation with GANs for Autonomous driving. [lidar]
  • [ICRA] Focal Loss in 3D Object Detection. [code] [lidar] [kitti]
  • [ICRA] SEG-VoxelNet for 3D Vehicle Detection from RGB and LiDAR Data. [lidar] [kitti]
  • [ICRA] MVX-Net: Multimodal VoxelNet for 3D Object Detection. [lidar] [kitti]
  • [AAAI] MonoGRNet: A Geometric Reasoning Network for 3D Object Localization. [monocular] [kitti]
  • [IROS] EPN: Edge-Aware PointNet for Object Recognition from Multi-View 2.5D Point Clouds. [tensorflow] [lidar] [kitti]
  • [IROS] Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection. [pytorch] [lidar+image] [kitti]
  • [IROS] Improving 3D object detection for pedestrians with virtual multi-view synthesis orientation estimation. [lidar] [kitti]
  • [3DV] IoU Loss for 2D/3D Object Detection. [lidar] [kitti]
  • [arXiv] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud. [monocular][kitti]
  • [arXiv] FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds. [code] [lidar] [kitti]
  • [CVPRW] Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds. [pytorch] [monocular][kitti]:fire:
  • [CVPR] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction. [pytorch] [monocular][kitti]
  • [CVPR] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving. [monocular][kitti]
  • [CVPR] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape. [monocular][kitti]
  • [CVPR] Triangulation Learning Network: from Monocular to Stereo 3D Object Detection. [pytorch] [stereo][kitti]
  • [CoRR] 3D Backbone Network for 3D Object Detection. [code] [lidar] [kitti]
  • [arXiv] nuScenes: A multimodal dataset for autonomous driving. [link] [dataset]
  • [arXiv] Deformable Filter Convolution for Point Cloud Reasoning.[lidar] [kitti][ATG4D]
  • [arXiv] PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement.[lidar] [kitti][ATG4D]

2020

  • [TPAMI] Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. [pytorch][lidar] [kitti]
  • [AAAI] TANet: Robust 3D Object Detection from Point Clouds with Triple Attention. [code] [lidar] [kitti]
  • [AAAI] PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module. [lidar+image] [kitti]
  • [AAAI] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection. [code] [stereo] [kitti]
  • [AAAI] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation. [monocular] [kitti]
  • [CVPR] PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. [pytorch] [lidar] [kitti] [waymo]:fire: :star: :fire: :star:
  • [CVPR] Structure Aware Single-stage 3D Object Detection from Point Cloud. [pytorch] [lidar] [kitti] 🔥 ⭐
  • [CVPR]3DSSD: Point-based 3D Single Stage Object Detector. [TensorFlow] [lidar] [kitti][nusc] 🔥 ⭐
  • [CVPR]Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud. [TensorFlow] [lidar] [kitti] 🔥 ⭐
  • [CVPR]Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection. [lidar] [kitti]
  • [CVPR]PnPNet: End-to-End Perception and Prediction with Tracking in the Loop. [lidar]
  • [CVPR] Train in Germany, Test in The USA: Making 3D Object Detectors Generalize.[code] [lidar]
  • [CVPR] PointPainting: Sequential Fusion for 3D Object Detection. [lidar+image] [kitti] [nusc]
  • [CVPR] DSGN: Deep Stereo Geometry Network for 3D Object Detection. [monocular] [kitti]
  • [CVPR] Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation.[code] [stereo] [kitti]
  • [CVPR] Learning Depth-Guided Convolutions for Monocular 3D Object Detection.[code] [monocular] [kitti]
  • [CVPR] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships. [monocular] [kitti]
  • [CVPR] LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention. [lidar_video] [nusc]
  • [CVPR] Physically Realizable Adversarial Examples for LiDAR Object Detection. [lidar]
  • [CVPR]HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection. [lidar] [kitti]
  • [CVPR]Dops: Learning to detect 3d objects and predict their 3d shapes. [lidar_video] [waymo]
  • [CVPR]Learning to Evaluate Perception Models Using Planner-Centric Metrics. [lidar]
  • [CVPR]What You See is What You Get: Exploiting Visibility for 3D Object Detection. [lidar] [nusc]
  • [CVPR]MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps. [lidar]
  • [ECCVW] Deformable PV-RCNN: Improving 3D Object Detection with Learned Deformations.[code][lidar] [kitti]
  • [ECCV] object as hotspots.[lidar] [kitti]
  • [ECCV] EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection.[lidar+image] [kitti]
  • [ECCV] 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection.[lidar+image] [kitti]
  • [ECCV] Kinematic 3D Object Detection in Monocular Video.[code][monocular_video] [kitti]
  • [ECCV] Rethinking Pseudo-LiDAR Representation.[code][monocular] [kitti]
  • [ECCV] An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds.[lidar] [waymo]
  • [ECCV] Pillar-based Object Detection for Autonomous Driving.[lidar] [waymo]
  • [ECCV] Active Perception using Light Curtains for Autonomous Driving.[code][lidar]
  • [ECCV] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution.[lidar]
  • [ECCV] Improving 3D Object Detection through Progressive Population Based Augmentation.[lidar] [kitti]
  • [IROS] MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views.[lidar] [nusc]
  • [ACMMM] Weakly Supervised 3D Object Detection from Point Clouds.[lidar]
  • [BMVC] RV-FuseNet: Range View based Fusion of Time-Series LiDAR Data for Joint 3D Object Detection and Motion Forecasting [lidar][nusc]
  • [Sensors] 3D-GIoU: 3D Generalized Intersection over Union for Object Detection in Point Cloud [lidar][kitti]
  • [arxiv] 3D IoU-Net: IoU Guided 3D Object Detector for Point Clouds [lidar][kitti]
  • [arxiv] Center-based 3D Object Detection and Tracking [code][lidar][nusc]
  • [arxiv] Boundary-Aware Dense Feature Indicator for Single-Stage 3D Object Detection from Point Clouds [lidar][nusc]
  • [arxiv] InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling [lidar][nusc]
  • [arxiv] Quantifying Data Augmentation for LiDAR based 3D Object Detection [lidar][kitti]
  • [arxiv] Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection [lidar][kitti][nusc]
  • [arxiv] Real-time 3D object proposal generation and classification under limited processing resources [lidar][kitti]
  • [arxiv] Safety-Aware Hardening of 3D Object Detection Neural Network Systems [lidar][kitti]
  • [arxiv] Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection[stereo][kitti]
  • [arxiv] SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds [code][lidar][kitti]
  • [arxiv] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds [lidar][kitti]
  • [arxiv] GhostBuster: Looking Into Shadows to Detect Ghost Objects in Autonomous Vehicle 3D Sensing [lidar][kitti]
  • [arxiv] Cross-Modality 3D Object Detection [lidar][kitti]
  • [arxiv] Towards Autonomous Driving: a Multi-Modal 360∘ Perception Proposal[lidar][kitti]
  • [arxiv] RangeRCNN: Towards Fast and Accurate 3D Object Detection with Range Image Representation[lidar][kitti]
  • [arxiv] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection[lidar+image][kitti]
  • [arxiv]Reinforced Axial Refinement Network for Monocular 3D Object Detection[monocular][kitti]
  • [arxiv]PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges[lidar][kitti][waymo]
  • [arxiv]GhostBuster: Looking Into Shadows to Detect Ghost Objects in Autonomous Vehicle 3D Sensing [lidar][kitti]
  • [arxiv]CenterNet3D:An Anchor free Object Detector for Autonomous Driving [lidar][kitti]
  • [arxiv] Part-Aware Data Augmentation for 3D Object Detection in Point Cloud. [lidar][kitti]
  • [arxiv] 1st Place Solution for Waymo Open Dataset Challenge - 3D Detection and Domain Adaptation. [lidar][waymo]
  • [arxiv] RoIFusion: 3D Object Detection from LiDAR and Vision. [lidar+image][kitti]
  • [arxiv] A Density-Aware PointRCNN for 3D Objection Detection in Point Clouds. [lidar][kitti]
  • [arxiv] Radar-Camera Sensor Fusion for Joint Object Detection and Distance Estimation in Autonomous Vehicles . [lidar+image][nusc]
  • [arxiv] 3D Object Detection and Tracking Based on Streaming Data. [lidar][kitti]['det_and_tracking']
  • [arxiv] Uncertainty-Aware Voxel based 3D Object Detection and Tracking with von-Mises Loss. [lidar][kitti]['det_and_tracking']
  • [arxiv] Faraway-Frustum: Dealing with Lidar Sparsity for 3D Object Detection using Fusion. [lidar][kitti]
  • [arxiv] Multi-View Adaptive Fusion Network for 3D Object Detection. [lidar][kitti]
  • [arxiv] Stereo Frustums: A Siamese Pipeline for 3D Object Detection. [stereo][kitti]
  • [arxiv]MLOD: Awareness of Extrinsic Perturbation in Multi-LiDAR 3D Object Detection for Autonomous Driving. [lidar]
  • [arxiv]SF-UDA3D: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection. [lidar][kitti][nusc]
  • [arxiv]Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels. [mono][kitti]
  • [arxiv]SemanticVoxels: Sequential Fusion for 3D Pedestrian Detection using LiDAR Point Cloud and Semantic Segmentation. [lidar][kitti]
  • [arxiv]CoFF: Cooperative Spatial Feature Fusion for 3D Object Detection on Autonomous Vehicles. [lidar][kitti]
  • [arxiv]Multi-Frame to Single-Frame: Knowledge Distillation for 3D Object Detection. [lidar][waymo]
  • [arxiv] MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion. [lidar][kitti]
  • [arxiv] CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection. [radar+image][nusc]
  • [arxiv] Uncertainty-Aware Voxel based 3D Object Detection and Tracking with von-Mises Loss. [lidar][kitti]
  • [arxiv] A Neural Network for Fast and Efficient Edge Detection in 3D Point Clouds. [lidar][kitti]
  • [arxiv] CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud. [lidar][kitti]
  • [arxiv] It’s All Around You: Range-Guided Cylindrical Network for 3D Object Detection. [lidar][kitti]

Suvery

  • [TPAMI] Deep Learning for 3D Point Clouds: A Survey[lidar]
  • [arxiv] 3D Point Cloud Processing and Learning for Autonomous Driving[lidar]
  • [arxiv] Deep Learning for 3D Point Cloud Understanding: A Survey[lidar]
  • [arxiv] An Overview Of 3D Object Detection[lidar]

code base