This repository is created to collect excellent works on 3D object detection for autonomous driving tasks. We will update the latest papers as soon as possible.
monocular
: monocular stereo
: stereo lidar
: point cloud
image+lidar
: image+lidar fusion
experiments on datasets: kitti
: KITTI nuse
: NuScenes waymo
: Waymo ATG4D
: ATG4D lyft
: lyft
framework : Tensorflow
: TensorFlow PyTorch
: PyTorch
- [CVPR] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. [tensorflow][pytorch] [
lidar
] 🔥⭐ - [CVPR] Multi-View 3D Object Detection Network for Autonomous Driving. [tensorflow] [
image+lidar
] [kitti
]:fire: :star: - [ICRA] Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks. [code_matlab] [
lidar
] [kitti
]:star: - [IROS] 3D fully convolutional network for vehicle detection in point cloud. [tensorflow] [
lidar
] [kitti
]:fire: :star:
- [CVPR] PIXOR: Real-time 3D Object Detection from Point Clouds. [pytorch] [
lidar
] [kitti
][ATG4D
] - [CVPR] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. [tensorflow] [
lidar
] [kitti
]:fire::fire::fire: :star: - [CVPR] PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation. [code] [
image+lidar
] [kitti
] - [CVPR] Frustum PointNets for 3D Object Detection from RGB-D Data. [tensorflow] [
image+lidar
] [kitti
] 🔥 ⭐ - [ECCV] Deep Continuous Fusion for Multi-Sensor 3D Object Detection. [
image+lidar
] [kitti
] [ATG4D
] - [ECCVW] YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud. [
monocular
] [kitti
] - [ICRA] End-to-end Learning of Multi-sensor 3D Tracking by Detection. [
image+lidar
] [kitti
] - [ICRA] Robust Real-Time 3D Person Detection for Indoor and Outdoor Applications. [
lidar
] [kitti
] - [ICRA] A General Pipeline for 3D Detection of Vehicles.[
lidar
] [kitti
] - [IROS] Joint 3D Proposal Generation and Object Detection from View Aggregation. [
lidar
] [kitti
]:star: - [IROS] Edge and Corner Detection for Unorganized 3D Point Clouds with Application to Robotic Welding. [
lidar
] [kitti
] - [SENSORS] SECOND: Sparsely Embedded Convolutional Detection. [pytorch][
lidar
] [kitti
] 🔥🔥🔥🔥 - [arXiv] IPOD: Intensive Point-based Object Detector for Point Cloud. [
image+lidar
] [kitti
] - [arXiv] Complex-YOLO: Real-time 3D Object Detection on Point Clouds. [pytorch] [
lidar
] [kitti
] 🔥
- [CVPR] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving. [code] [
stereo
][kitti
] - [CVPR] Stereo R-CNN based 3D Object Detection for Autonomous Driving. [code] [
stereo
][kitti
] - [CVPR] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud. [pytorch] [
lidar
] [kitti
]:fire: - [CVPR] PointPillars: Fast Encoders for Object Detection from Point Clouds. [pytorch] [
lidar
] [kitti
]:fire: - [CVPR] LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving.[
lidar
] [kitti
][ATG4D
] - [CVPRW] Attentional PointNet for 3D-Object Detection in Point Clouds. [pytorch] [
lidar
] [kitti
] - [ICCV] Fast Point R-CNN. [
lidar
] [kitti
] - [ICCV] STD: Sparse-to-Dense 3D Object Detector for Point Cloud.[pytorch] [
lidar
] [kitti
] - [ICCV] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection.[pytorch] [
monocular
] [kitti
] - [ICCVW] Range Adaptation for 3D Object Detection in LiDAR. [
lidar
] [kitti
] - [ICCVW] Multi-View Reprojection Architecture for Orientation Estimation. [
monocular
] [kitti
] - [NeurIPS] Point-Voxel CNN for Efficient 3D Deep Learning. [
lidar
] [kitti
] - [ICMLW] LiDAR Sensor modeling and Data augmentation with GANs for Autonomous driving. [
lidar
] - [ICRA] Focal Loss in 3D Object Detection. [code] [
lidar
] [kitti
] - [ICRA] SEG-VoxelNet for 3D Vehicle Detection from RGB and LiDAR Data. [
lidar
] [kitti
] - [ICRA] MVX-Net: Multimodal VoxelNet for 3D Object Detection. [
lidar
] [kitti
] - [AAAI] MonoGRNet: A Geometric Reasoning Network for 3D Object Localization. [
monocular
] [kitti
] - [IROS] EPN: Edge-Aware PointNet for Object Recognition from Multi-View 2.5D Point Clouds. [tensorflow] [
lidar
] [kitti
] - [IROS] Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection. [pytorch] [
lidar+image
] [kitti
] - [IROS] Improving 3D object detection for pedestrians with virtual multi-view synthesis orientation estimation. [
lidar
] [kitti
] - [3DV] IoU Loss for 2D/3D Object Detection. [
lidar
] [kitti
] - [arXiv] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud. [
monocular
][kitti
] - [arXiv] FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds. [code] [
lidar
] [kitti
] - [CVPRW] Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds. [pytorch] [
monocular
][kitti
]:fire: - [CVPR] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction. [pytorch] [
monocular
][kitti
] - [CVPR] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving. [
monocular
][kitti
] - [CVPR] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape. [
monocular
][kitti
] - [CVPR] Triangulation Learning Network: from Monocular to Stereo 3D Object Detection. [pytorch] [
stereo
][kitti
] - [CoRR] 3D Backbone Network for 3D Object Detection. [code] [
lidar
] [kitti
] - [arXiv] nuScenes: A multimodal dataset for autonomous driving. [link] [
dataset
] - [arXiv] Deformable Filter Convolution for Point Cloud Reasoning.[
lidar
] [kitti
][ATG4D
] - [arXiv] PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement.[
lidar
] [kitti
][ATG4D
]
- [TPAMI] Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. [pytorch][
lidar
] [kitti
] - [AAAI] TANet: Robust 3D Object Detection from Point Clouds with Triple Attention. [code] [
lidar
] [kitti
] - [AAAI] PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module. [
lidar+image
] [kitti
] - [AAAI] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection. [code] [
stereo
] [kitti
] - [AAAI] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation. [
monocular
] [kitti
] - [CVPR] PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. [pytorch] [
lidar
] [kitti
] [waymo
]:fire: :star: :fire: :star: - [CVPR] Structure Aware Single-stage 3D Object Detection from Point Cloud. [pytorch] [
lidar
] [kitti
] 🔥 ⭐ - [CVPR]3DSSD: Point-based 3D Single Stage Object Detector. [TensorFlow] [
lidar
] [kitti
][nusc
] 🔥 ⭐ - [CVPR]Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud. [TensorFlow] [
lidar
] [kitti
] 🔥 ⭐ - [CVPR]Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection. [
lidar
] [kitti
] - [CVPR]PnPNet: End-to-End Perception and Prediction with Tracking in the Loop. [
lidar
] - [CVPR] Train in Germany, Test in The USA: Making 3D Object Detectors Generalize.[code] [
lidar
] - [CVPR] PointPainting: Sequential Fusion for 3D Object Detection. [
lidar+image
] [kitti
] [nusc
] - [CVPR] DSGN: Deep Stereo Geometry Network for 3D Object Detection. [
monocular
] [kitti
] - [CVPR] Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation.[code] [
stereo
] [kitti
] - [CVPR] Learning Depth-Guided Convolutions for Monocular 3D Object Detection.[code] [
monocular
] [kitti
] - [CVPR] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships. [
monocular
] [kitti
] - [CVPR] LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention. [
lidar_video
] [nusc
] - [CVPR] Physically Realizable Adversarial Examples for LiDAR Object Detection. [
lidar
] - [CVPR]HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection. [
lidar
] [kitti
] - [CVPR]Dops: Learning to detect 3d objects and predict their 3d shapes. [
lidar_video
] [waymo
] - [CVPR]Learning to Evaluate Perception Models Using Planner-Centric Metrics. [
lidar
] - [CVPR]What You See is What You Get: Exploiting Visibility for 3D Object Detection. [
lidar
] [nusc
] - [CVPR]MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps. [
lidar
] - [ECCVW] Deformable PV-RCNN: Improving 3D Object Detection with Learned Deformations.[code][
lidar
] [kitti
] - [ECCV] object as hotspots.[
lidar
] [kitti
] - [ECCV] EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection.[
lidar+image
] [kitti
] - [ECCV] 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection.[
lidar+image
] [kitti
] - [ECCV] Kinematic 3D Object Detection in Monocular Video.[code][
monocular_video
] [kitti
] - [ECCV] Rethinking Pseudo-LiDAR Representation.[code][
monocular
] [kitti
] - [ECCV] An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds.[
lidar
] [waymo
] - [ECCV] Pillar-based Object Detection for Autonomous Driving.[
lidar
] [waymo
] - [ECCV] Active Perception using Light Curtains for Autonomous Driving.[code][
lidar
] - [ECCV] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution.[
lidar
] - [ECCV] Improving 3D Object Detection through Progressive Population Based Augmentation.[
lidar
] [kitti
] - [IROS] MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views.[
lidar
] [nusc
] - [ACMMM] Weakly Supervised 3D Object Detection from Point Clouds.[
lidar
] - [BMVC] RV-FuseNet: Range View based Fusion of Time-Series LiDAR Data for Joint 3D Object Detection and Motion Forecasting [
lidar
][nusc
] - [Sensors] 3D-GIoU: 3D Generalized Intersection over Union for Object Detection in Point Cloud [
lidar
][kitti
] - [arxiv] 3D IoU-Net: IoU Guided 3D Object Detector for Point Clouds [
lidar
][kitti
] - [arxiv] Center-based 3D Object Detection and Tracking [code][
lidar
][nusc
] - [arxiv] Boundary-Aware Dense Feature Indicator for Single-Stage 3D Object Detection from Point Clouds [
lidar
][nusc
] - [arxiv] InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling [
lidar
][nusc
] - [arxiv] Quantifying Data Augmentation for LiDAR based 3D Object Detection [
lidar
][kitti
] - [arxiv] Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection [
lidar
][kitti
][nusc
] - [arxiv] Real-time 3D object proposal generation and classification under limited processing resources [
lidar
][kitti
] - [arxiv] Safety-Aware Hardening of 3D Object Detection Neural Network Systems [
lidar
][kitti
] - [arxiv] Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection[
stereo
][kitti
] - [arxiv] SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds [code][
lidar
][kitti
] - [arxiv] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
[
lidar
][kitti
] - [arxiv] GhostBuster: Looking Into Shadows to Detect Ghost Objects in Autonomous Vehicle 3D Sensing [
lidar
][kitti
] - [arxiv] Cross-Modality 3D Object Detection [
lidar
][kitti
] - [arxiv] Towards Autonomous Driving: a Multi-Modal 360∘ Perception Proposal[
lidar
][kitti
] - [arxiv] RangeRCNN: Towards Fast and Accurate 3D Object Detection with Range Image Representation[
lidar
][kitti
] - [arxiv] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection[
lidar+image
][kitti
] - [arxiv]Reinforced Axial Refinement Network for Monocular 3D Object Detection[
monocular
][kitti
] - [arxiv]PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges[
lidar
][kitti
][waymo
] - [arxiv]GhostBuster: Looking Into Shadows to Detect Ghost Objects in Autonomous Vehicle 3D Sensing [
lidar
][kitti
] - [arxiv]CenterNet3D:An Anchor free Object Detector for Autonomous Driving [
lidar
][kitti
] - [arxiv] Part-Aware Data Augmentation for 3D Object Detection in Point Cloud. [
lidar
][kitti
] - [arxiv] 1st Place Solution for Waymo Open Dataset Challenge - 3D Detection and Domain Adaptation. [
lidar
][waymo
] - [arxiv] RoIFusion: 3D Object Detection from LiDAR and Vision. [
lidar+image
][kitti
] - [arxiv] A Density-Aware PointRCNN for 3D Objection Detection in Point Clouds. [
lidar
][kitti
] - [arxiv] Radar-Camera Sensor Fusion for Joint Object Detection and Distance Estimation in Autonomous Vehicles
. [
lidar+image
][nusc
] - [arxiv] 3D Object Detection and Tracking Based on Streaming Data. [
lidar
][kitti
]['det_and_tracking'] - [arxiv] Uncertainty-Aware Voxel based 3D Object Detection and Tracking with von-Mises Loss. [
lidar
][kitti
]['det_and_tracking'] - [arxiv] Faraway-Frustum: Dealing with Lidar Sparsity for 3D Object Detection using Fusion. [
lidar
][kitti
] - [arxiv] Multi-View Adaptive Fusion Network for 3D Object Detection. [
lidar
][kitti
] - [arxiv] Stereo Frustums: A Siamese Pipeline for 3D Object Detection. [
stereo
][kitti
] - [arxiv]MLOD: Awareness of Extrinsic Perturbation in Multi-LiDAR 3D Object Detection for Autonomous Driving. [
lidar
] - [arxiv]SF-UDA3D: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection. [
lidar
][kitti
][nusc
] - [arxiv]Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels. [
mono
][kitti
] - [arxiv]SemanticVoxels: Sequential Fusion for 3D Pedestrian Detection using LiDAR Point Cloud and Semantic Segmentation. [
lidar
][kitti
] - [arxiv]CoFF: Cooperative Spatial Feature Fusion for 3D Object Detection on Autonomous Vehicles. [
lidar
][kitti
] - [arxiv]Multi-Frame to Single-Frame: Knowledge Distillation for 3D Object Detection. [
lidar
][waymo
] - [arxiv] MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion. [
lidar
][kitti
] - [arxiv] CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection. [
radar+image
][nusc
] - [arxiv] Uncertainty-Aware Voxel based 3D Object Detection and Tracking with von-Mises Loss. [
lidar
][kitti
] - [arxiv] A Neural Network for Fast and Efficient Edge Detection in 3D Point Clouds. [
lidar
][kitti
] - [arxiv] CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud. [
lidar
][kitti
] - [arxiv] It’s All Around You: Range-Guided Cylindrical Network for 3D Object Detection. [
lidar
][kitti
]
- [TPAMI] Deep Learning for 3D Point Clouds: A Survey[
lidar
] - [arxiv] 3D Point Cloud Processing and Learning for Autonomous Driving[
lidar
] - [arxiv] Deep Learning for 3D Point Cloud Understanding: A Survey[
lidar
] - [arxiv] An Overview Of 3D Object Detection[
lidar
]
- [lidar_only] second.pytorch [
kitti
][nusc
] - [lidar_only] Det3D [
kitti
][nusc
][lyft
][waymo
] - [lidar_only] OpenPCDet[
kitti
][nusc
][waymo
] - [lidar_image] mmdetection3d[
kitti
][nusc
][lyft
][waymo
]