Author: Tom Hardy
WeChat official account: 3D视觉工坊 (3D Vision Workshop)
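About the account's operators and guests: the operators are algorithm engineers at leading Chinese tech companies who work in depth on 3D vision, vSLAM, computer vision, point cloud processing, deep learning, autonomous driving, image processing, 3D reconstruction, and related fields. Invited guests include PhD and master's graduates from well-known universities in China and abroad, as well as senior algorithm engineers working at Megvii, SenseTime, Baidu, Alibaba, and other companies. Everyone is welcome to join the discussion and learn together!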
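This post collects algorithms for 3D object detection, grouped by input modality: RGB images, RGB-D data, stereo vision, point clouds, and fused data. Additions are welcome.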
1. Point-Cloud-Based 3D Object Detection Algorithms
- End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds
- Vehicle Detection from 3D Lidar Using Fully Convolutional Network (early work from Baidu)
- VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
- Object Detection and Classification in Occupancy Grid Maps using Deep Convolutional Networks
- RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving
- BirdNet: a 3D Object Detection Framework from LiDAR information
- LMNet: Real-time Multiclass Object Detection on CPU using 3D LiDAR
- HDNET: Exploiting HD Maps for 3D Object Detection
- PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
- PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
- IPOD: Intensive Point-based Object Detector for Point Cloud
- PIXOR: Real-time 3D Object Detection from Point Clouds
- DepthCN: Vehicle Detection Using 3D-LIDAR and ConvNet
- Voxel-FPN: multi-scale voxel feature aggregation in 3D object detection from point clouds
- STD: Sparse-to-Dense 3D Object Detector for Point Cloud
- Fast Point R-CNN
- StarNet: Targeted Computation for Object Detection in Point Clouds
- Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection
- LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving
- FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds
- Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud
- PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
- Complex-YOLO: Real-time 3D Object Detection on Point Clouds
- YOLO4D: A Spatio-temporal Approach for Real-time Multi-object Detection and Classification from LiDAR Point Clouds
- YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud
- Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud
- Structure Aware Single-stage 3D Object Detection from Point Cloud (CVPR 2020, source code available)
- MLCVNet: Multi-Level Context VoteNet for 3D Object Detection (CVPR 2020, source code available)
- 3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020, source code available)
- LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention (CVPR 2020, source code available)
- PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection (CVPR 2020, source code available)
- Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud (CVPR 2020, source code available)
- Density Based Clustering for 3D Object Detection in Point Clouds (CVPR 2020)
- What You See is What You Get: Exploiting Visibility for 3D Object Detection (CVPR 2020)
- PointPainting: Sequential Fusion for 3D Object Detection (CVPR 2020)
- HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection (CVPR 2020)
- LiDAR R-CNN: An Efficient and Universal 3D Object Detector (CVPR 2021)
- Center-based 3D Object Detection and Tracking (CVPR 2021)
- 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection (CVPR 2021)
2. Monocular 3D Object Detection Algorithms
- Task-Aware Monocular Depth Estimation for 3D Object Detection
- M3D-RPN: Monocular 3D Region Proposal Network for Object Detection
- Monocular 3D Object Detection and Box Fitting Trained End-to-End Using Intersection-over-Union Loss
- Disentangling Monocular 3D Object Detection
- Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints
- Monocular 3D Object Detection via Geometric Reasoning on Keypoints
- Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction
- GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving
- Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving
- Deconvolutional Networks for Point-Cloud Vehicle Detection and Tracking in Driving Scenarios
- Learning Depth-Guided Convolutions for Monocular 3D Object Detection (CVPR 2020)
- End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection (CVPR 2020)
- GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection (CVPR 2021)
- Delving into Localization Errors for Monocular 3D Object Detection (CVPR 2021)
- M3DSSD: Monocular 3D Single Stage Object Detector (CVPR 2021)
- MonoRUn: Monocular 3D Object Detection by Self-Supervised Reconstruction and Uncertainty Propagation (CVPR 2021)
- Categorical Depth Distribution Network for Monocular 3D Object Detection (CVPR 2021)
3. Stereo-Based 3D Object Detection Algorithms
- Object-Centric Stereo Matching for 3D Object Detection
- Triangulation Learning Network: from Monocular to Stereo 3D Object Detection
- Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
- Stereo R-CNN based 3D Object Detection for Autonomous Driving
- IDA-3D: Instance-Depth-Aware 3D Object Detection from Stereo Vision for Autonomous Driving (CVPR 2020, source code available)
- Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation (CVPR 2020, source code available)
- DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020, source code available)
4. RGB-D-Based 3D Object Detection Algorithms
- Frustum PointNets for 3D Object Detection from RGB-D Data
- Frustum VoxNet for 3D object detection from RGB-D or Depth images
5. Radar-and-RGB-Based 3D Object Detection Algorithms
6. Fusion-Based 3D Object Detection Algorithms
- MLOD: A multi-view 3D object detection based on robust feature fusion method
- Multi-Sensor 3D Object Box Refinement for Autonomous Driving
- Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving
- Improving 3D Object Detection for Pedestrians with Virtual Multi-View Synthesis Orientation Estimation
- Class-specific Anchoring Proposal for 3D Object Recognition in LIDAR and RGB Images
- MVX-Net: Multimodal VoxelNet for 3D Object Detection
- Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation
- 3D Object Detection Using Scale Invariant and Feature Reweighting Networks
- End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection (CVPR 2020, source code available)