- Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D [paper] [Github]
- FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras [paper] [Github]
- Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection [paper]
- BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers [paper] [Github]
- PETR: Position Embedding Transformation for Multi-View 3D Object Detection [paper][Github]
- ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning [paper][Github]
- BEVDet: High-Performance Multi-Camera 3D Object Detection in Bird-Eye-View [paper] [Github]
- BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection [paper]
- PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images [paper][Github]
- M2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation [paper]
- BEVSegFormer: Bird’s Eye View Semantic Segmentation From Arbitrary Camera Rigs [paper]
- BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving [paper] [Github]
- PolarDETR: Polar Parametrization for Vision-based Surround-View 3D Detection[paper] [Github]
- BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection [paper][Github]
- FUTR3D: A Unified Sensor Fusion Framework for 3D Detection [paper] [Github]
- BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework [paper] [Github]
- Unifying Voxel-based Representation with Transformer for 3D Object Detection [paper] [Github]
- BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation [paper] [Github]