Fabulous papers for CV field.
- 1. Summary of Conference Papers
- 2. Papers of Some Fields
- 2.1. Common Vision Backbone
- 2.2. Object Detection
- 2.3. Image Segmentation
- 2.4. Data Augmentation
- 2.5. Image Enhancement
- 2.6. Image Composition
- CVPR2023 -> Paper List
- (ICLR 2021) An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- paper: ViT
- (ICCV 2021) Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
- paper: Swin Transformer
- (CVPR 2022) Swin Transformer V2: Scaling Up Capacity and Resolution
- paper: Swin Transformer v2
- (ICCV 2023) FLatten Transformer: Vision Transformer using Focused Linear Attention
- paper: Flatten Transformer
- (ICCV 2023) SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
- paper: SwiftFormer
- (CVPR 2015) Going Deeper with Convolutions
- paper: GoogLeNet
- (CVPR 2016) Deep Residual Learning for Image Recognition
- paper: ResNet
- (CVPR 2018) MobileNetV2: Inverted Residuals and Linear Bottlenecks
- paper: MobileNetv2
- (ECCV 2018) ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
- paper: ShuffleNet v2
- (CVPR 2023) SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy
- paper: SCConv
- (ICCV 2023) RepViT: Revisiting Mobile CNN From ViT Perspective
- paper: RepViT
- code: https://github.com/THU-MIG/RepViT/
- (CVPR 2015 best) Fully Convolutional Networks for Semantic Segmentation
- (MICCAI 2015) U-Net: Convolutional Networks for Biomedical Image Segmentation
- (ICCV 2017) Mask R-CNN
- (CVPR 2019) Panoptic FPN:Panoptic Feature Pyramid Networks
- (CVPR 2021) Panoptic FCN:Panoptic Fully Convolutional Networks
- (ICCV 2023) Segment Anything
- (Arxiv 2023) Fast Segment Anything
- (Arxiv 2023) Advanced Data Augmentation Approaches: A Comprehensive Survey and Future directions
-
(CVPR 2019) AutoAugment: Learning Augmentation Policies from Data
-
(CVPRW 2020) Randaugment: Practical automated data augmentation with a reduced search space
-
(Arxiv 2017) Improved Regularization of Convolutional Neural Networks with Cutout
-
(AAAI 2020) Random Erasing Data Augmentation
-
(ICCV 2017) Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization
-
(Arxiv 2020) GridMask Data Augmentation
-
(ICLR 2018) Mixup: Beyond Empirical Risk Minimization
-
(ICCV 2019) CutMix: Regularization Strategy to Train Strong Classififiers with Localizable Features
- (ICCV 2023) Iterative Prompt Learning for Unsupervised Backlit Image Enhancement
- paper: CLIP-LIT
- code: https://github.com/ZhexinLiang/CLIP-LIT
- (ICCV 2023) Empowering Low-Light Image Enhancer through Customized Learnable Priors
- paper: CUE
- code: https://github.com/zheng980629/CUE
- Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance
- paper: DDNet
- code: https://github.com/QuJX/DDNet
- Dimma: Semi-supervised Low Light Image Enhancement with Adaptive Dimming
- paper: Dimma
- code: https://github.com/WojciechKoz/Dimma
- (ICCV 2023) MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
- (ICCV 2023) Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denosing
- paper: LED
- code: https://github.com/Srameo/LED
- (ICCV 2023) LightGlue: Local Feature Matching at Light Speed
- paper: LightGlue
- code: https://github.com/cvg/LightGlue
- (IPOL 2021) An Analysis and Implementation of the HDR+ Burst Denoising Method
- (Arxiv 2021) Making Images Real Again: A Comprehensive Survey on Deep Image Composition