CVPR-2022

CVPR-2022-Papers

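Official website: https://cvpr2022.thecvf.com/

Conference dates: June 19-24, 2022

❣❣❣ The CVPR 2022 accepted papers have been announced: 2,067 in total! Preprints of some papers are still being released, and this document is updated continuously.

❗❗❗ Updated April 29: 6 papers.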

Contrastive Learning
- Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework
  ⭐code
Action Recognition
- Hybrid Relation Guided Set Matching for Few-shot Action Recognition
  ⭐code📰explainer
Shape Matching
- Deep Orientation-Aware Functional Maps: Tackling Symmetry Issues in Shape Matching
  ⭐code
Object Detection
- Rotationally Equivariant 3D Object Detection
  🏠project
- Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection
  ⭐code📰explainer
View Synthesis
- NeurMiPs: Neural Mixture of Planar Experts for View Synthesis
  ⭐code🏠project📺video📰explainer

❗❗❗ Updated April 28: 10 papers.

6D
- Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
  ⭐code📰explainer
Sketch Recognition
- Leveraging Unlabeled Data for Sketch-based Understanding
Point Cloud
- Density-preserving Deep Point Cloud Compression
  ⭐code🏠project📰explainer
Shape Matching
- A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching
  ⭐code
Knowledge Distillation
- DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
Hand-Object Reconstruction
- Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution
Head Swapping
- Few-Shot Head Swapping in the Wild
  😮oral⭐code🏠project📺video📰explainer
Segmentation
- Self-Supervised Learning of Object Parts for Semantic Segmentation
- MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation
  🏠project
Other
- Balanced MSE for Imbalanced Visual Regression
  😮oral⭐code
  📰CVPR 2022 (Oral) | Imbalanced regression labels? Try Balanced MSE

🐱	🐶	🐯	🐺
1.Other	2.Image Segmentation	3.Image Processing	4.Image Captioning
5.Object Detection	6.Object Tracking	7.Point Cloud	8.Action Detection/Recognition
9.Human Pose Estimation	10.3D Vision	11.Face	12.Image-to-Image Translation
13.GAN	14.Video	15.Transformer	16.Semi/self-supervised learning
17.Medical Image	18.Person Re-Identification	19.Neural Architecture Search	20.Autonomous vehicles
21.UAV/Remote Sensing/Satellite Image	22.Image Synthesis/Generation	23.Image Retrieval	24.Super-Resolution
25.Fine-Grained/Image Classification	26.GCN/GNN	27.Object Pose Estimation	28.Style Transfer
29.Augmented Reality/Virtual Reality/Robotics	30.Visual Question Answering	31.Vision-Language	32.Data Augmentation
33.Human-Object Interaction	34.Model Compression/Knowledge Distillation/Pruning	35.OCR	36.Optical Flow
37.Contrastive Learning	38.Meta-Learning	39.Continual Learning	40.Adversarial Learning

Animation

Image Animation

Thin-Plate Spline Motion Model for Image Animation
Human Animation
- Structured Local Radiance Fields for Human Avatar Modeling
3D character animation
- Skinning Prediction
  - SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters
    🏠project
3D Dance Generation
- Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory

Neural rendering

Gaze Estimation

GazeOnce: Real-Time Multi-Person Gaze Estimation

Sound

Sound Source Localization
- Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes
  ⭐code

Visual Emotion Analysis

MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis

Novel View Synthesis

Dataset

Sign Language Translation

A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation

Human Motion Forecasting

Optics, Geometry, and Light Field Imaging

Light Field
- Occlusion-Aware Cost Constructor for Light Field Depth Estimation
  ⭐code📰overview
Depth Reconstruction
- Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection
  ⭐code🏠project📺video

Anomaly Detection

Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection
⭐code

Image Geo-localization

TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization
⭐code
Visual Geo-localization
- Rethinking Visual Geo-localization for Large-Scale Applications
  ⭐code
- Deep Visual Geo-localization Benchmark
  😮oral🏠project
Trajectory Reconstruction
- MonoTrack: Shuttle trajectory reconstruction from monocular badminton video

Visual Grounding

Multi-View Transformer for 3D Visual Grounding
⭐code

Few/Zero-Shot Learning/Domain Generalization/Adaptation

Few-Shot
- Few-shot Learning with Noisy Labels
- Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference
  🏠project📺video
Zero-Shot
- MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
  ⭐code📰overview
- Unseen Classes at a Later Time? No Problem
  ⭐code
Domain Generalization
- Compound Domain Generalization via Meta-Knowledge Encoding
- Causality Inspired Representation Learning for Domain Generalization
- Towards Unsupervised Domain Generalization
  This work targets domain generalization (DG). It is the first to extend DG to the unsupervised learning setting, and it proposes a new research area, unsupervised domain generalization (UDG).
- Out-of-Domain Generalization
  - The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization
Domain Adaptation
- Continual Test-Time Domain Adaptation
  ⭐code
- Safe Self-Refinement for Transformer-based Domain Adaptation
  ⭐code📰explainer
- Unsupervised Domain Adaptation
  - Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation
    ⭐code

Dense Prediction

Does Robustness on ImageNet Transfer to Downstream Tasks?

Federated Learning

Multi-Task Learning

Incremental Learning

40.Adversarial Learning

39.Continual Learning

38.Meta-Learning

37.Contrastive Learning

36.Optical Flow

35.OCR

Scene Text Detection
- Towards End-to-End Unified Scene Text Detection and Layout Analysis
  ⭐code
- Pushing the Performance Limit of Scene Text Recognizer without Human Annotation
Text Spotting
- Text Spotting Transformers
  ⭐code📰overview
Logo Design
- Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
  ⭐code
Font Generation
- XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation
Text Recognition
- Open-set Text Recognition via Character-Context Decoupling

34.Model Compression/Knowledge Distillation/Pruning

33.Human-Object Interaction

32.Data Augmentation

31.Vision-Language

30.Visual Question Answering

29.Augmented Reality/Virtual Reality/Robotics

Object Goal Navigation
- Online Learning of Reusable Abstract Models for Object Goal Navigation
try-on
- Dressing in the Wild by Watching Dance Videos
  🏠project
- Style-Based Global Appearance Flow for Virtual Try-On
  ⭐code
- ClothFormer: Taming Video Virtual Try-on in All Module
  😮oral⭐code🏠project📰explainer

28.Style Transfer

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
⭐code
Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation
⭐code
Motion Style Transfer
- Style-ERD: Responsive and Coherent Online Motion Style Transfer
Motion Transfer
- Structure-Aware Motion Transfer with Deformable Anchor Model
  ⭐code📰explainer

27.Object Pose Estimation

26.GCN/GNN

25.Fine-Grained/Image Classification

Fine-Grained Classification
- Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information
  ⭐code📰overview
Image Classification
- DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification
  ⭐code
- Contrastive Test-Time Adaptation
  🏠project
Few-Shot Classification
- CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
- Matching Feature Sets for Few-Shot Image Classification
  ⭐code🏠project📺video
- Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification
  😮oral⭐code🏠project📰explainer
- Few-Shot Classification and Segmentation (FS-CS)
  - Integrative Few-Shot Learning for Classification and Segmentation
Long-Tailed Recognition
- Nested Collaborative Learning for Long-Tailed Visual Recognition
- Long-Tailed Recognition via Weight Balancing
  ⭐code
Fine-Grained Recognition
- Knowledge Mining with Scene Text for Fine-Grained Recognition
  ⭐code

24.Super-Resolution

Learning Graph Regularisation for Guided Super-Resolution

23.Image Retrieval

22.Image Synthesis/Generation

Interactive Image Synthesis with Panoptic Layout Generation
Autoregressive Image Generation using Residual Quantization
⭐code📰overview
GIRAFFE HD: A High-Resolution 3D-aware Generative Model
Arbitrary-Scale Image Synthesis
⭐code📰overview
Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis
⭐code📰explainer
Text-Guided Image Manipulation
- ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation
  😮oral🏠project
Pose-Guided Image Synthesis
- Exploring Dual-task Correlation for Pose Guided Person Image Generation
  ⭐code📰overview
Text-to-Image Synthesis
- StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
Image Translation
- FlexIT: Towards Flexible Semantic Image Translation
- A Style-aware Discriminator for Controllable Image Translation
Image Generation
- Marginal Contrastive Correspondence for Guided Image Generation
  😮oral

21.UAV/Remote Sensing/Satellite Image

Remote Sensing Image Fusion
- HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening
  ⭐code📰overview
Aerial Image Segmentation
- Revisiting Near/Remote Sensing with Geospatial Attention

20.Autonomous vehicles

Autonomous Driving
- Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
- Exploiting Temporal Relations on Radar Perception for Autonomous Driving
Lane Detection
Lane Description
- Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
  ⭐code
Behavior Prediction
- 🐦️JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
Autonomous Driving Scene Relighting
- SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks
  🏠project

19.Neural Architecture Search

18.Person Re-Identification

Reid
Crowd Counting
- Leveraging Self-Supervision for Cross-Domain Crowd Counting
- Boosting Crowd Counting via Multifaceted Attention
  ⭐code
Pedestrian Detection
- STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes
  ⭐code
Gait Recognition
- Gait Recognition in the Wild with Dense 3D Representations and A Benchmark
  ⭐code🏠project
Person Search
- PSTR: End-to-End One-Step Person Search With Transformers
  ⭐code

17.Medical Image

Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides
DiRA: Discriminative, Restorative, and Adversarial Learning for Self-supervised Medical Image Analysis
⭐code📰explainer
Surpassing the Human Accuracy: Detecting Gallbladder Cancer from USG Images with Curriculum Learning
⭐code🏠project
3D Bioprinting
- Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers
SR (MRI)
- Transformer-empowered Multi-scale Contextual Matching and Aggregation for Multi-contrast MRI Super-resolution
  ⭐code
Medical Image Registration
- Affine Medical Image Registration with Coarse-to-Fine Vision Transformer
  ⭐code

16.Semi/self-supervised learning

15.Transformer

14.Video

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
😮oral
Action Segmentation
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering
  📺video
- Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos
Action Understanding
- How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs
- Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
  ⭐code
Video Copy Detection
- A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection
  ⭐code
Video Synthesis
- Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
  ⭐code
Video Anomaly Detection
- Generative Cooperative Learning for Unsupervised Video Anomaly Detection
- Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection
Video Surveillance
- Trajectory Prediction
Video Moment Retrieval and Highlight Detection
- UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection
  ⭐code
- Learning Pixel-Level Distinctions for Video Highlight Detection
Video Moment Retrieval
- AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
Video Prediction
- STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
- Continual Predictive Learning from Videos
  😮oral⭐code
Video Individual Counting
- DR.VIC: Decomposition and Reasoning for Video Individual Counting
  ⭐code
Video Frame Interpolation
- Many-to-many Splatting for Efficient Video Frame Interpolation
  ⭐code
- TimeReplayer: Unlocking the Potential of Event Cameras for Video Interpolation
- Long-term Video Frame Interpolation via Feature Propagation
- Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion
Visual Correspondence (Video)
- Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning
  ⭐code
Video Classification
- Zero-Shot Video Classification
  - Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
Video Prediction
- Hand Motion Prediction
  - Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos
    🏠project📺video
Video Segmentation
- Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
  ⭐code
- Video Instance Segmentation (VIS)
  - Efficient Video Instance Segmentation via Tracklet Query and Proposal
    🏠project📺video📰overview
  - Temporally Efficient Vision Transformer for Video Instance Segmentation
    😮oral⭐code📰explainer
- Video Semantic Segmentation
  - Coarse-to-Fine Feature Mining for Video Semantic Segmentation
    ⭐code
- Video Panoptic Segmentation
  - Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
    😮oral⭐code📰explainer
Video Processing
- Video Super-Resolution
  - Reference-based Video Super-Resolution Using Multi-Camera Video Triplets
  - Learning Trajectory-Aware Transformer for Video Super-Resolution
    😮oral⭐code
  - Investigating Tradeoffs in Real-World Video Super-Resolution
    ⭐code📰explainer
  - BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment
    ⭐code🏠project📺video
    🏆Champion of the NTIRE 2021 Video Restoration and Enhancement Challenge
  - Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling
- Video Restoration
  - Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature
    ⭐code
- Video Inpainting
  - Towards An End-to-End Framework for Flow-Guided Video Inpainting
- Video Demoireing
  - Video Demoireing with Relation-Based Temporal Consistency
    🏠project📺video
- Video Deblurring
  - Multi-Scale Memory-Based Video Deblurring
- Video Denoising
  - Dancing under the stars: video denoising in starlight
    ⭐code
- Film Restoration
  - Bringing Old Films Back to Life
    ⭐code
Self-Supervised Video Representation Learning
- Hierarchical Self-supervised Representation Learning for Movie Understanding
  ⭐code🏠project
- Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
- Video Contrastive Learning
  - Probabilistic Representations for Video Contrastive Learning
Video Decomposition
- Deformable Sprites for Unsupervised Video Decomposition
  😮oral🏠project
Video Shadow Detection
- Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training
  ⭐code

13.GAN

12.Image-to-Image Translation

11.Face

Protecting Celebrities with Identity Consistency Transformer
Deepfake
- Voice-Face Homogeneity Tells Deepfake
  ⭐code📰overview
Makeup Transfer
- Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-robust Makeup Transfer
Face Recognition
Facial Expression Recognition
- Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin
  ⭐code
3D Face
- ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations
Face Anti-Spoofing
- PatchNet: A Simple Face Anti-Spoofing Framework via Fine-Grained Patch Recognition
Face Forgery Detection
- Exploring Frequency Adversarial Attacks for Face Forgery Detection
Face Swapping
- High-resolution Face Swapping via Latent Semantics Disentanglement
  ⭐code
Facial Attribute Classification
- Fair Contrastive Learning for Facial Attribute Classification
  ⭐code
Face Relighting
- Face Relighting with Geometrically Consistent Shadows
Face Editing
- TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
  ⭐code🏠project
Face Hallucination
- Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination
Deepfake Detection
- Detecting Deepfakes with Self-Blended Images
  😮oral⭐code
Face Reconstruction
- JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction
  ⭐code🏠project📰explainer
Face Capture
- EMOCA: Emotion Driven Monocular Face Capture and Animation
  🏠project
Head Swapping
- Few-Shot Head Swapping in the Wild
  😮oral⭐code🏠project📺video📰explainer

10.3D Vision

9.Human Pose Estimation

COAP: Compositional Articulated Occupancy of People
⭐code🏠project📺video📰explainer
Context-Aware Sequence Alignment using 4D Skeletal Augmentation
😮oral⭐code🏠project
Video-Based HPE
- Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation
  😮oral⭐code
3D pose
4D Human Capture
- H4D: Human 4D Modeling by Learning Neural Compositional Representation
Gesture Generation
- Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
3D Hand Mesh Estimation
- HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network
3D Shape Generation
- Towards Implicit Text-Guided 3D Shape Generation
- 3D Dog Shape
  - BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information
    🏠project
Motion Capture
- Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture
  🏠project
Arm-Hand Dynamic Estimation
- Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation
3D Hand Reconstruction
- LISA: Learning Implicit Shape and Appearance of Hands
  🏠project
3D Human Shape
- OSSO: Obtaining Skeletal Shape from Outside
  ⭐code🏠project📺video📰explainer

8.Action Detection/Recognition

Action Detection
Temporal Action Localization
Repetitive Action Counting
- TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting
  😮oral⭐code🏠project
Group Activity Recognition
- Dual-AI: Dual-path Action Interaction Learning for Group Activity Recognition
  😮oral
- Detector-Free Weakly Supervised Group Activity Recognition
Action Quality Assessment
- FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
  😮oral⭐code🏠project📰explainer

7.Point Cloud

Shape-invariant 3D Adversarial Point Clouds
⭐code
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
REGTR: End-to-end Point Cloud Correspondences with Transformers
⭐code
Equivariant Point Cloud Analysis via Learning Orientations for Message Passing
⭐code
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
⭐code
Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation
⭐code📰explainer
3DeformRS: Certifying Spatial Deformations on Point Clouds
⭐code
Reconstructing Surfaces for Sparse Point Clouds with On-Surface Priors
⭐code📰explainer
Density-preserving Deep Point Cloud Compression
⭐code🏠project📰explainer
3D Point Cloud
- CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
  ⭐code📰overview
  CrossPoint is a simple self-supervised learning framework for 3D point cloud representation learning. Although the method is trained on a synthetic 3D object dataset, experiments on downstream tasks such as 3D object classification and 3D object part segmentation, on both synthetic and real-world datasets, demonstrate its effectiveness at learning transferable representations.
- A Unified Query-based Paradigm for Point Cloud Understanding
- WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
  ⭐code
- 3D Point Cloud Segmentation
  - Stratified Transformer for 3D Point Cloud Segmentation
    ⭐code
Point Cloud Classification
- ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
  ⭐code📰overview
Point Cloud Registration
- SC^2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration
  ⭐code
Point Cloud Completion
- Learning a Structured Latent Space for Unsupervised Point Cloud Completion
- Learning Local Displacements for Point Cloud Completion

6.Object Tracking

TCTrack: Temporal Contexts for Aerial Tracking
⭐code📰overview
Correlation-Aware Deep Tracking
Global Tracking Transformers
⭐code
Unified Transformer Tracker for Object Tracking
⭐code
Global Tracking via Ensemble of Local Trackers
Unsupervised Learning of Accurate Siamese Tracking
⭐code
3D Object Tracking
- Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds
  ⭐code📰overview
Multi-Object Tracking
- Learning of Global Objective for Network Flow in Multi-Object Tracking
- MeMOT: Multi-Object Tracking with Memory
  😮oral
RGB-T Tracking
- Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline
  🏠project📰explainer

5.Object Detection

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
⭐code📰overview
Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation
⭐code
Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild
⭐code📰overview
Focal and Global Knowledge Distillation for Detectors
⭐code📰explainer
A knowledge distillation method for object detection: with only 30 lines of code it yields consistent gains across anchor-based and anchor-free, one-stage and two-stage detectors. The code has been open-sourced.
Real-time Object Detection for Streaming Perception
⭐code
Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
⭐code
Optimal Correction Cost for Object Detection Evaluation
Expanding Low-Density Latent Regions for Open-Set Object Detection
⭐code
SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
⭐code
Zero-Query Transfer Attacks on Context-Aware Object Detectors
AdaMixer: A Fast-Converging Query-Based Object Detector
😮oral⭐code
Learning to Detect Mobile Objects from LiDAR Scans Without Labels
⭐code
Forecasting from LiDAR via Future Object Detection
⭐code
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection
😮oral
Multi-Granularity Alignment Domain Adaptation for Object Detection
Proper Reuse of Image Classification Features Improves Object Detection
⭐code
R(Det)^2: Randomized Decision Routing for Object Detection
Towards Robust Adaptive Object Detection under Noisy Annotations
⭐code
Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Interactive Segmentation and Visualization for Tiny Objects in Multi-megapixel Images
⭐code
Few-Shot Object Detection
- Sylph: A Hypernetwork Framework for Incremental Few-shot Object Detection
- Few-Shot Object Detection with Fully Cross-Transformer
Object Localization
- Weakly Supervised Object Localization as Domain Adaption
  ⭐code📰overview
- Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
3D Object Detection
- A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation
- Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
  ⭐code📰overview
- Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
  🏠project
- Point2Seq: Detecting 3D Objects as Sequences
  ⭐code
- MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection
  ⭐code
- LiDAR Snowfall Simulation for Robust 3D Object Detection
  😮oral⭐code
- CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection
- Homography Loss for Monocular 3D Object Detection
- HyperDet3D: Learning a Scene-conditioned 3D Object Detector
- DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection
  ⭐code
- OccAM's Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data
  ⭐code
- Focal Sparse Convolutional Networks for 3D Object Detection
  😮oral⭐code📰explainer
- Rotationally Equivariant 3D Object Detection
  🏠project
Camouflaged Object Detection
- Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection
  ⭐code
Omni-Supervised Object Detection
- Omni-DETR: Omni-Supervised Object Detection with Transformers
  ⭐code
Semi-Supervised Object Detection
- Dense Learning based Semi-Supervised Object Detection
  ⭐code📰explainer
Salient Object Detection
- Pyramid Grafting Network for One-Stage High Resolution Saliency Detection
  ⭐code📰explainer
- Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection
  ⭐code📰explainer
Keypoint Detection
- Self-Supervised Equivariant Learning for Oriented Keypoint Detection

4.Image Captioning

3.Image Processing

Image Restoration
- Attentive Fine-Grained Structured Sparsity for Image Restoration
  ⭐code📰explainer
Image Inpainting
- Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding
  ⭐code📰overview
- MAT: Mask-Aware Transformer for Large Hole Image Inpainting
  ⭐code
Image Stitching
- Deep Rectangling for Image Stitching: A Learning Baseline
  ⭐code📰overview
Motion Deblurring
- Unifying Motion Deblurring and Frame Interpolation with Events
image outpainting
- Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
  🏠project
Image Aesthetics Assessment
- Personalized Image Aesthetics Assessment with Rich Attributes
  🏠project
Image Quality Assessment
- Incorporating Semi-Supervised and Positive-Unlabeled Learning for Boosting Full Reference Image Quality Assessment
  ⭐code📰explainer
Image Deraining
- Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond
  ⭐code
Image Deblurring
- Learning to Deblur using Light Field Generated and Real Defocus Images
  ⭐code🏠project
Image Denoising
- CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image
  ⭐code
- NAN: Noise-Aware NeRFs for Burst-Denoising
Image Enhancement
- Toward Fast, Flexible, and Robust Low-Light Image Enhancement
  😮oral⭐code📰explainer

2.Image Segmentation

1.Other

Papers not yet released

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval

CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image

Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation

Sources
[Two Systems in Thinking: Dual-System Transformer for Grounded Situation Recognition]
[Autoregressive Image Generation using Residual Quantization]
✔️Instance-wise Occlusion and Depth Orders in Natural Scenes
[Style Neophile: Constantly Seeking Novel Styles for Domain Generalization]
[ReSTR: Convolution-free Referring Image Segmentation Using Transformers]
[FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation]
[TransforMatcher: Match-to-Match Attention for Semantic Correspondence]
[Reflection and Rotation Symmetry Detection via Equivariant Learning]
[Semi-supervised Semantic Segmentation with Error Localization Network]
[Future Transformer for Long-term Action Anticipation]
[Self-Taught Metric Learning without Labels]
✔️Fast Point Transformer
[Integrative Few-Shot Learning for Classification and Segmentation]
[Scene Painting via Semantic Image Synthesis]
[Detector-Free Weakly Supervised Group Activity Recognition]

solarlee/CVPR-2022