CVPR-2022-Papers

官网链接：https://cvpr2022.thecvf.com/

开会时间：2022年6月19日-6月24日

❣❣❣近日，CVPR 2022 接收论文公布！总计2067篇！，部分预印版论文也陆续发布中，本文档也将持续收录更新，多多关注!!

❗❗❗ 4月6日更新 21 篇。

形状补全
- ShapeFormer: Transformer-based Shape Completion via Sparse Representation
  ⭐code🏠project
GAN
- InsetGAN for Full-Body Image Generation
  🏠project
航空图像分割
- Revisiting Near/Remote Sensing with Geospatial Attention
轨迹重建
- MonoTrack: Shuttle trajectory reconstruction from monocular badminton video
Text Spotting
- Text Spotting Transformers
  ⭐code📰粗解
深度估计
- P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior
  ⭐code
目标检测
- Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation
  ⭐code
组动作识别
- Dual-AI: Dual-path Action Interaction Learning for Group Activity Recognition
  😮oral
- Detector-Free Weakly Supervised Group Activity Recognition
Visual Grounding
- Multi-View Transformer for 3D Visual Grounding
  ⭐code
三维服装变形
- SNUG: Self-Supervised Neural Dynamic Garments
  😮oral⭐code
神经渲染
- IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images
  😮oral🏠project
图像合成
- Arbitrary-Scale Image Synthesis
  ⭐code📰粗解
VQA
- SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
  ⭐code📰粗解
视觉地理定位
- Rethinking Visual Geo-localization for Large-Scale Applications
  ⭐code
几何图形
- Neural Convolutional Surfaces
  🏠project
- GLASS: Geometric Latent Augmentation for Shape Spaces
  ⭐code🏠project
数据集
- ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
  ⭐code🏠project📰粗解
分割
- Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic Segmentation
  ⭐code
- Semi-supervised Semantic Segmentation with Error Localization Network
  ⭐code🏠project📰粗解
其它
- Leveraging Equivariant Features for Absolute Pose Regression

❗❗❗ 4月5日更新篇。

⭐[code]🏠[project]
🏠[project] 📰[粗解]
😮oral 📰粗解

❗❗❗ 4月4日更新篇。

❗❗❗ 4月1日更新 33 篇。

HOI
- D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
  🏠code
多任务学习
- Task Adaptive Parameter Sharing for Multi-Task Learning
类增量
- Constrained Few-shot Class-incremental Learning
  ⭐code
分割
- ReSTR: Convolution-free Referring Image Segmentation Using Transformers
检索
- ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
transformer
- Deformable Video Transformer
时序动作定位
- Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
  ⭐code
车道线检测
- Towards Driving-Oriented Metric for Lane Detection Models
点云
- Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
  ⭐code
- Learning Local Displacements for Point Cloud Completion
光流估计
- CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
  ⭐code
跟踪
- MeMOT: Multi-Object Tracking with Memory
  😮oral
目标检测
- Multi-Granularity Alignment Domain Adaptation for Object Detection
图像美学评估
- Personalized Image Aesthetics Assessment with Rich Attributes
  🏠project
去雨
- Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond
  ⭐code
GNN
- AEGNN: Asynchronous Event-based Graph Neural Networks
自监督
- Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
3Dope
- Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions
  ⭐code
VQA
- SimVQA: Exploring Simulated Environments for Visual Question Answering
  🏠project
VL
- VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
VLN
- Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
视频
- Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
  😮oral
视频帧插值
- Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion
电影修复
- Bringing Old Films Back to Life
  ⭐code
Face Relighting(人脸重照光)
- Face Relighting with Geometrically Consistent Shadows
人脸编辑
- TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
  ⭐code🏠project
人脸幻构
- Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination
其它
- Generating High Fidelity Data from Low-density Regions using Diffusion Models
- Continuous Scene Representations for Embodied AI
  ⭐code🏠project
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
- End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
- Reflection and Rotation Symmetry Detection via Equivariant Learning
- Exploiting Explainable Metrics for Augmented SGD

🐱	🐶	🐯	🐺
1.其它	2.Image Segmentation(图像分割)	3.Image Progress(图像处理)	4.Image Captioning(图像字幕)
5.Object Detection(目标检测)	6.Object Tracking(目标跟踪)	7.Point Cloud(点云)	8.Action Detection(人体动作检测与识别)
9.Human Pose Estimation(人体姿态估计)	10.3D(三维视觉)	11.Face	12.Image-to-Image Translation(图像到图像翻译)
13.GAN	14.Video	15.Transformer	16.Semi/self-supervised learning(半/自监督)
17.Medical Image(医学影像)	18.Person Re-Identification(人员重识别)	19.Neural Architecture Search(神经架构搜索)	20.Autonomous vehicles(自动驾驶)

Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera

Sound

声源定位
- Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes
  ⭐code

Visual Emotion Analysis(视觉情感分析)

MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis

Novel View Synthesis(视图合成)

NPBG++: Accelerating Neural Point-Based Graphics
🏠project

Dataset(数据集)

Sign Language Translation(手语翻译)

A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation

Human Motion Forecasting(人体运动预测)

OCR

场景文本检测
- Towards End-to-End Unified Scene Text Detection and Layout Analysis
  ⭐code

Light Field(光场)

Occlusion-Aware Cost Constructor for Light Field Depth Estimation
⭐code📰粗解

Anomaly Detection(异常检测)

Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection
⭐code

Multi-Task Learning（多任务学习）

Optical Flow(光流估计)

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
⭐code

Incremental Learning（增量学习）

增量学习
- Energy-based Latent Aligner for Incremental Learning
  ⭐code
类增量学习
- Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches
- Constrained Few-shot Class-incremental Learning
  ⭐code

Adversarial Learning(对抗学习)

Continual Learning(持续学习)

Meta-Learning(元学习)

Contrastive Learning(对比学习)

Model Compression/Knowledge Distillation/Pruning(模型压缩/知识蒸馏/剪枝)

剪枝
- Searching for Network Width with Bilaterally Coupled Network
知识蒸馏
- Knowledge Distillation with the Reused Teacher Classifier
模型压缩
- CHEX: CHannel EXploration for CNN Model Compression

Human-Object Interaction(人物交互)

数据增强

Style Transfer(风格迁移)

Vision-Language(视觉语言)

Visual Answer Questions(视觉问答)

Augmented Reality/Virtual Reality/Robotics(增强/虚拟现实/机器人)

目标导航
- Online Learning of Reusable Abstract Models for Object Goal Navigation
try-on
- Dressing in the Wild by Watching Dance Videos
  🏠project

Pose Estimation(物体姿势估计)

GCN/GNN

GNN
- 🐦️Lifelong Graph Learning
  ⭐code
- AEGNN: Asynchronous Event-based Graph Neural Networks

Zero-Shot Learning/Domain Generalization/Adaptation(零样本/域泛化/适应)

零样本
- MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
  ⭐code📰粗解
- Unseen Classes at a Later Time? No Problem
  ⭐code
域泛化
- Compound Domain Generalization via Meta-Knowledge Encoding
- Causality Inspired Representation Learning for Domain Generalization
域适应
- Continual Test-Time Domain Adaptation
  ⭐code

动画

图像动画

Thin-Plate Spline Motion Model for Image Animation
人物动画
- Structured Local Radiance Fields for Human Avatar Modeling
3D character animation(三维角色动画)
- 皮肤预测
  - SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters
    🏠project
3D 舞蹈生成
- Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory

Fine-Grained/Image Classification(细粒度/图像分类)

细粒度分类
- Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information
  ⭐code📰粗解
图像分类
- DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification
  ⭐code
小样本分类
- CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
- 小样本分类与分割(FS-CS)
  - Integrative Few-Shot Learning for Classification and Segmentation
长尾识别
- Nested Collaborative Learning for Long-Tailed Visual Recognition
- Long-Tailed Recognition via Weight Balancing
  ⭐code
细粒度识别
- Knowledge Mining with Scene Text for Fine-Grained Recognition
  ⭐code

Super-Resolution(超分辨率)

视频超分辨率
- Reference-based Video Super-Resolution Using Multi-Camera Video Triplets
图像超分辨率
- Learning Graph Regularisation for Guided Super-Resolution

Image Retrieval(图像检索)

Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval
⭐code
Sketch3T: Test-Time Training for Zero-Shot SBIR
文本-视频检索
- X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
  🏠project
跨模太检索
- ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

Image Synthesis/Generation(图像合成)

Interactive Image Synthesis with Panoptic Layout Generation
Autoregressive Image Generation using Residual Quantization
⭐code📰粗解
GIRAFFE HD: A High-Resolution 3D-aware Generative Model
姿势引导的图像合成
- Exploring Dual-task Correlation for Pose Guided Person Image Generation
  ⭐code📰粗解
文本到图像合成
- StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
图像翻译
- FlexIT: Towards Flexible Semantic Image Translation
- A Style-aware Discriminator for Controllable Image Translation

UAV/Remote Sensing/Satellite Image(无人机/遥感/卫星图像)

遥感图像融合
- HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening
  ⭐code📰粗解

20.Autonomous vehicles(自动驾驶)

自动驾驶
- Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
车道线检测
- Rethinking Efficient Lane Detection via Curve Modeling
  ⭐code📰粗解
- Towards Driving-Oriented Metric for Lane Detection Models
车道线描述
- Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
  ⭐code
行为预测
- 🐦️JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection

19.Neural Architecture Search(神经架构搜索)

🐦️ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior

18.Person Re-Identification(人员重识别)

17.Medical Image(医学影像)

Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
3D生物打印
- Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers
  利用伤口分割和重建生成3D生物打印贴片来治疗糖尿病足溃疡
SR（ＭRI）
- Transformer-empowered Multi-scale Contextual Matching and Aggregation for Multi-contrast MRI Super-resolution
  ⭐code
医学图像配准
- Affine Medical Image Registration with Coarse-to-Fine Vision Transformer
  ⭐code

16.Semi/self-supervised learning(半/自监督)

15.Transformer

14.Video

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
😮oral
电影修复
- Bringing Old Films Back to Life
  ⭐code
动作分割
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering
  📺video
- Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos
动作理解
- How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs
- Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
  ⭐code
视频实例分割(VIS)
- Efficient Video Instance Segmentation via Tracklet Query and Proposal
  🏠project📺video📰粗解
Video Copy Detection(视频拷贝检测)
- A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection
  ⭐code
视频合成
- Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
  ⭐code
视频异常检测
- Generative Cooperative Learning for Unsupervised Video Anomaly Detection
- Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection
视频监控
- 轨迹预测
视频时刻检索和视频高光检测
- UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection
  ⭐code
视频时刻检索
- AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
视频预测
- STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
视频个体计数
- DR.VIC: Decomposition and Reasoning for Video Individual Counting
  ⭐code
视频插值
视觉对应（视频）
- Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning
  ⭐code
视频分类
- 零样本视频分类
  - Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification

13.GAN

🐦️HyperInverter: Improving StyleGAN Inversion via Hypernetwork
🏠project
图像篡改检测
- Proactive Image Manipulation Detection
  ⭐code

12.Image-to-Image Translation(图像到图像翻译)

11.Face(人脸)

Protecting Celebrities with Identity Consistency Transformer
Deepfake
- Voice-Face Homogeneity Tells Deepfake
  ⭐code📰粗解
妆容迁移
- Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-robust Makeup Transfer
人脸识别
人脸表情识别
- Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin
  ⭐code
3D人脸
- ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations
活体检测
- PatchNet: A Simple Face Anti-Spoofing Framework via Fine-Grained Patch Recognition
假脸检测
- Exploring Frequency Adversarial Attacks for Face Forgery Detection
人脸交换
- High-resolution Face Swapping via Latent Semantics Disentanglement
  ⭐code
人脸属性分类
- Fair Contrastive Learning for Facial Attribute Classification
  ⭐code
Face Relighting(人脸重照光)
- Face Relighting with Geometrically Consistent Shadows
人脸编辑
- TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
  ⭐code🏠project
人脸幻构
- Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination

10.3D(三维视觉)

9.Human Pose Estimation(人体姿态估计)

基于视频的HPE
- Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation
  ::oral:star:code
3D pose
4D 人体捕获
- H4D: Human 4D Modeling by Learning Neural Compositional Representation
手势生成
- Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
3D手网格估计
- HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network
3D形状生成
- Towards Implicit Text-Guided 3D Shape Generation
- 3D狗的形状
  - BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information
    🏠project
运动捕捉
- Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture
  🏠project
手臂-手部动态估计
- Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation

8.Action Detection(人体动作检测与识别)

7.Point Cloud(点云)

Shape-invariant 3D Adversarial Point Clouds
⭐code
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
REGTR: End-to-end Point Cloud Correspondences with Transformers
⭐code
Equivariant Point Cloud Analysis via Learning Orientations for Message Passing
⭐code
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
⭐code
3D 点云
- CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
  ⭐code📰粗解
  CrossPoint，一个用于 3D 点云表征学习的简单自监督学习框架。虽然该方法是在合成的三维物体数据集上训练的，但在下游任务中的实验结果，如三维物体分类和三维物体部分分割，在合成和真实世界的数据集中都证明了该方法在学习可迁移表征方面的有效性。
- A Unified Query-based Paradigm for Point Cloud Understanding
- WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
  ⭐code
- 3D点云分割
  - Stratified Transformer for 3D Point Cloud Segmentation
    ⭐code
点云分类
- ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
  ⭐code📰粗解
点云配准
- SC^2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration
  ⭐code
点云补全
- Learning a Structured Latent Space for Unsupervised Point Cloud Completion
- Learning Local Displacements for Point Cloud Completion

6.Object Tracking(目标跟踪)

TCTrack: Temporal Contexts for Aerial Tracking
⭐code📰粗解
Correlation-Aware Deep Tracking
Global Tracking Transformers
⭐code
Unified Transformer Tracker for Object Tracking
⭐code
Global Tracking via Ensemble of Local Trackers
3D 目标跟踪
- Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds
  ⭐code📰粗解
多目标跟踪
- Learning of Global Objective for Network Flow in Multi-Object Tracking
- MeMOT: Multi-Object Tracking with Memory
  😮oral

5.Object Detection(目标检测)

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
⭐code📰粗解
Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild
⭐code📰粗解
Focal and Global Knowledge Distillation for Detectors
⭐code📰解读
关于目标检测的知识蒸馏工作，只需要30行代码就可以在 anchor-base, anchor-free 的单阶段、两阶段各种检测器上稳定涨点，现在代码已经开源。
Real-time Object Detection for Streaming Perception
⭐code
Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
⭐code
Optimal Correction Cost for Object Detection Evaluation
Expanding Low-Density Latent Regions for Open-Set Object Detection
⭐code
SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
⭐code
Zero-Query Transfer Attacks on Context-Aware Object Detectors
AdaMixer: A Fast-Converging Query-Based Object Detector
😮oral⭐code
Learning to Detect Mobile Objects from LiDAR Scans Without Labels
⭐code
Forecasting from LiDAR via Future Object Detection
⭐code
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection
😮oral
Multi-Granularity Alignment Domain Adaptation for Object Detection
小样本目标检测
- Sylph: A Hypernetwork Framework for Incremental Few-shot Object Detection
- Few-Shot Object Detection with Fully Cross-Transformer
目标定位
- Weakly Supervised Object Localization as Domain Adaption
  ⭐code📰粗解
- Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation
3D
- A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation
- Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
  ⭐code📰粗解
- Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
  🏠project
- Point2Seq: Detecting 3D Objects as Sequences
  ⭐code
- MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection
  ⭐code
- LiDAR Snowfall Simulation for Robust 3D Object Detection
  😮oral⭐code
伪装目标检测
- Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection
  ⭐code
全监督目标检测
- Omni-DETR: Omni-Supervised Object Detection with Transformers
  ⭐code

4.Image Captioning(图像字幕)

3.Image Progress(图像处理)

图像修复
- Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding
  ⭐code📰粗解
- MAT: Mask-Aware Transformer for Large Hole Image Inpainting
  ⭐code
图像拼接
- Deep Rectangling for Image Stitching: A Learning Baseline
  ⭐code📰粗解
图像去噪
- CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image
  ⭐code
运动去模糊
- Unifying Motion Deblurring and Frame Interpolation with Events
image outpainting
- Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
  🏠project
图像美学评估
- Personalized Image Aesthetics Assessment with Rich Attributes
  🏠project
图像去雨
- Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond
  ⭐code

2.Image Segmentation(图像分割)

1.其它

论文尚未公布

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval

ID:Cyelie multi-Variate Function for self-supervised image denoising by disentangling noise form image

Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale

Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation

来源
[Two Systems in Thinking: Dual-System Transformer for Grounded Situation Recognition]
[Autoregressive Image Generation using Residual Quantization]
✔️Instance-wise Occlusion and Depth Orders in Natural Scenes
[Style Neophile: Constantly Seeking Novel Styles for Domain Generalization]
[ReSTR: Convolution-free Referring Image Segmentation Using Transformers]
[FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation]
[TransforMatcher: Match-to-Match Attention for Semantic Correspondence]
[Reflection and Rotation Symmetry Detection via Equivariant Learning]
[Semi-supervised Semantic Segmentation with Error Localization Network]
[Future Transformer for Long-term Action Anticipation]
[Self-Taught Metric Learning without Labels]
✔️Fast Point Transformer
[Integrative Few-Shot Learning for Classification and Segmentation]
[Scene Painting via Semantic Image Synthesis]
[Detector-Free Weakly Supervised Group Activity Recognition]

H-jh20/CVPR-2022-Papers