官网链接:https://cvpr2022.thecvf.com/
开会时间:2022年6月19日-6月24日
❣❣❣近日,CVPR 2022 接收论文公布! 总计2067篇!,部分预印版论文也陆续发布中,本文档也将持续收录更新,多多关注!!
- 形状补全
- GAN
- 航空图像分割
- 轨迹重建
- Text Spotting
- 深度估计
- 目标检测
- 组动作识别
- Visual Grounding
- 三维服装变形
- 神经渲染
- 图像合成
- VQA
- 视觉地理定位
- 几何图形
- 数据集
- 分割
- 其它
⭐[code]🏠[project]
🏠[project]
📰[粗解]
😮oral
📰粗解
- HOI
- 多任务学习
- 类增量
- 分割
- 检索
- transformer
- 时序动作定位
- 车道线检测
- 点云
- 光流估计
- 跟踪
- 目标检测
- 图像美学评估
- 去雨
- GNN
- 自监督
- 3Dope
- VQA
- VL
- VLN
- 视频
- 视频帧插值
- 电影修复
- Face Relighting(人脸重照光)
- 人脸编辑
- 人脸幻构
- 其它
- Generating High Fidelity Data from Low-density Regions using Diffusion Models
- Continuous Scene Representations for Embodied AI
⭐code🏠project - It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
- End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
- Reflection and Rotation Symmetry Detection via Equivariant Learning
- Exploiting Explainable Metrics for Augmented SGD
- 声源定位
- 数据集
- 卫星数据集
- Motron: Multimodal Probabilistic Human Motion Forecasting
- Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction
- Controllable Dynamic Multi-Task Architectures
- Task Adaptive Parameter Sharing for Multi-Task Learning
- 增量学习
- 类增量学习
- 对抗样本
- 对抗攻击
- 对抗
- On Generalizing Beyond Domains in Cross-Domain Continual Learning
- Probing Representation Forgetting in Supervised and Unsupervised Continual Learning
- Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries
⭐code
- What Matters For Meta-Learning Vision Regression Tasks?
- Multidimensional Belief Quantification for Label-Efficient Meta-Learning
- Selective-Supervised Contrastive Learning with Noisy Labels
⭐code📰粗解 - Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning
⭐code
- 剪枝
- 知识蒸馏
- 模型压缩
- HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
- MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
- GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection
⭐code - OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction
⭐code - D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
🏠code
- 🐦️AlignMix: Improving representation by interpolating aligned features
- 3D Common Corruptions and Data Augmentation
⭐code🏠project📺video📰粗解 - Kubric: A scalable dataset generator
- Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
⭐code - Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation
⭐code - 运动风格迁移
- Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships
- VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
- VLN
- VQA
- AVQA
- 目标导航
- try-on
- OSOP: A Multi-Stage One Shot Object Pose Estimation Framework
- 9D
- 单目目标姿势估计
- 6D
- 3D Object Articulation
- 3Dope
- 零样本
- 域泛化
- 域适应
图像动画
- Thin-Plate Spline Motion Model for Image Animation
- 人物动画
- 3D character animation(三维角色动画)
- 3D 舞蹈生成
- 细粒度分类
- 图像分类
- 小样本分类
- 长尾识别
- 细粒度识别
- 视频超分辨率
- 图像超分辨率
- Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval
⭐code - Sketch3T: Test-Time Training for Zero-Shot SBIR
- 文本-视频检索
- 跨模太检索
- Interactive Image Synthesis with Panoptic Layout Generation
- Autoregressive Image Generation using Residual Quantization
⭐code📰粗解 - GIRAFFE HD: A High-Resolution 3D-aware Generative Model
- 姿势引导的图像合成
- 文本到图像合成
- 图像翻译
- 遥感图像融合
- 自动驾驶
- 车道线检测
- 车道线描述
- 行为预测
- Reid
- 人群计数
- Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
- BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
- 3D生物打印
- Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers
利用伤口分割和重建生成3D生物打印贴片来治疗糖尿病足溃疡
- Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers
- SR(MRI)
- 医学图像配准
- 自监督
- 半监督
- Fast Point Transformer
- ChiTransformer:Towards Reliable Stereo from Cues
- Beyond Fixation: Dynamic Window Visual Transformer
- Training-free Transformer Architecture Search
- Automated Progressive Learning for Efficient Training of Vision Transformers
⭐code - Collaborative Transformers for Grounded Situation Recognition
⭐code - TubeDETR: Spatio-Temporal Video Grounding with Transformers
😮oral⭐code🏠project - Deformable Video Transformer
- Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
😮oral - 电影修复
- 动作分割
- 动作理解
- 视频实例分割(VIS)
- Video Copy Detection(视频拷贝检测)
- 视频合成
- 视频异常检测
- 视频监控
- 视频时刻检索和视频高光检测
- 视频时刻检索
- 视频预测
- 视频个体计数
- 视频插值
- 视觉对应(视频)
- 视频分类
- Exploring Patch-wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks
- Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
- InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
- Protecting Celebrities with Identity Consistency Transformer
- Deepfake
- 妆容迁移
- 人脸识别
- 人脸表情识别
- 3D人脸
- 活体检测
- 假脸检测
- 人脸交换
- 人脸属性分类
- Face Relighting(人脸重照光)
- 人脸编辑
- 人脸幻构
- PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation
- Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images
- 深度估计
- 房间布局
- 3D
- 三维服装网格重建
- 三维形状重建
- 基于视频的HPE
- 3D pose
- 4D 人体捕获
- 手势生成
- 3D手网格估计
- 3D形状生成
- 运动捕捉
- 手臂-手部动态估计
- 动作检测
- Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
- Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
- End-to-End Semi-Supervised Learning for Video Action Detection
- SPAct: Self-supervised Privacy Preservation for Action Recognition
⭐code
- 时序动作定位
- Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
⭐code📰粗解 - Unsupervised Pre-training for Temporal Action Localization Tasks
⭐code - ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
⭐code - Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
⭐code
- Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
- Shape-invariant 3D Adversarial Point Clouds
⭐code - AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
- REGTR: End-to-end Point Cloud Correspondences with Transformers
⭐code - Equivariant Point Cloud Analysis via Learning Orientations for Message Passing
⭐code - Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
- Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
⭐code - 3D 点云
- CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
⭐code📰粗解
CrossPoint,一个用于 3D 点云表征学习的简单自监督学习框架。虽然该方法是在合成的三维物体数据集上训练的,但在下游任务中的实验结果,如三维物体分类和三维物体部分分割,在合成和真实世界的数据集中都证明了该方法在学习可迁移表征方面的有效性。 - A Unified Query-based Paradigm for Point Cloud Understanding
- WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
⭐code - 3D点云分割
- CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
- 点云分类
- 点云配准
- 点云补全
- TCTrack: Temporal Contexts for Aerial Tracking
⭐code📰粗解 - Correlation-Aware Deep Tracking
- Global Tracking Transformers
⭐code - Unified Transformer Tracker for Object Tracking
⭐code - Global Tracking via Ensemble of Local Trackers
- 3D 目标跟踪
- 多目标跟踪
- DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
⭐code📰粗解 - Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild
⭐code📰粗解 - Focal and Global Knowledge Distillation for Detectors
⭐code📰解读
关于目标检测的知识蒸馏工作,只需要30行代码就可以在 anchor-base, anchor-free 的单阶段、两阶段各种检测器上稳定涨点,现在代码已经开源。 - Real-time Object Detection for Streaming Perception
⭐code - Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition
- Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
⭐code - Optimal Correction Cost for Object Detection Evaluation
- Expanding Low-Density Latent Regions for Open-Set Object Detection
⭐code - SIOD: Single Instance Annotated Per Category Per Image for Object Detection
- Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
⭐code - Zero-Query Transfer Attacks on Context-Aware Object Detectors
- AdaMixer: A Fast-Converging Query-Based Object Detector
😮oral⭐code - Learning to Detect Mobile Objects from LiDAR Scans Without Labels
⭐code - Forecasting from LiDAR via Future Object Detection
⭐code - Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection
😮oral - Multi-Granularity Alignment Domain Adaptation for Object Detection
- 小样本目标检测
- 目标定位
- 3D
- A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation
- Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
⭐code📰粗解 - Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
🏠project - Point2Seq: Detecting 3D Objects as Sequences
⭐code - MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection
⭐code - LiDAR Snowfall Simulation for Robust 3D Object Detection
😮oral⭐code
- 伪装目标检测
- 全监督目标检测
- 字幕
- Novel Object Captioning
- 图像修复
- 图像拼接
- 图像去噪
- 运动去模糊
- image outpainting
- 图像美学评估
- 图像去雨
- ReSTR: Convolution-free Referring Image Segmentation Using Transformers
- 实例分割
- E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation
⭐code📰粗解 - Sparse Instance Activation for Real-Time Instance Segmentation
⭐code - SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation
🏠project - 半监督实例分割
- 3D 实例分割
- 🐦️FreeSOLO: Learning to Segment Objects without Annotations
- E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation
- 语义分割
- Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
⭐code📰粗解 - Deep Hierarchical Semantic Segmentation
⭐code - Semantic Segmentation by Early Region Proxy
⭐code - SimT: Handling Open-set Noise for Domain Adaptive Semantic Segmentation
⭐code - Rethinking Semantic Segmentation: A Prototype View
😮oral⭐code - On the Road to Online Adaptation for Semantic Image Segmentation
- Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds
- 弱监督语义分割
- Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation
⭐code📰粗解 - Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation
⭐code - Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
⭐code - Cross Language Image Matching for Weakly Supervised Semantic Segmentation
- Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
⭐code - Weakly Supervised Semantic Segmentation using Out-of-Distribution Data
⭐code📰粗解
- Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation
- 半监督语义分割
- Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
- 动作分割
- 场景解析
- Instance-wise Occlusion and Depth Orders in Natural Scenes
- IFOR: Iterative Flow Minimization for Robotic Object Rearrangement
🏠project - PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence
⭐code🏠project📺video📰粗解 - CAFE: Learning to Condense Dataset by Aligning Features
⭐code📰粗解 - Enhancing Adversarial Robustness for Deep Metric Learning
- BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning
⭐code📰粗解 - ACVNet: Attention Concatenation Volume for Accurate and Efficient Stereo Matching
⭐code📰粗解 - Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values
⭐code - Do Explanations Explain? Model Knows Best
⭐code - HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging
- E-CIR: Event-Enhanced Continuous Intensity Recovery
⭐code - 🐦️Transferability Estimation using Bhattacharyya Class Separability
- Interpretable part-whole hierarchies and conceptual-semantic relationships in neural networks
⭐code - GlideNet: Global, Local and Intrinsic based Dense Embedding NETwork for Multi-category Attributes Prediction
⭐code - Differentially Private Federated Learning with Local Regularization and Sparsification
- Towards Efficient and Scalable Sharpness-Aware Minimization
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos
- Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences
⭐code📰粗解 - Dynamic Dual-Output Diffusion Models
- Moving Window Regression: A Novel Approach to Ordinal Regression
- Egocentric Prediction of Action Target in 3D
- Compositional Temporal Grounding
with Structured Variational Cross-Graph Correspondence Learning
⭐code - Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction
⭐code - Neural Reflectance for Shape Recovery with Shadow Handling
⭐code - DyRep: Bootstrapping Training with Dynamic Re-parameterization
⭐code - Enhancing Classifier Conservativeness and Robustness by Polynomiality
- Versatile Multi-Modal Pre-Training for Human-Centric Perception
⭐code - Attributable Visual Similarity Learning
⭐code - Optimizing Elimination Templates by Greedy Parameter Search
- Partially Does It: Towards Scene-Level FG-SBIR with Partial Input
- Bi-level Doubly Variational Learning for Energy-based Latent Variable Models
- Brain-inspired Multilayer Perceptron with Spiking Neurons
- ARCS: Accurate Rotation and Correspondence Search
⭐code - iPLAN: Interactive and Procedural Layout Planning
- HINT: Hierarchical Neuron Concept Explainer
⭐code - Visual Abductive Reasoning
⭐code - A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration
⭐code - Learning Structured Gaussians to Approximate Deep Ensembles
- Self-Supervised Image Representation Learning with Geometric Set Consistency
- Balanced Multimodal Learning via On-the-fly Gradient Modulation
😮oral⭐code - CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
⭐code - Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation
:opem_mouth:oral - Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian
- Long-term Visual Map Sparsification with Heterogeneous GNN
- Clean Implicit 3D Structure from Noisy 2D STEM Images
- Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets
- CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism
⭐code🏠project - Fast Light-Weight Near-Field Photometric Stereo
- Fast, Accurate and Memory-Efficient Partial Permutation Synchronization
- Multi-Robot Active Mapping via Neural Bipartite Graph Matching
- Learning Program Representations for Food Images and Cooking Recipes
😮oral - Iterative Deep Homography Estimation
⭐code - Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain
- Generating High Fidelity Data from Low-density Regions using Diffusion Models
- Continuous Scene Representations for Embodied AI
⭐code🏠project - It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
- End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
- Reflection and Rotation Symmetry Detection via Equivariant Learning
- Exploiting Explainable Metrics for Augmented SGD
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
来源
[Two Systems in Thinking: Dual-System Transformer for Grounded Situation Recognition]
[Autoregressive Image Generation using Residual Quantization]
✔️Instance-wise Occlusion and Depth Orders in Natural Scenes
[Style Neophile: Constantly Seeking Novel Styles for Domain Generalization]
[ReSTR: Convolution-free Referring Image Segmentation Using Transformers]
[FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation]
[TransforMatcher: Match-to-Match Attention for Semantic Correspondence]
[Reflection and Rotation Symmetry Detection via Equivariant Learning]
[Semi-supervised Semantic Segmentation with Error Localization Network]
[Future Transformer for Long-term Action Anticipation]
[Self-Taught Metric Learning without Labels]
✔️Fast Point Transformer
[Integrative Few-Shot Learning for Classification and Segmentation]
[Scene Painting via Semantic Image Synthesis]
[Detector-Free Weakly Supervised Group Activity Recognition]