CVMLL-Awesome/WACV-2023-Papers

52CV-WACV-Papers

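Categorized collection of survey papers from past years: ↘️CV-Surveys (under construction)

Categorized collections of 2022 papers:

↘️CVPR-2022-Papers ↘️WACV-2022-Papers

Categorized collections of 2021 papers:

↘️ICCV-2021-Papers ↘️CVPR-2021-Papers

Categorized collections of 2020 papers: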

↘️CVPR-2020-Papers ↘️ECCV-2020-Papers

❗❗❗🌟🌟🌟Categorization complete

Table of Contents

🐶	🐭	🐹	🐯
53.Gaze Estimation	54.Optical Flow	55.Object Counting
49.Debiasing	50.Sign Language Translation	51.SSC(Semantic Scene Completion)	52.Eye Tracking
45.Class-Incremental Learning	46.Metric Learning	47.Data Augmentation	48.Light Fields
41.Action Generation	42.Landmark Detection	43.Active Learning	44.Multi-Task Learning
37.OT(Object Tracking)	38.Sound(Audio Processing)	39.Style Transfer	40.AD(Anomaly Detection)
33.View Synthesis	34.SLAM\Robots	35.VQA(Visual Question Answering)	36.Soft Biometrics
29.Image Classification	30.RL(Reinforcement Learning)	31.Deepfake Detection	32.Continual Learning
25.Image Captioning	26.Dataset	27.Defect Detection	28.OPE(Object Pose Estimation)
21.PC(Point Cloud)	22.HAR(Human Action Recognition and Detection)	23.AD(Autonomous Driving)	24.Image Retrieval
17.OCR(Text Detection)	18.NAS(Neural Architecture Search)	19.MC\KD\Pruning(Model Compression\Knowledge Distillation\Pruning)	20.Transformer
13.Image Segmentation	14.SSL(Semi-Supervised Learning)	15.Image Synthesis	16.SR(Super-Resolution)
9.RS\Satellite Image(Remote Sensing\Satellite Image)	10.AL(Adversarial Learning)	11.Face	12.FSL or DA\G(Few-Shot Learning or Domain Adaptation\Generalization)
5.OD(Object Detection)	6.Video	7.Pose(Human Pose)	8.Image Processing
1.Others	2.Medical Image	3.3D(3D Vision)	4.GAN(Generative Adversarial Networks)

Human Motion Prediction

Multi-view Tracking Using Weakly Supervised Human Motion Prediction
⭐code
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation
⭐code
GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction

Sound

Audio Visual Event Localization
- AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization
Audio Denoising
- BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Audio-Visual Segmentation
- Unsupervised Audio-Visual Lecture Segmentation
  🏠project

Style Transfer

Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style Transfer
⭐code

Scene Graph Generation

Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing
⭐code🏠project

Person Search
- Gallery Filter Network for Person Search
  ⭐code

57.Federated Learning

Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning
⭐code

56.Vision-Language

55.Object Counting

54.Optical Flow

Weakly-Supervised Optical Flow Estimation for Time-of-Flight

53.Gaze Estimation

Iris Localization
- Segmentation-free Direct Iris Localization Networks

52.Eye Tracking

51.Semantic Scene Completion(SSC)

50.Sign Language Translation

49.Debiasing

48.Light Fields

47.Data Augmentation

Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation
⭐code

46.Metric Learning

45.Class-Incremental Learning

44.Multi-Task Learning

43.Active Learning

42.Landmark Detection

41.Action Generation

40.Anomaly Detection

Asymmetric Student-Teacher Networks for Industrial Anomaly Detection
⭐code

39.Style Transfer

38.Sound(Audio Processing)

37.Object Tracking

Multi-Object Tracking
- AttTrack: Online Deep Attention Transfer for Multi-object Tracking

36.Soft Biometrics

Finger Vein Recognition
- Analysis of Master Vein Attacks on Finger Vein Recognition Systems

35.VQA(Visual Question Answering)

34.SLAM\Robots

33.View Synthesis

32.Continual Learning

31.Deepfake Detection

Image Forgery
- CFL-Net: Image Forgery Localization Using Contrastive Learning
  ⭐code

30.Reinforcement Learning

29.Image Classification

Long-Tailed Recognition
- Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition
Open-Set Classification
- Large-Scale Open-Set Classification Protocols for ImageNet

28.Pose Estimation

6D
- CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers
  ⭐code

27.Defect Detection

26.Dataset\Benchmark

OpenEarthMap: A Benchmark Dataset for Global High-Resolution Land Cover Mapping
🌻dataset

25.Image Captioning

24.Image Retrieval

Boosting vision transformers for image retrieval
⭐code
Image-Sentence Retrieval
- Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Image-Text Retrieval
- Dissecting Deep Metric Learning Losses for Image-Text Retrieval
  ⭐code

23.Autonomous Driving

IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes
⭐code

22.Human Action Recognition and Detection

Action Recognition

21.Point Cloud

20.Transformer

19.Model Compression\Knowledge Distillation\Pruning

18.NAS(Neural Architecture Search)

17.OCR(Text Detection)

16.Super-Resolution

Single Image Super-Resolution via a Dual Interactive Implicit Neural Network

15.Image Synthesis

14.Un\Self\Semi-Supervised Learning

13.Image Segmentation

12.One\Few-Shot Learning or Domain Adaptation\Generalization\Shift

11.Face

My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization
Face Recognition
- DigiFace-1M: 1 Million Digital Face Images for Face Recognition
  ⭐code
Face Swapping
- FaceOff: A Video-to-Video Face Swapping System
Lip Reading
- Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Face Restoration
- AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models
Facial Expression Recognition
- Uncertainty-aware Label Distribution Learning for Facial Expression Recognition
  ⭐code
Face Reenactment
- Audio-Visual Face Reenactment
  🏠project
Expression-Based Facial Wrinkle Synthesis
- Mesh-Tension Driven Expression-Based Wrinkles for Synthetic Faces
Face Naming
- Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss

10.Adversarial Learning

Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks

9.Remote Sensing\Satellite Image

8.Image Processing

Image Restoration
- Large-to-small Image Resolution Asymmetry in Deep Metric Learning
  ⭐code
Image Enhancement
- Perceptual Image Enhancement for Smartphone Real-Time Applications
  ⭐code
Image Colorization
- Guiding Users to Where to Give Color Hints for Efficient Interactive Sketch Colorization via Unsupervised Region Prioritization
HDR Reconstruction
- Single-Image HDR Reconstruction by Multi-Exposure Generation
  ⭐code

7.Human Pose

Multi-Person Pose Estimation
- SoMoFormer: Multi-Person Pose Forecasting with Transformers
  🏠project
3D Human
- Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
- Uplift and Upsample: Efficient 3D Human Pose Estimation with Uplifting Transformers
  ⭐code
Hand Reconstruction
- THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision
  ⭐code

6.Video

Video Understanding
- Generic Event Boundary Detection
  - Motion Aware Self-Supervision for Generic Event Boundary Detection
    ⭐code
Multi-Person Detection
- Two-level Data Augmentation for Calibrated Multi-view Detection
  ⭐code
Scene Recognition
- MovieCLIP: Visual Scene Recognition in Movies
  🏠project
Video Grounding
- Language-free Training for Zero-shot Video Grounding
Video Anomaly Detection(VAD)
- DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network
Image/Video Coding
- Universal Deep Image Compression via Content-Adaptive Optimization with Adapters
  ⭐code

5.Object Detection

4.GAN(Generative Adversarial Networks)

HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial Networks
Fashion Attribute Editing
- Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

3.3D(3D Vision)

2.Medical Image

Chest X-ray Classification
- Probabilistic Integration of Object Level Annotations in Chest X-ray Classification
CT Image Fusion
- Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion

1.Others