Transformer-in-Vision

A paper list of some recent Transformer-based CV works. If you find some ignored papers, please open issues or pull requests.

**Last updated: 2022/05/19

Update log

2021/April - update all of recent papers of Transformer-in-Vision.
2021/May - update all of recent papers of Transformer-in-Vision.
2021/June - update all of recent papers of Transformer-in-Vision.
2021/July - update all of recent papers of Transformer-in-Vision.
2021/August - update all of recent papers of Transformer-in-Vision.
2021/September - update all of recent papers of Transformer-in-Vision.
2021/October - update all of recent papers of Transformer-in-Vision.
2021/November - update all of recent papers of Transformer-in-Vision.
2021/December - update all of recent papers of Transformer-in-Vision.
2022/January - update all of recent papers of Transformer-in-Vision.
2022/February - update all of recent papers of Transformer-in-Vision.
2022/March - update all of recent papers of Transformer-in-Vision.
2022/April - update all of recent papers of Transformer-in-Vision.
2022/May - update all of recent papers of Transformer-in-Vision.

Survey:

(arXiv 2022.05) Transformers in 3D Point Clouds: A Survey. [Paper]
(arXiv 2022.03) Vision Transformers in Medical Computer Vision - A Contemplative Retrospection. [Paper]
(arXiv 2022.03) Transformers Meet Visual Learning Understanding: A Comprehensive Review. [Paper]
(arXiv 2022.03) Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work. [Paper]
(arXiv 2022.02) Transformers in Medical Image Analysis: A Review. [Paper]
(arXiv 2022.01) Transformers in Medical Imaging: A Survey. [Paper], [Awesome]
(arXiv 2022.01) A Comprehensive Study of Vision Transformers on Dense Prediction Tasks. [Paper]
(arXiv 2022.01) Video Transformers: A Survey. [Paper]
(arXiv 2021.11) A Survey of Visual Transformers. [Paper]
(arXiv 2021.09) Survey: Transformer based Video-Language Pre-training. [Paper]
(arXiv 2021.03) Multi-modal Motion Prediction with Stacked Transformers. [Paper], [Code]
(arXiv 2021.03) Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision. [Paper]
(arXiv 2020.09) Efficient Transformers: A Survey. [Paper]
(arXiv 2020.01) Transformers in Vision: A Survey. [Paper]

Contact & Feedback

If you have any suggestions about this project, feel free to contact me.

[e-mail: yzhangcst[at]gmail.com]

wutianyiRosun/Transformer-in-Computer-Vision

Transformer-in-Vision

Update log

Survey:

Recent Papers

Action

Active Learning

Anomaly Detection

Assessment

Bird's-Eye-View

Captioning

Classification (Backbone)

Completion

Compression

Cross-view

Crowd

Deblurring

Depth

Deepfake Detection

Dehazing

Denoising

Detection

Edge

Enhancement

Face

Few-shot Learning

Fusion

GAN

Gait

Gaze

Hand Gesture

HOI

Hyperspectral

Incremental Learning

In-painting

Instance Segmentation

Knowledge Distillation

Lane

Layout

Lighting

Matching

Matting

Medical

Metric learning

Motion

Multi-label

Multi-task/modal

Multi-view Stereo

NAS

Navigation

Neural Rendering

OCR

Octree

Open Set Recognition

Optical Flow

Panoptic Segmentation

Point Cloud

Pose

Planning

Pruning & Quantization

Recognition

Reconstruction

Registration

Re-identification

Restoration

Retrieval

Robotic

Salient Object Detection

Scene

Self-supervised Learning

Semantic Segmentation

Shape

Super-Resolution

Synthesis

Text-to-Image

Tracking

Traffic

Transfer learning

Translation

Texture