ICCV2021-Papers-with-Code-Demo

☪️ Paper download:

Password: aicv

CVPR 2021 collection: https://github.com/DWCTOD/CVPR2021-Papers-with-Code-Demo

Paper download: https://pan.baidu.com/share/init?surl=gjfUQlPf73MCk4vM8VbzoA

Password: aicv

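🌟 ICCV 2021: continuously updated with the latest papers and their open-source code!

🚗 ICCV 2021 accepted paper list: https://docs.google.com/spreadsheets/u/1/d/e/2PACX-1vRfaTmsNweuaA0Gjyu58H_Cx56pGwFhcTYII0u1pg0U7MbhlgY0R6Y-BbK3xFhAiwGZ26u3TAtN5MnS/pubhtml

🚗 Official website: http://iccv2021.thecvf.com/home

⏲️ Timeline ⌚ Paper acceptance announcement date: July 23, 2021

✋ Note: issues are welcome. Please share ICCV 2021 papers and open-source projects to help improve this project together.

✈️ For easier downloading, the papers have been stored in a folder. ✔️ indicates that the paper has been downloaded.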

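🎆 Join the group | Welcome

An ICCV 2021 paper discussion group has been set up! Authors with accepted papers can add WeChat: nvshenj125, with the note: ICCV + name + school/company. Requests must follow this format to be added to the group.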

🔨 Table of Contents (click to jump)

Backbone
Dataset
Visual Transformer
Object Detection
Image Semantic Segmentation
Instance Segmentation
GAN
Geometric deep learning
Human Actions
Pose Estimation
Face Reconstruction
Re-Identification
Face-Anti-spoofing
Video Frame Interpolation
NeRF
Super-Resolution
Image Reconstruction
Hand-object Interaction
Point Cloud
Font Generation
Autonomous-Driving
Visdrone_detection
Others

Backbone

✔️Conformer: Local Features Coupling Global Representations for Visual Recognition

Paper: https://arxiv.org/abs/2105.03889
Code: https://github.com/pengzhiliang/Conformer

Reg-IBP: Efficient and Scalable Neural Network Robustness Training via Interval Bound Propagation

Paper: None
Code: https://github.com/harrywuhust2022/Reg_IBP_ICCV2021

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

Paper: https://arxiv.org/abs/2105.02498
Code: https://github.com/KingJamesSong/DifferentiableSVD

Back to contents

Dataset

✔️FineAction: A Fine-Grained Video Dataset for Temporal Action Localization

Paper: https://arxiv.org/abs/2105.11107 | Homepage
Code: None

✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

Paper: https://arxiv.org/abs/2105.07404 | Homepage
Code: https://github.com/MCG-NJU/MultiSports/

Back to contents

Visual Transformer

✔️Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Paper: https://arxiv.org/abs/2101.11986
Code: https://github.com/yitu-opensource/T2T-ViT

✔️Visual Transformer with Statistical Test for COVID-19 Classification

Paper: https://arxiv.org/abs/2107.05334
Code: None

Back to contents

Object Detection

Active Learning for Deep Object Detection via Probabilistic Modeling

Paper: https://arxiv.org/abs/2103.16130
Code: None

Conditional Variational Capsule Network for Open Set Recognition

Paper: https://arxiv.org/abs/2104.09159
Code: https://github.com/guglielmocamporese/cvaecaposr

DetCo: Unsupervised Contrastive Learning for Object Detection

Paper: https://arxiv.org/abs/2102.04803
Code: https://github.com/xieenze/DetCo

Detecting Invisible People

Paper: https://arxiv.org/abs/2012.08419 | Homepage
Code: None

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Paper: https://arxiv.org/abs/2104.12763 | Homepage
Code: https://github.com/ashkamath/mdetr

Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)

Paper: https://arxiv.org/abs/2107.11669
Code: https://github.com/kemaloksuz/RankSortLoss

Back to contents

Image Semantic Segmentation

Enhanced Boundary Learning for Glass-like Object Segmentation

Paper: https://arxiv.org/abs/2103.15734
Code: https://github.com/hehao13/EBLNet

Personalized Image Semantic Segmentation

Paper: None
Code: https://github.com/zhangyuygss/PIS

Back to contents

Instance Segmentation

CDNet: Centripetal Direction Network for Nuclear Instance Segmentation

Paper: None
Code: https://github.com/2021-ICCV/CDNet

✔️Crossover Learning for Fast Online Video Instance Segmentation

Paper: https://arxiv.org/abs/2104.05970
Code: https://github.com/hustvl/CrossVIS

✔️Instances as Queries

Paper: https://arxiv.org/abs/2105.01928
Code: https://github.com/hustvl/QueryInst

Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)

Paper: https://arxiv.org/abs/2107.11669
Code: https://github.com/kemaloksuz/RankSortLoss

Back to contents

GAN

Manifold Matching via Deep Metric Learning for Generative Modeling

Paper: https://arxiv.org/abs/2106.10777
Code: https://github.com/dzld00/pytorch-manifold-matching

Back to contents

Geometric deep learning

Manifold Matching via Deep Metric Learning for Generative Modeling

Paper: https://arxiv.org/abs/2106.10777
Code: https://github.com/dzld00/pytorch-manifold-matching

Back to contents

Human Actions

Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition

Paper: https://arxiv.org/abs/2107.12213
Code: https://github.com/Uason-Chen/CTR-GCN

✔️FineAction: A Fine-Grained Video Dataset for Temporal Action Localization

Paper: https://arxiv.org/abs/2105.11107 | Homepage
Code: None

✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

Paper: https://arxiv.org/abs/2105.07404 | Homepage
Code: https://github.com/MCG-NJU/MultiSports/

Back to contents

Pose Estimation

Paper: https://arxiv.org/pdf/2104.03304.pdf | Homepage
Code: None

Back to contents

Face Reconstruction

Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing

Paper: https://arxiv.org/abs/2103.15432
Code: None

Back to contents

Re-Identification

TransReID: Transformer-based Object Re-Identification

Paper: https://arxiv.org/abs/2102.04378
Code: https://github.com/heshuting555/TransReID

Face-Anti-spoofing

CL-Face-Anti-spoofing

Paper: None
Code: https://github.com/xxheyu/CL-Face-Anti-spoofing

Back to contents

Video Frame Interpolation

✔️XVFI: eXtreme Video Frame Interpolation (Oral)

Paper: https://arxiv.org/abs/2103.16206
Code: https://github.com/JihyongOh/XVFI

Back to contents

NeRF

GNeRF: GAN-based Neural Radiance Field without Posed Camera

Paper: https://arxiv.org/abs/2103.15606 | Homepage
Code: https://github.com/MQ66/gnerf

In-Place Scene Labelling and Understanding with Implicit Scene Representation (Oral)

Paper: https://arxiv.org/abs/2103.15875 | Homepage
Code: None

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

Paper: https://arxiv.org/abs/2103.13744 | Homepage
Code: https://github.com/creiser/kilonerf

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

Paper: https://arxiv.org/abs/2104.00677 | Homepage
Code: None

UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction (Oral)

Paper: https://arxiv.org/abs/2104.10078 | Homepage
Code: None

Back to contents

Super-Resolution

Learning for Scale-Arbitrary Super-Resolution from Scale-Specific Networks

Paper: https://arxiv.org/abs/2004.03791
Code: https://github.com/LongguangWang/ArbSR

Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation

Paper: None
Code: https://github.com/Anonymous-iccv2021-paper3163/CaFM-Pytorch

Back to contents

Image Reconstruction

Equivariant Imaging: Learning Beyond the Range Space (Oral)

Paper: https://arxiv.org/abs/2103.14756
Code: https://github.com/edongdongchen/EI

Back to contents

Hand-object Interaction

✔️CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

Paper: https://arxiv.org/abs/2012.00924
Code: https://github.com/lixiny/CPF

Back to contents

Point Cloud

InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring

Paper: https://arxiv.org/pdf/2103.01128.pdf
Code: https://github.com/CurryYuan/InstanceRefer

MVP Benchmark: Multi-View Partial Point Clouds for Completion and Registration

Paper: None | Homepage
Code: https://github.com/paul007pl/MVP_Benchmark

Unsupervised Point Cloud Pre-Training via View-Point Occlusion, Completion

Paper: https://arxiv.org/abs/2010.01089 | Homepage
Code: https://github.com/hansen7/OcCo

Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis

Paper: https://arxiv.org/abs/2105.01288v1 | Homepage
Code: https://github.com/tiangexiang/CurveNet

Back to contents

Font Generation

✔️Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts

Paper: https://arxiv.org/abs/2104.00887
Code: https://github.com/clovaai/mxfont

Back to contents

Autonomous-Driving

Road-Challenge-Event-Detection-for-Situation-Awareness-in-Autonomous-Driving

Paper: None
Code: https://github.com/Trevorchenmsu/Road-Challenge-Event-Detection-for-Situation-Awareness-in-Autonomous-Driving

Social NCE: Contrastive Learning of Socially-aware Motion Representations

Paper: https://arxiv.org/abs/2012.11717
Code: https://github.com/vita-epfl/social-nce-crowdnav

Back to contents

Visdrone_detection

ICCV2021_Visdrone_detection

Paper: None
Code: https://github.com/Gumpest/ICCV2021_Visdrone_detection

Back to contents

Others

Cross-Camera Convolutional Color Constancy

Paper: https://arxiv.org/abs/2011.11164
Code: https://github.com/mahmoudnafifi/C5

Learnable Boundary Guided Adversarial Training

Paper: https://arxiv.org/abs/2011.11164
Code: https://github.com/FPNAS/LBGAT

Prior-Enhanced network with Meta-Prototypes (PEMP)

Paper: None
Code: https://github.com/PaperSubmitAAAA/ICCV2021-2337

MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

Paper: https://arxiv.org/abs/2104.12763 | Homepage
Code: https://github.com/ashkamath/mdetr

Generalized-Shuffled-Linear-Regression (Oral)

Paper: https://drive.google.com/file/d/1Qu21VK5qhCW8WVjiRnnBjehrYVmQrDNh/view
Code: https://github.com/SILI1994/Generalized-Shuffled-Linear-Regression

VLGrammar: Grounded Grammar Induction of Vision and Language

Paper: https://arxiv.org/abs/2103.12975
Code: https://github.com/evelinehong/VLGrammar

Back to contents
