ICCV2021-Papers-with-Code-Demo

☪️论文下载:

密码:aicv

CVPR 2021整理:https://github.com/DWCTOD/CVPR2021-Papers-with-Code-Demo

论文下载:https://pan.baidu.com/share/init?surl=gjfUQlPf73MCk4vM8VbzoA

密码:aicv

🌟 ICCV 2021持续更新最新论文/paper和相应的开源代码/code!

🚗 ICCV 2021 收录列表:https://docs.google.com/spreadsheets/u/1/d/e/2PACX-1vRfaTmsNweuaA0Gjyu58H_Cx56pGwFhcTYII0u1pg0U7MbhlgY0R6Y-BbK3xFhAiwGZ26u3TAtN5MnS/pubhtml

🚗 官网链接:http://iccv2021.thecvf.com/home

⏲️ 时间 ⌚ 论文/paper接收公布时间:2021年7月23日

✋ ​注:欢迎各位大佬提交issue,分享ICCV 2021论文/paper和开源项目!共同完善这个项目

✈️ 为了方便下载,已将论文/paper存储在文件夹中 ✔️ 表示论文/paper已下载 / Paper Download

🎆 欢迎进群 | Welcome

ICCV 2021 论文/paper交流群已成立!已经收录的同学,可以添加微信:nvshenj125,请备注:ICCV+姓名+学校/公司名称!一定要根据格式申请,可以拉你进群。

🔨 目录 |Table of Contents(点击直接跳转)

Backbone

✔️Conformer: Local Features Coupling Global Representations for Visual Recognition

Reg-IBP: Efficient and Scalable Neural Network Robustness Training via Interval Bound Propagation

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

返回目录/back

Dataset

✔️FineAction: A Fined Video Dataset for Temporal Action Localization

✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

返回目录/back

Visual Transformer

✔️Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

✔️Visual Transformer with Statistical Test for COVID-19 Classification

返回目录/back

目标检测/Object Detection

Active Learning for Deep Object Detection via Probabilistic Modeling

Conditional Variational Capsule Network for Open Set Recognition

DetCo: Unsupervised Contrastive Learning for Object Detection

Detecting Invisible People

MDETR : Modulated Detection for End-to-End Multi-Modal Understanding

Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)

返回目录/back

Image Semantic Segmentation

Enhanced Boundary Learning for Glass-like Object Segmentation

Personalized Image Semantic Segmentation

返回目录/back

实例分割/Instance Segmentation

CDNet: Centripetal Direction Network for Nuclear Instance Segmentation

✔️Crossover Learning for Fast Online Video Instance Segmentation

✔️Instances as Queries

Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)

返回目录/back

GAN

Manifold Matching via Deep Metric Learning for Generative Modeling

返回目录/back

Geometric deep learning

Manifold Matching via Deep Metric Learning for Generative Modeling

返回目录/back

Human Actions

Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition

✔️FineAction: A Fined Video Dataset for Temporal Action Localization

✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

返回目录/back

Pose Estimation

返回目录/back

Face Reconstruction

Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing

返回目录/back

行人重识别/Re-Identification

TransReID: Transformer-based Object Re-Identification

Face-Anti-spoofing

CL-Face-Anti-spoofing

返回目录/back

视频插帧/Video Frame Interpolation

✔️XVFI: eXtreme Video Frame Interpolation(Oral)

返回目录/back

NeRF

GNeRF: GAN-based Neural Radiance Field without Posed Camera

In-Place Scene Labelling and Understanding with Implicit Scene Representation (Oral)

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction (Oral)

返回目录/back

超分辨/Super-Resolution

Learning for Scale-Arbitrary Super-Resolution from Scale-Specific Networks

Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation

返回目录/back

Image Reconstruction

Equivariant Imaging: Learning Beyond the Range Space (Oral)

返回目录/back

人机交互/Hand-object Interaction

✔️CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

返回目录/back

点云/Point Cloud

InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring

MVP Benchmark: Multi-View Partial Point Clouds for Completion and Registration

Unsupervised Point Cloud Pre-Training via View-Point Occlusion, Completion

Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis

返回目录/back

字体生成/Font Generation

✔️Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts

返回目录/back

Autonomous-Driving

Road-Challenge-Event-Detection-for-Situation-Awareness-in-Autonomous-Driving

Social NCE: Contrastive Learning of Socially-aware Motion Representations

返回目录/back

Visdrone_detection

ICCV2021_Visdrone_detection

返回目录/back

其他/Others

Cross-Camera Convolutional Color Constancy

Learnable Boundary Guided Adversarial Training

Prior-Enhanced network with Meta-Prototypes (PEMP)

MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

Generalized-Shuffled-Linear-Regression (Oral)

VLGrammar: Grounded Grammar Induction of Vision and Language

返回目录/back