☪️论文下载:
密码:aicv
CVPR 2021整理:https://github.com/DWCTOD/CVPR2021-Papers-with-Code-Demo
论文下载:https://pan.baidu.com/share/init?surl=gjfUQlPf73MCk4vM8VbzoA
密码:aicv
🌟 ICCV 2021持续更新最新论文/paper和相应的开源代码/code!
🚗 ICCV 2021 收录列表:https://docs.google.com/spreadsheets/u/1/d/e/2PACX-1vRfaTmsNweuaA0Gjyu58H_Cx56pGwFhcTYII0u1pg0U7MbhlgY0R6Y-BbK3xFhAiwGZ26u3TAtN5MnS/pubhtml
🚗 官网链接:http://iccv2021.thecvf.com/home
⏲️ 时间 ⌚ 论文/paper接收公布时间:2021年7月23日
✋ 注:欢迎各位大佬提交issue,分享ICCV 2021论文/paper和开源项目!共同完善这个项目
✈️ 为了方便下载,已将论文/paper存储在文件夹中 ✔️ 表示论文/paper已下载 / Paper Download
ICCV 2021 论文/paper交流群已成立!已经收录的同学,可以添加微信:nvshenj125,请备注:ICCV+姓名+学校/公司名称!一定要根据格式申请,可以拉你进群。
- Backbone
- Dataset
- Visual Transformer
- 目标检测/Object Detection
- Image Semantic Segmentation
- 实例分割/Instance Segmentation
- GAN
- Geometric deep learning
- Human Actions
- Pose Estimation
- Face Reconstruction
- 行人重识别/Re-Identification
- Face-Anti-spoofing
- 视频插帧/Video Frame Interpolation
- NeRF
- 超分辨/Super-Resolution
- Image Reconstruction
- 人机交互/Hand-object Interaction
- 点云/point cloud
- 字体生成/Font Generation
- Autonomous-Driving
- Visdrone_detection
- 其他/Others
✔️Conformer: Local Features Coupling Global Representations for Visual Recognition
Reg-IBP: Efficient and Scalable Neural Network Robustness Training via Interval Bound Propagation
- 论文/paper:None
- 代码/code:https://github.com/harrywuhust2022/Reg_IBP_ICCV2021
Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
- 论文/paper:https://arxiv.org/abs/2105.02498
- 代码/code:https://github.com/KingJamesSong/DifferentiableSVD
✔️FineAction: A Fined Video Dataset for Temporal Action Localization
-
论文/paper:https://arxiv.org/abs/2105.11107 | 主页/Homepage
-
代码/code: None
✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
-
论文/paper:https://arxiv.org/abs/2105.07404 | 主页/Homepage
✔️Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
-
论文/paper:https://arxiv.org/abs/2101.11986
✔️Visual Transformer with Statistical Test for COVID-19 Classification
-
论文/paper:https://arxiv.org/abs/2107.05334
-
代码/code: None
Active Learning for Deep Object Detection via Probabilistic Modeling
-
论文/paper:https://arxiv.org/abs/2103.16130
-
代码/code:None
Conditional Variational Capsule Network for Open Set Recognition
DetCo: Unsupervised Contrastive Learning for Object Detection
-
论文/paper:https://arxiv.org/abs/2102.04803
-
代码/code: https://github.com/xieenze/DetCo
Detecting Invisible People
- 论文/paper:https://arxiv.org/abs/2012.08419 | 主页/Homepage
- 代码/code:None
MDETR : Modulated Detection for End-to-End Multi-Modal Understanding
- 论文/paper:https://arxiv.org/abs/2104.12763 | 主页/Homepage
- 代码/code: https://github.com/ashkamath/mdetr
Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)
Enhanced Boundary Learning for Glass-like Object Segmentation
-
论文/paper:https://arxiv.org/abs/2103.15734
Personalized Image Semantic Segmentation
- 论文/paper:None
- 代码/code: https://github.com/zhangyuygss/PIS
CDNet: Centripetal Direction Network for Nuclear Instance Segmentation
-
论文/paper:None
-
代码/code: https://github.com/2021-ICCV/CDNet
✔️Crossover Learning for Fast Online Video Instance Segmentation
-
论文/paper:https://arxiv.org/abs/2104.05970
-
代码/code: https://github.com/hustvl/CrossVIS
✔️Instances as Queries
- 论文/paper:https://arxiv.org/abs/2105.01928
- 代码/code: https://github.com/hustvl/QueryInst
Rank & Sort Loss for Object Detection and Instance Segmentation (Oral)
Manifold Matching via Deep Metric Learning for Generative Modeling
- 论文/paper:https://arxiv.org/abs/2106.10777
- 代码/code:https://github.com/dzld00/pytorch-manifold-matching
Manifold Matching via Deep Metric Learning for Generative Modeling
- 论文/paper:https://arxiv.org/abs/2106.10777
- 代码/code:https://github.com/dzld00/pytorch-manifold-matching
Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
-
论文/paper:https://arxiv.org/abs/2107.12213
✔️FineAction: A Fined Video Dataset for Temporal Action Localization
-
论文/paper:https://arxiv.org/abs/2105.11107 | 主页/Homepage
-
代码/code: None
✔️MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
-
论文/paper:https://arxiv.org/abs/2105.07404 | 主页/Homepage
-
论文/paper:https://arxiv.org/pdf/2104.03304.pdf | 主页/Homepage
-
代码/code: None
Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing
-
论文/paper:https://arxiv.org/abs/2103.15432
-
代码/code:None
TransReID: Transformer-based Object Re-Identification
CL-Face-Anti-spoofing
-
论文/paper:None
✔️XVFI: eXtreme Video Frame Interpolation(Oral)
-
论文/paper:https://arxiv.org/abs/2103.16206
-
代码/code: https://github.com/JihyongOh/XVFI
GNeRF: GAN-based Neural Radiance Field without Posed Camera
- 论文/paper:https://arxiv.org/abs/2103.15606 | 主页/Homepage
- 代码/code:https://github.com/MQ66/gnerf
In-Place Scene Labelling and Understanding with Implicit Scene Representation (Oral)
- 论文/paper:https://arxiv.org/abs/2103.15875 | 主页/Homepage
- 代码/code:None
KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
- 论文/paper:https://arxiv.org/abs/2104.00677 | 主页/Homepage
- 代码/code:None
UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction (Oral)
-
论文/paper:https://arxiv.org/abs/2104.10078 | 主页/Homepage
-
代码/code:None
Learning for Scale-Arbitrary Super-Resolution from Scale-Specific Networks
-
论文/paper:https://arxiv.org/abs/2004.03791
Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation
-
论文/paper:None
-
代码/code: https://github.com/Anonymous-iccv2021-paper3163/CaFM-Pytorch
Equivariant Imaging: Learning Beyond the Range Space (Oral)
-
论文/paper:https://arxiv.org/abs/2103.14756
✔️CPF: Learning a Contact Potential Field to Model the Hand-object Interaction
-
论文/paper:https://arxiv.org/abs/2012.00924
-
代码/code:https://github.com/lixiny/CPF
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
MVP Benchmark: Multi-View Partial Point Clouds for Completion and Registration
- 论文/paper:None |主页/Homepage
- 代码/code:https://github.com/paul007pl/MVP_Benchmark
Unsupervised Point Cloud Pre-Training via View-Point Occlusion, Completion
- 论文/paper:https://arxiv.org/abs/2010.01089 |主页/Homepage
- 代码/code:https://github.com/hansen7/OcCo
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
✔️Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts
-
论文/paper:https://arxiv.org/abs/2104.00887
Road-Challenge-Event-Detection-for-Situation-Awareness-in-Autonomous-Driving
-
论文/paper:None
Social NCE: Contrastive Learning of Socially-aware Motion Representations
ICCV2021_Visdrone_detection
-
论文/paper:None
-
代码/code:https://github.com/Gumpest/ICCV2021_Visdrone_detection
Cross-Camera Convolutional Color Constancy
-
论文/paper:https://arxiv.org/abs/2011.11164
Learnable Boundary Guided Adversarial Training
-
论文/paper:https://arxiv.org/abs/2011.11164
-
代码/code:https://github.com/FPNAS/LBGAT
Prior-Enhanced network with Meta-Prototypes (PEMP)
- 论文/paper:None
- 代码/code:https://github.com/PaperSubmitAAAA/ICCV2021-2337
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
- 论文/paper:https://arxiv.org/abs/2104.12763 | 主页/Homepage
- 代码/code:https://github.com/ashkamath/mdetr
Generalized-Shuffled-Linear-Regression (Oral)
- 论文/paper:https://drive.google.com/file/d/1Qu21VK5qhCW8WVjiRnnBjehrYVmQrDNh/view
- 代码/code:https://github.com/SILI1994/Generalized-Shuffled-Linear-Regression
VLGrammar: Grounded Grammar Induction of Vision and Language
- 论文/paper:https://arxiv.org/abs/2103.12975
- 代码/code:https://github.com/evelinehong/VLGrammar