Original article: https://mp.weixin.qq.com/s/SmS-guwg6oUqPYwfeC6iiw
Synchronized updates: http://bbs.cvmart.net/topics/302/cvpr2019paper
CVPR 2019 accepted papers list:
http://cvpr2019.thecvf.com/files/cvpr_2019_final_accept_list.txt
Paper PDF downloads (being updated)
Link: https://pan.baidu.com/s/1s4FuLscWcslN5rQQvP92JA
Extraction code: osvy
Related paper links (you are welcome to recommend your own CVPR 2019 papers, and we will add them promptly):
75.Self-supervised Learning of Dense Shape Correspondence (Oral Presentation)
Authors: Oshri Halimi, Or Litany, Emanuele Rodolà, Alex Bronstein, Ron Kimmel
Paper: https://arxiv.org/abs/1812.02415
74.A Kernelized Manifold Mapping to Diminish the Effect of Adversarial Perturbations
Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Shekoofeh Azizi, Ghassan Hamarneh
Paper: http://cs.sfu.ca/~hamarneh/ecopy/cvpr2019.pdf
73.RepMet: Representative-based metric learning for classification and one-shot object detection
Authors: Leonid Karlinsky, Joseph Shtok, Sivan Harary, Eli Schwartz, Amit Aides, Rogerio Feris, Raja Giryes, Alex M. Bronstein
Paper: https://arxiv.org/abs/1806.04728
72.Handwriting Recognition in Low-resource Scripts using Adversarial Learning
Authors: Ayan Kumar Bhunia, Abhirup Das, Ankan Kumar Bhunia, Perla Sai Raj Kishore, Partha Pratim Roy
Paper: https://arxiv.org/pdf/1811.01396.pdf
71.DeepMapping: Unsupervised Map Estimation From Multiple Point Clouds
Authors: Li Ding, Chen Feng
Paper: https://arxiv.org/abs/1811.11397
70.Multi-Step Prediction of Occupancy Grid Maps with Recurrent Neural Networks
Authors: Nima Mohajerin, Mohsen Rohani
Paper: https://arxiv.org/pdf/1812.09395.pdf
69.On the Continuity of Rotation Representations in Neural Networks
Authors: Yi Zhou, Connelly Barnes, Jingwan Lu, Jimei Yang, Hao Li
Paper: https://arxiv.org/pdf/1812.07035.pdf
68.LaSOT: A High-quality Benchmark for Large-scale Single Object Tracking (object tracking)
Authors: Heng Fan, Liting Lin, Fan Yang, Peng Chu, Ge Deng, Sijia Yu, Hexin Bai, Yong Xu, Chunyuan Liao, Haibin Ling
Paper: https://arxiv.org/pdf/1809.07845.pdf
Project: https://cis.temple.edu/lasot/
67.Siamese Cascaded Region Proposal Networks for Real-Time Visual Tracking (CRPN, object tracking)
Authors: Heng Fan, Haibin Ling
Paper: https://arxiv.org/pdf/1812.06148.pdf
66.SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks (object tracking)
Authors: Bo Li, Wei Wu, Qiang Wang, Fangyi Zhang, Junliang Xing, Junjie Yan
Paper: https://arxiv.org/pdf/1812.11703.pdf
Project: http://bo-li.info/SiamRPN++/
Commentary: https://mp.weixin.qq.com/s/dB5u2No8eakLnrjto0kvyQ
65.Deeper and Wider Siamese Networks for Real-Time Visual Tracking (CIR, object tracking)
Authors: Zhipeng Zhang, Houwen Peng
Paper: https://arxiv.org/pdf/1901.01660.pdf
Code: https://gitlab.com/MSRA_NLPR/deeper_wider_siamese_trackers
64.Fast Online Object Tracking and Segmentation: A Unifying Approach (SiamMask, object tracking)
Authors: Qiang Wang, Li Zhang, Luca Bertinetto, Weiming Hu, Philip H.S. Torr
Paper: https://arxiv.org/abs/1812.05050
Project: http://www.robots.ox.ac.uk/~qwang/SiamMask/
63.Mask Scoring R-CNN
Authors: Zhaojin Huang, Lichao Huang, Yongchao Gong, Chang Huang, Xinggang Wang
Paper: https://arxiv.org/abs/1903.00241
62.Octree guided CNN with Spherical Kernels for 3D Point Clouds
Authors: Huan Lei, Naveed Akhtar, Ajmal Mian
Paper: https://arxiv.org/abs/1903.00343
61.Context-Aware Visual Compatibility Prediction
Authors: Guillem Cucurull, Perouz Taslakian, David Vazquez
Paper: https://arxiv.org/abs/1902.03646
60.Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation
Authors: Anurag Ranjan, Varun Jampani, Kihwan Kim, Deqing Sun, Jonas Wulff, Michael J. Black
Paper: https://arxiv.org/pdf/1805.09806.pdf
Reading note: Single view depth prediction, camera motion estimation, optical flow, and segmentation of a video into the static scene and moving regions are challenging but coupled problems. Our key insight is that these four fundamental vision problems are coupled through geometric constraints. Thus, we introduce Competitive Collaboration, a framework that facilitates the coordinated training of multiple specialized neural networks to solve complex problems.
59.Neural RGB-D Sensing: Depth estimation from a video
Authors: Chao Liu, Jinwei Gu, Kihwan Kim, Srinivasa Narasimhan, Jan Kautz
Paper: https://arxiv.org/pdf/1901.02571.pdf
Project: https://research.nvidia.com/publication/2019-06_Neural-RGBD
Reading note: In this paper, we propose a deep learning (DL) method to estimate per-pixel depth and its uncertainty continuously from a monocular video stream, with the goal of effectively turning an RGB camera into an RGB-D camera. Unlike prior DL-based methods, we estimate a depth probability distribution for each pixel rather than a single depth value, leading to an estimate of a 3D depth probability volume for each input frame.
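To make the idea of a per-pixel depth probability distribution concrete, here is a minimal sketch of how a depth map and an uncertainty map could be read out of a depth probability volume. It is not the paper's network or code: the function name, the (D, H, W) array layout, and the choice of mean and standard deviation as the readout are assumptions made for illustration.

```python
import numpy as np

def depth_from_probability_volume(prob, depth_bins):
    """Read a depth map and an uncertainty map out of a probability volume.

    prob:       array of shape (D, H, W), non-negative scores over D candidate
                depths for every pixel (hypothetical layout).
    depth_bins: array of shape (D,), the candidate depth values.
    """
    prob = np.asarray(prob, dtype=np.float64)
    depth_bins = np.asarray(depth_bins, dtype=np.float64)

    # Normalise so that each pixel's scores sum to 1 along the depth axis.
    prob = prob / prob.sum(axis=0, keepdims=True)

    d = depth_bins[:, None, None]                 # broadcast to (D, 1, 1)
    mean = (prob * d).sum(axis=0)                 # expected depth per pixel
    var = (prob * (d - mean) ** 2).sum(axis=0)    # variance per pixel
    return mean, np.sqrt(var)                     # depth map, uncertainty map


# Two pixels: one confident (one-hot), one maximally uncertain (uniform).
bins = np.array([1.0, 2.0, 3.0])
volume = np.zeros((3, 1, 2))
volume[1, 0, 0] = 1.0          # pixel 0: all mass on depth 2
volume[:, 0, 1] = 1.0 / 3.0    # pixel 1: uniform over the three depths
depth, sigma = depth_from_probability_volume(volume, bins)
```

Both pixels get the same expected depth here, but only the uniform pixel reports a non-zero standard deviation, which is the kind of information a single depth value per pixel cannot carry.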
58.PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image
Authors: Chen Liu, Kihwan Kim, Jinwei Gu, Yasutaka Furukawa, Jan Kautz
Paper: https://arxiv.org/pdf/1812.04072.pdf
Project: https://research.nvidia.com/publication/2019-06_PlaneRCNN
Reading note: This paper proposes a deep neural architecture, PlaneRCNN, that detects and reconstructs piecewise planar surfaces from a single RGB image. PlaneRCNN employs a variant of Mask R-CNN to detect planes with their plane parameters and segmentation masks. PlaneRCNN then jointly refines all the segmentation masks with a novel loss enforcing the consistency with a nearby view during training.
57.A General and Adaptive Robust Loss Function (Oral Presentation)
Authors: Jonathan T. Barron
Paper: https://arxiv.org/abs/1701.03077
Reading note: A single robust loss function is a superset of many other common robust loss functions, and allows training to automatically adapt the robustness of its own loss.
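The reading note does not spell out the loss, so the sketch below writes down the general form from the paper, with a shape parameter alpha and a scale c, to show how one function covers several familiar robust losses. The function name is hypothetical, and this is a plain NumPy evaluation only. The adaptive part, where alpha is learned during training, is not shown.

```python
import numpy as np

def general_robust_loss(x, alpha, c):
    """Evaluate the general robust loss for residual x, shape alpha, scale c.

    Special cases (the limits of the general expression):
      alpha = 2     -> scaled L2 loss
      alpha = 1     -> smoothed L1 (Charbonnier / pseudo-Huber)
      alpha = 0     -> Cauchy (Lorentzian) loss
      alpha = -2    -> Geman-McClure loss
      alpha = -inf  -> Welsch loss
    """
    z = (np.asarray(x, dtype=np.float64) / c) ** 2

    # The general expression is singular at these three values,
    # so use the corresponding limits instead.
    if alpha == 2:
        return 0.5 * z
    if alpha == 0:
        return np.log1p(0.5 * z)
    if alpha == -np.inf:
        return 1.0 - np.exp(-0.5 * z)

    b = abs(alpha - 2.0)
    return (b / alpha) * ((z / b + 1.0) ** (alpha / 2.0) - 1.0)


# Same residual, increasingly robust settings: large errors are penalised less.
residual = 5.0
losses = [general_robust_loss(residual, a, 1.0) for a in (2, 1, 0, -2, -np.inf)]
```

For a fixed residual the loss shrinks as alpha decreases, which is what lets a single shape parameter trade off between least squares and strongly outlier-tolerant behaviour.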
56.Learning to Synthesize Motion Blur (Oral Presentation)
Authors: Tim Brooks, Jonathan T. Barron
Paper: https://arxiv.org/abs/1811.11745
Project: http://timothybrooks.com/tech/motion-blur/
Reading note: Frame interpolation techniques can be used to train a network to directly synthesize linear motion blur.
55.Unprocessing Images for Learned Raw Denoising (Oral Presentation)
Authors: Tim Brooks, Ben Mildenhall, Tianfan Xue, Jiawen Chen, Dillon Sharlet, Jonathan T. Barron
Paper: https://arxiv.org/abs/1811.11127
Project: http://timothybrooks.com/tech/unprocessing/
Reading note: We can learn a better denoising model by processing and unprocessing images the same way a camera does.
54.PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image
Authors: Chen Liu, Kihwan Kim, Jinwei Gu, Yasutaka Furukawa, Jan Kautz
Paper: https://arxiv.org/abs/1812.04072
53.Bi-Directional Cascade Network for Perceptual Edge Detection
Authors: Jianzhong He, Shiliang Zhang, Ming Yang, Yanhu Shan, Tiejun Huang
Paper: https://arxiv.org/abs/1902.10903
GitHub: https://github.com/pkuCactus/BDCN
52.Dual Attention Network for Scene Segmentation
Authors: Jun Fu, Jing Liu, Haijie Tian, Yong Li, Yongjun Bao, Zhiwei Fang, Hanqing Lu
Paper: https://arxiv.org/abs/1809.02983
GitHub: https://github.com/junfu1115/DANet
51.Object-driven Text-to-Image Synthesis via Adversarial Training
Authors: Wenbo Li, Pengchuan Zhang, Lei Zhang, Qiuyuan Huang, Xiaodong He, Siwei Lyu, Jianfeng Gao
Paper: https://arxiv.org/abs/1902.10740
50.Joint Face Detection and Facial Motion Retargeting for Multiple Faces
Authors: Bindita Chaudhuri, Noranart Vesdapunt, Baoyuan Wang
Paper: https://arxiv.org/abs/1902.10744
49.End-to-End Efficient Representation Learning via Cascading Combinatorial Optimization
Authors: Yeonwoo Jeong, Yoonsung Kim, Hyun Oh Song
Paper: https://arxiv.org/abs/1902.10990
48.Efficient Video Classification Using Fewer Frames
Authors: Shweta Bhardwaj, Mukundhan Srinivasan, Mitesh M. Khapra
Paper: https://arxiv.org/abs/1902.10640
Source: https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q
47.Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference
Authors: Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan
Paper: https://arxiv.org/abs/1902.10556
Code: https://github.com/YoYo000/MVSNet
Source: https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q
46.Single-frame Regularization for Temporally Stable CNNs (video processing)
Authors: Gabriel Eilertsen, Rafał K. Mantiuk, Jonas Unger
Paper: https://arxiv.org/abs/1902.10424
Source: https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q
45.FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference
Authors: Jungbeom Lee, Eunji Kim, Sungmin Lee, Jangho Lee, Sungroh Yoon
Paper: https://arxiv.org/abs/1902.10421
Source: https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q
44.Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
Authors: Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian
Paper: https://arxiv.org/abs/1902.10322
Source: https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q
43.Self-Supervised Generative Adversarial Networks
Authors: Ting Chen, Xiaohua Zhai, Marvin Ritter, Mario Lucic, Neil Houlsby
Paper: https://arxiv.org/abs/1811.11212
GitHub: https://github.com/google/compare_gan
42.RePr: Improved Training of Convolutional Filters
Authors: Aaditya Prakash, James Storer, Dinei Florencio, Cha Zhang
Paper: https://arxiv.org/abs/1811.07275
41.Data augmentation using learned transforms for one-shot medical image segmentation
Authors: Amy Zhao, Guha Balakrishnan, Frédo Durand, John V. Guttag, Adrian V. Dalca
Paper: https://arxiv.org/abs/1902.09383
40.Monocular Total Capture: Posing Face, Body, and Hands in the Wild
Authors: Donglai Xiang, Hanbyul Joo, Yaser Sheikh
Paper: https://arxiv.org/pdf/1812.01598.pdf
Project: http://domedb.perception.cs.cmu.edu/monototalcapture.html
39.3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans
Authors: Ji Hou, Angela Dai, Matthias Nießner
Paper: https://niessnerlab.org/projects/hou20183dsis.html
YouTube video: https://youtu.be/IH9rNLD1-JE
38.MUREL: Multimodal Relational Reasoning for Visual Question Answering
Authors: Remi Cadene, Hedi Ben-younes, Matthieu Cord, Nicolas Thome
Paper: https://arxiv.org/abs/1902.09487
GitHub: https://github.com/Cadene/murel.bootstrap.pytorch
37.ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape
Authors: Fabian Manhardt, Wadim Kehl, Adrien Gaidon
Paper: https://arxiv.org/abs/1812.02781
36.GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction
Authors: Baris Gecer, Stylianos Ploumpis, Irene Kotsia, Stefanos Zafeiriou
Paper: https://arxiv.org/abs/1902.05978
GitHub: https://github.com/barisgecer/ganfit
35.Disentangled Representation Learning for 3D Face Shape
Authors: Zi-Hang Jiang, Qianyi Wu, Keyu Chen, Juyong Zhang
Paper: https://arxiv.org/abs/1902.09887
34.RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation
Authors: Bastian Wandt, Bodo Rosenhahn
Paper: https://arxiv.org/abs/1902.09868
33.Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding (open source)
Authors: Zehao Yu, Jia Zheng, Dongze Lian, Zihan Zhou, Shenghua Gao
Paper: https://arxiv.org/abs/1902.09777
Code: https://github.com/svip-lab/PlanarReconstruction
Source: https://mp.weixin.qq.com/s/mamDhLUw6O9v8gldyIOPUA
32.Learning a Deep ConvNet for Multi-label Classification with Partial Labels (classification)
Authors: Thibaut Durand, Nazanin Mehrasa, Greg Mori
Paper: https://arxiv.org/abs/1902.09720
Source: https://mp.weixin.qq.com/s/mamDhLUw6O9v8gldyIOPUA
31.3D Hand Shape and Pose from Images in the Wild
Authors: Adnane Boukhayma, Rodrigo de Bem, Philip H.S. Torr
Paper: https://arxiv.org/pdf/1902.03451.pdf
GitHub: https://github.com/boukhayma/3dhand
30.Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression (detection)
Authors: Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, Silvio Savarese
Paper: https://arxiv.org/abs/1902.09630
Source: https://mp.weixin.qq.com/s/mamDhLUw6O9v8gldyIOPUA
Commentary: https://mp.weixin.qq.com/s/6QsyYtEVjavoLfU_lQF1pw
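The entry above gives only the title, so as a quick orientation here is a minimal sketch of the generalized IoU metric for two axis-aligned boxes: ordinary IoU minus the fraction of the smallest enclosing box that is not covered by the union. The function names and the (x1, y1, x2, y2) box convention are assumptions for illustration, not the authors' released code.

```python
def generalized_iou(box_a, box_b):
    """GIoU of two axis-aligned boxes given as (x1, y1, x2, y2), x1 < x2, y1 < y2."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)

    # Intersection rectangle; width or height clamps to 0 when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    union = area_a + area_b - inter
    iou = inter / union

    # Smallest axis-aligned box enclosing both inputs.
    enclose = (max(ax2, bx2) - min(ax1, bx1)) * (max(ay2, by2) - min(ay1, by1))

    return iou - (enclose - union) / enclose


def giou_loss(box_a, box_b):
    """Loss form: 0 for identical boxes, approaching 2 for far-apart boxes."""
    return 1.0 - generalized_iou(box_a, box_b)


same = generalized_iou((0, 0, 1, 1), (0, 0, 1, 1))
apart = generalized_iou((0, 0, 1, 1), (2, 0, 3, 1))
```

Unlike plain IoU, which is 0 for every pair of non-overlapping boxes, the value for disjoint boxes still changes with their distance, so it can serve as a regression signal in that case.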
29.Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
Authors: Yan Wang, Wei-Lun Chao, Divyansh Garg, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger
Paper: https://arxiv.org/abs/1812.07179
28.MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
Authors: Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang, Larry S. Davis
Paper: https://arxiv.org/pdf/1812.00087.pdf
27.Associatively Segmenting Instances and Semantics in Point Clouds (point cloud segmentation, open source)
Authors: Xinlong Wang, Shu Liu, Xiaoyong Shen, Chunhua Shen, Jiaya Jia
Paper: https://arxiv.org/abs/1902.09852
Code: https://github.com/WXinlong/ASIS
26.Efficient Parameter-free Clustering Using First Neighbor Relations
Authors: M. Saquib Sarfraz, Vivek Sharma, Rainer Stiefelhagen
Paper: https://arxiv.org/abs/1902.11266
25.Stereo R-CNN based 3D Object Detection for Autonomous Driving (3D detection)
Authors: Peiliang Li, Xiaozhi Chen, Shaojie Shen
Paper: https://arxiv.org/abs/1902.09738
24.Image-Question-Answer Synergistic Network for Visual Dialog
Authors: Dalu Guo, Chang Xu, Dacheng Tao
Paper: https://arxiv.org/abs/1902.09774
1.Attention-guided Unified Network for Panoptic Segmentation (panoptic segmentation)
Authors: Yanwei Li, Xinze Chen, Zheng Zhu, Lingxi Xie, Guan Huang, Dalong Du, Xingang Wang
Paper: https://arxiv.org/abs/1812.03904
Commentary: https://mp.weixin.qq.com/s/1tohID6SM3weS476XU5okw
2.Deep High-Resolution Representation Learning for Human Pose Estimation (current SOTA, open source)
Authors: Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang
Paper: https://arxiv.org/abs/1902.09212
Code: https://github.com/leoxiaobin/deep-high-resolution-net.pytorch
Commentary: https://mp.weixin.qq.com/s/ZRCzBTBmlEzQCVo1HLWtbQ
3.MUREL: Multimodal Relational Reasoning for Visual Question Answering
Authors: Remi Cadene, Hedi Ben-younes, Matthieu Cord, Nicolas Thome
Paper: https://arxiv.org/abs/1902.09487
4.End-to-End Multi-Task Learning with Attention
Authors: Shikun Liu, Edward Johns, Andrew J. Davison
Paper: https://arxiv.org/abs/1803.10704
5.SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360 degree Images
Authors: Yeon Kun Lee, Jaeseok Jeong, Jong Seob Yun, Cho Won June, Kuk-Jin Yoon
Paper: https://arxiv.org/abs/1811.08196
6.Event-based High Dynamic Range Image and Very High Frame Rate Video Generation using Conditional Generative Adversarial Networks
Authors: S. Mohammad Mostafavi I., Lin Wang, Yo-Sung Ho, Kuk-Jin Yoon
Paper: https://arxiv.org/abs/1811.08230
7.FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation
Authors: Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen
Paper: https://arxiv.org/abs/1902.09513
8.An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
Authors: Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan
Paper: https://arxiv.org/abs/1902.09130
9.Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration
Authors: De-An Huang, Suraj Nair, Danfei Xu, Yuke Zhu, Animesh Garg, Li Fei-Fei, Silvio Savarese, Juan Carlos Niebles
Paper: https://arxiv.org/abs/1807.03480
10.DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
Authors: Chen Wang, Danfei Xu, Yuke Zhu, Roberto Martín-Martín, Cewu Lu, Li Fei-Fei, Silvio Savarese
Paper: https://arxiv.org/abs/1901.04780
Commentary: https://mp.weixin.qq.com/s/wrND2cocWlPPVXPqpq-Glg
11.A Neurobiological Evaluation Metric for Neural Network Model Search
Authors: Nathaniel Blanchard, Jeffery Kinnison, Brandon RichardWebster, Pouya Bashivan, Walter J. Scheirer
Paper: https://arxiv.org/pdf/1805.10726.pdf
12.The Perfect Match: 3D Point Cloud Matching with Smoothed Densities
Authors: Zan Gojcic, Caifa Zhou, Jan D. Wegner, Andreas Wieser
Paper: https://arxiv.org/abs/1811.06879
13.Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
Authors: Mahdi Abavisani, Hamid Reza Vaezi Joze, Vishal M. Patel
Paper: https://arxiv.org/abs/1812.06145
14.Variational Bayesian Dropout
Authors: Yuhang Liu, Wenyong Dong, Lei Zhang, Dong Gong, Qinfeng Shi
Paper: https://arxiv.org/abs/1811.07533
15.LiFF: Light Field Features in Scale and Depth
Authors: Donald G. Dansereau, Bernd Girod, Gordon Wetzstein
Paper: https://arxiv.org/abs/1901.03916
16.Classification-Reconstruction Learning for Open-Set Recognition
Authors: Ryota Yoshihashi, Wen Shao, Rei Kawakami, Shaodi You, Makoto Iida, Takeshi Naemura
Paper: https://arxiv.org/abs/1812.04246
17.Weakly Supervised Deep Image Hashing through Tag Embeddings
Authors: Vijetha Gattupalli, Yaoxin Zhuo, Baoxin Li
Paper: https://arxiv.org/abs/1806.05804
18.InverseRenderNet: Learning single image inverse rendering
Authors: Ye Yu, William A. P. Smith
Paper: https://arxiv.org/abs/1811.12328
19.Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Authors: Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang
Paper: https://arxiv.org/abs/1811.10092
20.GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction
Authors: Baris Gecer, Stylianos Ploumpis, Irene Kotsia, Stefanos Zafeiriou
Paper: https://arxiv.org/abs/1902.05978
21.Iterative Residual CNNs for Burst Photography Applications
Authors: Filippos Kokkinos, Stamatis Lefkimmiatis
Paper: https://arxiv.org/abs/1811.12197
22.Mixture Density Generative Adversarial Networks
Authors: Hamid Eghbal-zadeh, Werner Zellinger, Gerhard Widmer
Paper: https://arxiv.org/abs/1811.00152
23.Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation
Authors: Yawei Luo, Liang Zheng, Tao Guan, Junqing Yu, Yi Yang
Paper: https://arxiv.org/abs/1809.09478