- Learning representations by back-propagating errors [David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams] [Link]
- Invariant Scattering Convolution Networks [Joan Bruna, Stephane Mallat] [Link]
- ImageNet Classification with Deep Convolutional Neural Networks [Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton] [Link]
- Very Deep Convolutional Networks for Large-Scale Image Recognition [Karen Simonyan, Andrew Zisserman] [arXiv]
- Max-Pooling
- Overfitting, Saturation and Dropout
- Going Deeper with Convolutions [Christian Szegedy, Wei Liu, Andrew Rabinvich] [arXiv]
- Deep Residual Learning for Image Recognition [Kaiming He, Xiangyu Zhang, Jian Sun] [arXiv]
- Aggregated Residual Transformations for Deep Neural Networks [Saining Xie, Kaiming He] [arXiv]
N/A
- Fully Convolutional Networks for Semantic Segmentation [Jonathan Long] [Link]
- Fast Approximate Energy Minimization via Graph Cuts [Yuri Boykov] [Link]
- Exact optimization for Markov random fields with convex priors [Hiroshi Ishikawa] [Link]
- “GrabCut” — Interactive Foreground Extraction using Iterated Graph Cuts [Carsten Rother] [Link]
- Synergistic Face Detection and Pose Estimation with Energy-Based Models [Margarita Osadchy, Yann LeCun, Matthew L. Miller] [Link]
- Rapid Object Detection using a Boosted Cascade of Simple Features [Paul Viola, Michael Jones] [Link]
- Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks [NYU Hand Pose Dataset]
- Hand Video and Images Collection
- Additional Articles:
- SSD: Single Shot MultiBox Detector [Arxiv]
- R-CNN: Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation [PDF]
- YOLO:
- You Only Look Once: Unified, Real-Time Object Detection [Arxiv]
- Another YOLO Paper
- Comparison
- Generative Adversarial Nets (GAN) https://arxiv.org/pdf/1406.2661v1.pdf
- Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks. Emily L. Denton et. al. Nips 2015. https://arxiv.org/abs/1506.05751
- GANs with Feature Extraction Function:
- ALI
- BiGAN
- Adversarial Autoencoder
- Adversarial Variational Bayes.
- Tutorial on Variational Autoencoders(VAEs), Carl Doersch. August, 2016. https://arxiv.org/abs/1606.05908
- Relevance: sampling from distributions inside deep networks, and can be trained with stochastic gradient descent. VAEs have already shown promise in generating many kinds of complicated data, including handwritten digits, faces, house numbers, CIFAR images, physical models of scenes, segmentation, and predicting the future from static images.
- "Variational Convolutional Networks for Human-Centric Annotations," 13th Asian Conference on Computer Vision, 2016. Tsung-Wei Ke, Che-Wei Lin, Tyng-Luh Liu and Davi Geiger,
- Relevance: Use of VAEs to annotate automatically images.
- RCNN: Region-based Convolutional Neural Networks
- Fast and Faster R-CNN: https://github.com/rbgirshick/py-faster-rcnn
- YOLO
- SSD
- Soft-NMS: https://github.com/bharatsingh430/soft-nms
机器学习自学者必读的20篇顶级论文导读: https://mp.weixin.qq.com/s/ghMj3OO2yu7IIkQEkvkdVA 常见面试机器学习方法总览: http://www.chinakdd.com/article-oyU85v018dQL0Iu.html