deep-learning-papers

Papers about deep learning ordered by task, date. Current state-of-the-art papers are labelled.

Object Recognition

Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition, okt 2016, IBM, paper
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos, aug 2016, github, arxiv
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, aug 2016, Google, arxiv
Residual Networks of Residual Networks: Multilevel Residual Networks, aug 2016, arxiv
Training Region-based Object Detectors with Online Hard Example Mining, apr 2016, Facebook, arxiv
Deep Residual Learning for Image Recognition, dec 2015, arxiv
SSD: Single Shot MultiBox Detector, dec 2015, Google, github, arxiv
ParseNet: Looking Wider to See Better, jun 2015, arxiv
You Only Look Once: Unified, Real-Time Object Detection, jun 2015, Facebook, arxiv
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, jun 2015, Microsoft/Facebook arxiv
Selective Search for Object Recognition, 2012, paper
Rich feature hierarchies for accurate object detection and semantic segmentation, 2014, paper

Pose Estimation

Fast Single Shot Detection and Pose Estimation, sep 2016, arxiv

Face Recognition

Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition, paper
OpenFace: A general-purpose face recognition library with mobile applications, June 2016, paper
Deep Face Recognition, 2015, paper
Compact Convolutional Neural Network Cascade for Face Detection, aug 2015, arxiv
Learning Robust Deep Face Representation, Jul 2015, arxiv
FaceNet: A Unified Embedding for Face Recognition and Clustering, jun 2015, paper
Multi-view Face Detection Using Deep Convolutional Neural Networks, yahoo, feb 2015, arxiv

Style Transfer

A learned representation for artistic style, okt 2016, Google, arxiv, demo
Fast Style Transfer in TensorFlow, github
https://arxiv.org/abs/1508.06576
https://arxiv.org/abs/1607.08022
http://cs.stanford.edu/people/jcjohns/eccv16/

Logo Recognition

Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks, apr 2016, arxiv
Logo Localization and Recognition in Natural Images Using Homographic Class Graphs, 2016, paper
LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks, nov 2015, arxiv
DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer, okt 2015, Berkely, arxiv
Automatic detection of logos in video and their removal using inpainting, jul 2015, paper
On the Benefit of Synthetic Data for Company Logo Detection, 2015, paper
Fast and Robust Realtime Storefront Logo Recognition, paper
Scalable Logo Recognition in Real-World Images, 2011, paper
https://arxiv.org/pdf/1609.01414v1.pdf

note: also includes some papers that use SIFT

Text (in the Wild) Recognition

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images, jun 2016, arxiv
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, mar 2016, arxiv
Efficient Scene Text Localization and Recognition with Local Character Refinement, apr 2015, arxiv
Reading Text in the Wild with Convolutional Neural Networks, dec 2014, arxiv
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition, jun 2014, arxiv
Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning, 2011, paper

ConvNet visualization

Visualizing and Understanding Convolutional Networks, Nov 2013, arxiv

Image Segmentation

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, dec 2015, arxiv

Pedestrian Detection

Joint Deep Learning for Pedestrian Detection, 2013, paper

Super Resolution

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network, sep 2016, Twitter, arxiv

Automated Theorem Proving

DeepMath - Deep Sequence Models for Premise Selection, jun 2016, Google arxiv

Reverse Engineering

Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data, okt 2016, arxiv
Stealing Machine Learning Models via Prediction APIs, aug 2016, paper

Language

Rationalizing Neural Predictions github, arxiv

Other

Architecture and optimization

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size, feb 2016, arxiv

Tools for Deep Learning

Barrista github
Deep Learning 4 J github
Caffe: Convolutional Architecture for Fast Feature Embedding, jun 2014, github, [arxiv](https://arxiv.org/pdf/1408.5093v1

Tools for Papers

http://www.arxiv-sanity.com/

Data Sets

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition, jul 2016, arxiv
Family in the Wild (FIW): A Large-scale Kinship Recognition Database, apr 2016, arxiv
https://github.com/openimages/dataset
YouTube-8M: A Large-Scale Video Classification Benchmark, sep 2016, Google, arxiv

Uncategorized

https://github.com/blue-yonder/tsfresh

JamesZhuh/deep-learning-papers

deep-learning-papers

Object Recognition

Pose Estimation

Face Recognition

Style Transfer

Logo Recognition

Text (in the Wild) Recognition

ConvNet visualization

Image Segmentation

Pedestrian Detection

Super Resolution

Automated Theorem Proving

Reverse Engineering

Language

Other

Architecture and optimization

Tools for Deep Learning

Tools for Papers

Data Sets

Uncategorized