A curated list of awesome AI and embedded/mobile-device resources, tools, and more.
Looking for contributors. Submit a pull request if you have something to add :)
Please check the contribution guidelines for info on formatting and writing pull requests.
- [1512.03385] Deep Residual Learning for Image Recognition
- [1610.02357] Xception: Deep Learning with Depthwise Separable Convolutions
- [1611.05431] ResNeXt: Aggregated Residual Transformations for Deep Neural Networks
- [1707.01209] Model compression as constrained optimization, with application to neural nets. Part I: general framework
- [1707.04319] Model compression as constrained optimization, with application to neural nets. Part II: quantization
- [SenSys ’16] Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables
- [IoT-App ’15] An Early Resource Characterization of Deep Learning on Wearables, Smartphones and Internet-of-Things Devices
- [1707.06342] ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
- [1707.01083] ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
- [1704.04861] MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
- [1706.03912] SEP-Nets: Small and Effective Pattern Networks
- [1707.04693] Binarized Convolutional Neural Networks with Separable Filters for Efficient Hardware Acceleration
- [1602.02830] Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1
- [1603.05279] XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
- [1606.06160] DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
- [CVPR'17] Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning
- [ICLR'17] Pruning Filters for Efficient ConvNets
- [ICLR'17] Pruning Convolutional Neural Networks for Resource Efficient Inference
- [ICLR'17] Soft Weight-Sharing for Neural Network Compression
- [ICLR'16] Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
- [NIPS'16] Dynamic Network Surgery for Efficient DNNs
- [NIPS'15] Learning both Weights and Connections for Efficient Neural Networks
- [ICML'17] The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning
- [1412.6115] Compressing Deep Convolutional Networks using Vector Quantization
- [CVPR '16] Quantized Convolutional Neural Networks for Mobile Devices
- [ICASSP'16] Fixed-Point Performance Analysis of Recurrent Neural Networks
- [arXiv'16] Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
- [ICLR'17] Loss-aware Binarization of Deep Networks
- [ICLR'17] Towards the Limit of Network Quantization
- [CVPR'17] Deep Learning with Low Precision by Half-wave Gaussian Quantization
- [1706.02393] ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
- [CVPR'15] Efficient and Accurate Approximations of Nonlinear Convolutional Networks
- [1511.06067] Convolutional neural networks with low-rank regularization
- [NIPS'14] Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation
- [ICLR'16] Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
- [1503.02531] Distilling the Knowledge in a Neural Network
- [AAAI'16] Face Model Compression by Distilling Knowledge from Neurons
- [1605.04614] DeepLearningKit - a GPU-Optimized Deep Learning Framework for Apple's iOS, OS X and tvOS developed in Metal and Swift
- [MobiSys '17] DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications
- [MobiSys '17] DeepEye: Resource Efficient Local Execution of Multiple Deep Vision Models using Wearable Commodity Hardware
- [EMDL '17] MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
- [WearSys '16] DeepSense: A GPU-based deep convolutional neural network framework on commodity mobile devices
- [IPSN '16] DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices
- [ISCA '16] EIE: Efficient Inference Engine on Compressed Deep Neural Network
- [MobiSys '16] MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints
- [MobiCASE '16] DXTK: Enabling Resource-efficient Deep Learning on Mobile and Embedded Devices with the DeepX Toolkit
- [MM '16] CNNdroid: GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android
- harvardnlp/nmt-android: Neural Machine Translation on Android
- TensorFlow Android Camera Demo
- KleinYuan/Caffe2-iOS: Caffe2 on iOS Real-time Demo. Test with Your Own Model and Photos.
- MXNet Android Classification App - Image classification on Android with MXNet.
- bwasti/AICamera: Demonstration of using Caffe2 inside an Android application.
- mtmd/Mobile_ConvNet: RenderScript based implementation of Convolutional Neural Networks for Android phones
- MXNet iOS Classification App - Image classification on iOS with MXNet.
- Compile MXNet on Xcode (in Chinese) - a step-by-step tutorial on compiling MXNet on Xcode for iOS apps
- KimDarren/FaceCropper: Crop faces inside your image with the iOS 11 Vision API.
- hollance/TensorFlow-iOS-Example: Source code for my blog post "Getting started with TensorFlow on iOS"
- SaschaWillems/Vulkan: Examples and demos for the new Vulkan API
- ARM-software/vulkan-sdk: ARM Vulkan SDK
- alexhultman/libvc: Vulkan Compute for C++ (experimentation project)
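Several of the papers above (Xception, MobileNets, ShuffleNet) build on depthwise separable convolutions. A minimal sketch of the parameter savings, with layer shapes chosen purely for illustration:

```python
# Illustrative layer shape (not from any specific paper):
# 3x3 kernels, 32 input channels, 64 output channels.
c_in, c_out, k = 32, 64, 3

# Standard convolution: one k x k x c_in filter per output channel.
standard_params = k * k * c_in * c_out

# Depthwise separable convolution: a k x k depthwise filter per
# input channel, followed by a 1x1 pointwise convolution that
# mixes channels.
separable_params = k * k * c_in + c_in * c_out

print(standard_params, separable_params)
print(f"{standard_params / separable_params:.1f}x fewer parameters")
```

The same arithmetic applies to multiply-accumulate counts, which is why these architectures target mobile and embedded inference.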
General frameworks include both the inference and backprop (training) stages; inference frameworks support the inference stage only.
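The distinction can be sketched with a toy linear model (hypothetical shapes, NumPy only): an inference framework needs just the forward pass, while a general framework must also compute gradients and update weights:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))         # toy weight matrix
x = rng.normal(size=3)              # one input sample
t = np.array([0.0, 1.0, 0.0, 0.0])  # target output

# Inference stage: forward pass only.
y = W @ x

# Backprop stage (general frameworks only): gradient of a squared
# error loss with respect to the weights, plus one SGD update.
loss = 0.5 * np.sum((y - t) ** 2)
grad_W = np.outer(y - t, x)         # dL/dW for the loss above
W = W - 0.1 * grad_W
```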
- Deep Learning in a Single File for Smart Devices — mxnet
- ARM-software/ComputeLibrary: The ARM Computer Vision and Machine Learning library, a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies
- Apple CoreML
- Microsoft Embedded Learning Library
- mil-tokyo/webdnn: Fastest DNN Execution Framework on Web Browser
- jiaxiang-wu/quantized-cnn: An efficient framework for convolutional neural networks
- Tencent/ncnn: ncnn is a high-performance neural network inference framework optimized for the mobile platform
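Mobile inference frameworks such as ncnn and quantized-cnn lean heavily on low-precision arithmetic. A minimal sketch of symmetric per-tensor int8 weight quantization (a simplified scheme for illustration, not any framework's exact implementation):

```python
import numpy as np

def quantize_int8(w):
    """Map float weights to int8 codes with a single per-tensor scale."""
    scale = np.abs(w).max() / 127.0  # largest magnitude maps to +/-127
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 codes."""
    return q.astype(np.float32) * scale

w = np.array([0.4, -1.0, 0.25, 0.0], dtype=np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print(q)      # int8 codes
print(w_hat)  # close to the original weights
```

Int8 storage quarters the model size versus float32, and the integer codes map directly onto the SIMD multiply-accumulate instructions these frameworks exploit.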
Model converters. For more converters, see deep-learning-model-convertor.
This part contains related courses, guides and tutorials.
- Deep learning systems: UW course schedule (focused on systems design, not learning)
- Squeezing Deep Learning Into Mobile Phones
- Deep Learning – Tutorial and Recent Trends
- Efficient Convolutional Neural Network Inference on Mobile GPUs
- ARM® Mali™ GPU OpenCL Developer Guide (HTML, PDF)
- Optimal Compute on ARM® Mali™ GPUs
- GPU Compute for Mobile Devices
- Compute for Mobile Devices (performance-focused)
- Hands On OpenCL
- Adreno OpenCL Programming Guide
- Better OpenCL Performance on Qualcomm Adreno GPU