Existing compression techniques


  1. Network quantization

  1. Network prunning

  1. Dark Knowledge

Tutorials:

- http://machinethink.net/blog/compressing-deep-neural-nets/

Interesting Papers:

- Deep Model Compression: https://arxiv.org/pdf/1610.09650.pdf
- Efficient Inference Engine on Compressed Deep Neural Network: https://arxiv.org/pdf/1602.01528.pdf

Interesting Codes:

- Distillation: https://github.com/iRapha/replayed_distillation (Very recent, check later, paper not released yet)