This repository contains code for the paper "Lossless Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach" (NTK-LC)[^1], submitted to NeurIPS 2022.
We organized the code into three folders: `compression`, `spectral_characteristics`, and `utils`.
- `compression` (experiment 2)
  - `mnist`
    - `mnist_origin.py`: classification performance of the original neural network on MNIST
    - `performance_match.py`: classification performance of the neural network compressed by our NTK-LC on MNIST
  - `cifar10`
    - `vgg_net_cifar10.py`: defines the VGG19 used for CIFAR10
    - `vgg_train.py`: trains the VGG19 defined in `vgg_net_cifar10.py` on CIFAR10 and saves the convolutional-layer parameters, which are later used for feature extraction
    - `performance_origin.py`: classification performance of the original neural network on CIFAR10
    - `performance_two.py`: classification performance of the neural network compressed by our NTK-LC on CIFAR10
- `spectral_characteristics` (experiment 1)
  - `tilde_CK.py`: verifies the consistency of the theoretical and empirical conjugate kernel (CK) spectral distributions (see the sketch after this list)
  - `plot_eigen.py`: plots eigenvalues and eigenvectors
- `utils`: utility code
  - `activation_numpy.py`: activation function definitions (NumPy)
  - `activation_tensor.py`: activation function definitions (PyTorch)
  - `data_prepare.py`: data preparation, including data sampled from MNIST/CIFAR10 and generated GMM data
  - `expect_calculate.py`: expectations computed by numerical integration
  - `expect_calculate_math.py`: expectations computed from analytical expressions
  - `model.py`: model definitions
  - `solve_equation.py`: solves equations to define activation functions
  - `utils.py`: additional utilities
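For intuition, here is a minimal sketch of the kind of check `tilde_CK.py` performs: compute the empirical eigenvalue spectrum of the conjugate kernel of one random layer on synthetic GMM data (as described for `utils/data_prepare.py`), which can then be compared against the theoretical prediction. All names, sizes, and the ReLU activation below are illustrative assumptions, not the repository's actual API.

```python
import numpy as np

n, p, d = 1000, 784, 2000            # samples, input dimension, layer width
rng = np.random.default_rng(0)

# Two-class GMM data (means +/- mu, identity covariance), as an example
# of the synthetic data described for utils/data_prepare.py.
mu = np.zeros(p)
mu[0] = 2.0
labels = rng.choice([-1.0, 1.0], size=n)
X = (rng.standard_normal((p, n)) + np.outer(mu, labels)) / np.sqrt(p)

W = rng.standard_normal((d, p))      # random Gaussian first-layer weights

def sigma(t):
    return np.maximum(t, 0.0)        # ReLU as an example activation

F = sigma(W @ X)                     # post-activation features, shape (d, n)
K = F.T @ F / d                      # empirical conjugate kernel, shape (n, n)

eigvals = np.linalg.eigvalsh(K)      # spectrum to be compared with theory
print(eigvals.min(), eigvals.max())
```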
You can run the following bash command to install the packages used in this repository:

```bash
pip install -r requirements.txt
```

or you can install the following basic packages yourself:
- Python: tested with version 3.8.13
- NumPy and SciPy
- Matplotlib for visualization
- PyTorch: tested with version 1.12.0
- Pandas for recording experimental data
Building upon recent research advances in the neural tangent kernel (NTK) and random matrix theory, we provide a novel compression approach for wide, fully-connected deep neural networks.
The most closely related work is [^2]; the difference is that that work addresses the single-hidden-layer setting, while ours handles multi-layer fully-connected DNNs.
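To illustrate what "multi-layer" means here: the conjugate kernel propagates through depth, with each layer's kernel arising from the previous layer's features via another random-feature map. Below is a minimal Monte-Carlo sketch of this recursion under Gaussian weights; the function name, widths, and ReLU activation are illustrative assumptions, not the paper's exact construction.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(t):
    return np.maximum(t, 0.0)

def deep_ck(X, widths, activation=relu, rng=rng):
    """Monte-Carlo approximation of the multi-layer conjugate kernel.

    X: (p, n) data matrix; widths: hidden-layer widths. Each layer applies
    a random Gaussian weight matrix (scaled by 1/sqrt(fan_in)) followed by
    the activation; the CK after the last layer is F^T F / d_last.
    """
    F = X
    for d in widths:
        W = rng.standard_normal((d, F.shape[0])) / np.sqrt(F.shape[0])
        F = activation(W @ F)
    return F.T @ F / widths[-1]

# Example on synthetic Gaussian data (illustrative sizes).
X = rng.standard_normal((784, 200)) / np.sqrt(784)
K = deep_ck(X, widths=[2000, 2000, 2000])
print(K.shape)  # (200, 200)
```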
Footnotes

[^1]: Gu L, Du Y, Zhang Y, et al. Lossless Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach.
[^2]: Ali H T, Liao Z, Couillet R. Random matrices in service of ML footprint: ternary random features with no performance loss. arXiv preprint arXiv:2110.01899, 2021.