Frequency-response-characterization-of-CNNs

Numerous recent and popular publications on deep learning investigate specific short-comings and biases of CNN models. Often, such short-comings are closely related to the distribution of frequency components in the training data, or the frequency response of the convolutional layers in the network. For example, many adversarial attacks can be classified based on their spectral density, and may be ameliorated by suppressing high frequencies in the input. Similarly, smoothing the output of convolutional layers to remove aliasing effects consistently improves the performance of CNNs.

In this project, we aim to find concrete links between input statistics, learned convolutional filter statistics and classification performance. Specifically, we will first investigate the anatomy of frequency responses in different, existing CNN architectures and identify characteristics which lead to robust and improved classification performance. Driving from previous work we would eventually like to develop a learning scheme based directly on optimizing the frequency response characteristics of different layers, by using convolutional filters which act as bandpass filters.

Structures of the Folders

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Folder Structure Explanation:

Main folder Contains:

  1. CIFAR10
  2. Fashion_MNIST

In CIFAR10 there are various files

a. ResNet 20 model
b. Gaussian Filter with Sigma = 2
c. Gaussian Filter with Sigma = 4
d. Gaussian Filter subtracting from orginal image (sigma = 2)
e. Gaussian Filter subtracting from orginal image (sigma = 4)
f. CIFAR-10 with Fourier Transform and visualizing its frequency of trained model.
Similar is the structure in Fashion_MNIST folder


There is file Get_input_data_ffts_v2 which contains FFT transform for CIFAR10 and MNIST datasets.

Select 0 for getting Fashion_MNIST Fourier Transform.

Select 1 for getting CIFAR10 Fourier Transform.

FFT transform is performed and average is taken for each category with plotting of images.

Experimental Results

In all the obtained validation accuracy and test accuracy the model trained using ResNet18 or ResNet20 model.

Dataset and Variations in Model CIAR-10(Test Accuracy) CIFAR-10(Validation Accuracy) Fashion-MNIST(Test Accuracy) Fashion-MNIST Validation Accuracy
ResNet-18 92.79% 92.16% 92.04% 92.04%
ResNet-18(FFT Transform) 10.29% 9.89% 88.38% 88.38%
ResNet-18(Gaussian Blur Sigma=2) 78.25% 78.25% 91.54% 91.54%
ResNet-18(Gaussian Blur Sigma=4) 59.13% 59.13% 88.87% 88.87%
ResNet-18(Original Image - Gaussian Blur Sigma=2) 10.00% 9.52% 88.38% 88.38%
ResNet-18(Original Image - Gaussian Blur Sigma=4) 10.00% 9.52% 88.38% 88.38%
ResNet-18(Original Image - Gaussian Blur Sigma=12) - - 89.17% 89.31%
ResNet-18(Original Image - Gaussian Blur Sigma=20) - - 89.01 89.01%

In some of the cases accuracy is very low because th input has been passed through different filters like Gaussian filters with high sigma values which kind of Blur the image hence decreasing the features of the input image and thus reducing the accuracy, in last 4 rows of the results original image is subtracted from the image obtained by applying gaussian filters with different sigma values.

For any doubts in this project please feel free to contact me on my mail Id: kakshak@gmail.com