/AdversarialAttackTrafficSystem

Adversarial Attack on Autonomous Vehicles - An Approach To Deceive Traffic Light Detection System

Primary LanguageJupyter Notebook

Adversarial Attack on Autonomous Vehicles - An Approach To Deceive Traffic Light Detection System

Autonomous cars are gradually occupying the streets. In the near future, they will become ubiquitous. Recent events that resulted in the loss of human lives force us to raise questions about their security, primarily concerning their onboard learning algorithms that act as the brains of the car. This work exposes one such vulnerability i.e. street sign misclassified as a traffic light. To the best of our knowledge, this vulnerability has not been explored in the existing literature. We achieve the stated vulnerability using a white box adversarial attack by creating perturbations that can fool a machine learning algorithm to misclassify traffic lights. We consider the ImageNet dataset and our custom-curated dataset to test and train our model and generate adversarial images to expose vulnerabilities in the learning algorithm.

To accomplish the task, we use Inception-v3 model. Specifically, the target image is passed through a Convolutional Neural Network (CNN). The loss will be calculated from the softmax layer. The gradient descent optimizer calculates the perturbation from the loss function. The perturbation is added in the target image and gets evaluated in the CNN again. We evaluate our performance of the generated image using L2 distance and Structural Similarity Index (SSIM).

ML models and Tools:

  • Inception V3
  • Transfer Learning
  • Convolutional Neural Network (CNN)

Dataset used:

  • ImageNet
  • Custom dataset

System Architecture

system_architecture.png

Loss function convergence with training iterations

iteration_vs_loss.png

Classification results of trained model after transfer learning

correct_classification.png

Perturbation, mask generation and adversarial image

perturbation.png

Classification results of previously trained model on adversarial image

faulty_classification.png

Source Code and Data

Project Team and Contributors