ML-BackDoor-Detector

We expect there to be two directories where you run the code:

models
data

the expected contents are below. Otherwise our scripts won't run.

├── data 
    └── clean_validation_data.h5 // this is clean data used to evaluate the BadNet and design the backdoor defense
    └── clean_test_data.h5
    └── sunglasses_poisoned_data.h5

├── models
    └── anonymous_bd_net.h5
    └── anonymous_bd_weights.h5
    └── sunglasses_bd_net.h5
    └── sunglasses_bd_weights.h5
    └── multi_trigger_multi_target_bd_net.h5
    └── multi_trigger_multi_target_bd_weights.h5

I. Dependencies

Python 3.6.9
Keras 2.3.1
Numpy 1.16.3
Matplotlib 2.2.2
H5py 2.9.0
TensorFlow-gpu 1.15.2
OpenCV2 4.4

II. Running the code for one image using a script

To run the code depending on the backdoor type:

python3 eval_sunglasses.py <test_image.png>
python3 eval_multi.py <test_image.png>
python3 eval_anonymous.py <test_image.png>

Output should be either 1283 (if test_image.png is poisoned) or one class in range [0, 1282] (if test_image.png is not poisoned).

III. Running Unit Tests

You can run the units by typing:

python test_multi.py

python test_anonymous.py

python test_glasses.py

This will give output as follows:

***Finish***
***Initialzation finish***
**************************************************************
***Start bad net prediction***
***Predict finish***
***Start novelty detector prediction***
***Predict finish***
***Start new decision function prediction***
***Predict finish***
**************************END*********************************
Backdoor attack Success Rate for Sunglasses poisoned data: 0.015588464536236302
.**************************************************************
***Start bad net prediction***
***Predict finish***
***Start novelty detector prediction***
***Predict finish***
***Start new decision function prediction***
***Predict finish***
**************************END*********************************
Clean data accuracy: 81.46531566640685
.
----------------------------------------------------------------------
Ran 2 tests in 137.882s

IV. Running the code in Colab

To run all the code do the following:

Create the directories for the file either in google drive or locally
open the notebook in colab or jupyter
run from beginning to end.

To run just parts here's the python code

  # First load the model and weights, this also gets the model cleaning the data
  bad_net_cleaner = BadNetCleaner('models/sunglasses_bd_net.h5','models/sunglasses_bd_weights.h5')

  # Load the data
  x_test, y_test = data_loader('data/clean_test_data.h5')

  # Get the label from both the Novelty Detector and the New Decision Function
  y_hat_2 = bad_net_cleaner.predict_label(x_test) # x_test : image data X, MUST NOT /255!
  class_accu_2 = np.mean(np.equal(y_hat_2, y_test))*100
  print('Classification accuracy:', class_accu_2)

GusSand/ML-BackDoor-Detector

ML-BackDoor-Detector

I. Dependencies

II. Running the code for one image using a script

III. Running Unit Tests

IV. Running the code in Colab