Skip to content

Evaluation of various defence mechanisms and various UAPs. Done as a part of GD-UAP.

Notifications You must be signed in to change notification settings

BardOfCodes/defence_against_the_dark_arts

Repository files navigation

Defence Against The Dark Arts

A repository containing evaluation of various universal adversarial perturbations against various defense mechanisms (part of our paper Generalizable Data-free Objective for Crafting Universal Adversarial Perturbations).

UAPs are quite a big threat to anyone using deep learning, be it muggles or wizards. They are indeed the perfect curse that someone might use against your Deep model. In this repository we evaluated some of the recent defence techniques for various UAPs.

The Attacks

The following 3 Universal Adversarial attacks were evaluated:

  1. Universal Adversarial Perturbations: Paper.

  2. Generalizable Objective for Universal Adversarial Perturbations: Paper.

  3. Network for Adversarial generation: Paper.

I Look forward to anyone willing to contribute more perturbations to test ( UAPs specifically).

The Defenses

The following defenses were evaluated:

  1. Prediction using multiple crops of input (Not implemented in this repo.)

  2. Gaussian smoothing, Median smoothing, and Bilateral Filtering.

  3. JPEG Compression: Paper

  4. BIT compression: Paper

  5. TV-minimization: Paper

  6. Image-quilting: Paper

  7. Perturbation rectification network: Paper

Of course, you can contribute your defenses.

Note: Quilting is still to be properly integrated. Till then, the code provide by the authors can be used here. For Perturbation Rectification Network, code provided by the authors here can be used.

Note: The numbers reported in this are from using the various author provided code only(For defence 5, 6, and 7).

Instructions

  1. Firstly, there is a long list of things to be installed, (specially for tvm and quilting). Instead of paraphrasing it here, I would recommend the user follow the instructions given by the authors here.

  2. After installation, download the weights for the networks,

# uncomment as required in the sourced file
cd weights
source download_weights.sh
  1. Now, download the perturbations.
  • GD-UAP: Link
  • NAG: UAP can be generated from here (link to crafted perturbations will be added soon.)
  • UAP: Link. Note: The UAP provided in this repository is for a different googlenet instance.
  1. Evaluating the defence use:
python evaluate.py --network googlenet --adv_im perturbations/GD_UAP_perts/best_fool_rate_googlenet_with_data_sat_diff_reg_0.0.npy --img_list utils/ilsvrc_test.txt --gt_labels utils/ilsvrc_test_gt.txt --batch_size 10 --defence tvm

The various defences can be used by changing the arguement for defence to Gaussian, Median, Bilateral, Bit_Compression, JPEG, tvm and quilting. Each defence can be configured in the defence_config.py file. (Look at tensorflow-classification/misc/utils.py for closer look at the defence code.)

  1. For quilting you need to create patches. This can be done using the following code:
# all the parameters are specified in defence_config.py file
python index_patches.py

TODO

  • Finish Quilting
  • Add the perturbations from NAG and UAP
  • Add results from paper.

Releases

No releases published

Packages

No packages published

Languages