Improved Network Robustness with Adversary Critic

Alexander Matyasko, Lap-Pui Chau, Improved Network Robustness with Adversary Critic. Advances in Neural Information Processing Systems (NIPS), 2018.

Ideally, what confuses neural network should be confusing to humans. However, recent experiments have shown that small, imperceptible perturbations can change the network prediction. To address this gap in perception, we propose a novel approach for learning robust classifier. Our main idea is: adversarial examples for the robust classifier should be indistinguishable from the regular data of the adversarial target. We formulate a problem of learning robust classifier in the framework of Generative Adversarial Networks (GAN), where the adversarial attack on classifier acts as a generator, and the critic network learns to distinguish between regular and adversarial images. The classifier cost is augmented with the objective that its adversarial examples should confuse the adversary critic. To improve the stability of the adversarial mapping, we introduce adversarial cycle-consistency constraint which ensures that the adversarial mapping of the adversarial examples is close to the original. In the experiments, we show the effectiveness of our defense. Our method surpasses in terms of robustness networks trained with adversarial training. Additionally, we verify in the experiments with human annotators on MTurk that adversarial examples are indeed visually confusing.

@inproceedings{matyasko2018adversarycritic,
    title = {Improved Network Robustness with Adversary Critic},
    author = {Matyasko, Alexander and Chau, Lap-Pui},
    booktitle = {NIPS},
    year = 2018
}

Requirements (tested with python 3.6)

Tensorflow v1.9.0
Pytorch (used to save images in the grid)
Scikit-image

Training

python generate_script.py --train=True | bash

Testing

python generate_script.py --train=False --carlini=False | bash
python generate_script.py --train=False --carlini=True | bash

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
carlini		carlini
data		data
.gitignore		.gitignore
README.md		README.md
attack.py		attack.py
generate_script.py		generate_script.py
models.py		models.py
poster.pdf		poster.pdf
test.py		test.py
test_carlini.py		test_carlini.py
train_critic.py		train_critic.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

carlini

carlini

data

data

.gitignore

.gitignore

README.md

README.md

attack.py

attack.py

generate_script.py

generate_script.py

models.py

models.py

poster.pdf

poster.pdf

test.py

test.py

test_carlini.py

test_carlini.py

train_critic.py

train_critic.py

utils.py

utils.py

Repository files navigation

Improved Network Robustness with Adversary Critic

Requirements (tested with python 3.6)

Training

Testing

About

Releases

Packages

Languages

aam-at/adversary_critic

Folders and files

Latest commit

History

Repository files navigation

Improved Network Robustness with Adversary Critic

Requirements (tested with python 3.6)

Training

Testing

About

Resources

Stars

Watchers

Forks

Languages