Thermometer Encoding: One Hot Way to Resist Adversarial Examples
Stanford, 2017-11-16
Jacob Buckman* Aurko Roy* Colin Raffel Ian Goodfellow *joint first author
Thermometer Encoding: One Hot Way to Resist Adversarial Examples - - PowerPoint PPT Presentation
Thermometer Encoding: One Hot Way to Resist Adversarial Examples Stanford, 2017-11-16 Aurko Roy* Colin Ra ff el Jacob Ian Buckman* Goodfellow *joint first author Adversarial Examples Adversarial Definitely Probably panda perturbation
Stanford, 2017-11-16
Jacob Buckman* Aurko Roy* Colin Raffel Ian Goodfellow *joint first author
(Goodfellow 2017)
Probably panda Adversarial perturbation Definitely gibbon
Image from “Explaining and Harnessing Adversarial Examples”, Goodfellow et al, 2014
(Goodfellow 2017)
Argument to softmax
Plot from “Explaining and Harnessing Adversarial Examples”, Goodfellow et al, 2014
(Goodfellow 2017)
To train: changing this weight needs to have a large, predictable effect To defend: changing this input needs to have a small or unpredictable effect
(Goodfellow 2017)
DEFENSE
Train
this part
(Goodfellow 2017)
(Goodfellow 2017)
Plot from “Pixel Recurrent Neural Networks”, van den Oord et al, 2016
(Goodfellow 2017)
(Goodfellow 2017)
(Goodfellow 2017)
5 years ago, this would have been SOTA
(Goodfellow 2017)
6 years ago, this would have been SOTA
(Goodfellow 2017)
(Goodfellow 2017)
examples
(Goodfellow 2017)
https://github.com/tensorflow/cleverhans
(Goodfellow 2017)
g.co/airesidency