thermometer encoding one hot way to resist adversarial
play

Thermometer Encoding: One Hot Way to Resist Adversarial Examples - PowerPoint PPT Presentation

Thermometer Encoding: One Hot Way to Resist Adversarial Examples Stanford, 2017-11-16 Aurko Roy* Colin Ra ff el Jacob Ian Buckman* Goodfellow *joint first author Adversarial Examples Adversarial Definitely Probably panda perturbation


  1. Thermometer Encoding: One Hot Way to Resist Adversarial Examples Stanford, 2017-11-16 Aurko Roy* Colin Ra ff el Jacob Ian Buckman* Goodfellow *joint first author

  2. Adversarial Examples Adversarial Definitely Probably panda perturbation gibbon Image from “Explaining and Harnessing Adversarial Examples”, Goodfellow et al, 2014 (Goodfellow 2017)

  3. Unreasonable Linear Extrapolation Argument to softmax Plot from “Explaining and Harnessing Adversarial Examples”, Goodfellow et al, 2014 (Goodfellow 2017)

  4. Di ffi cult to train extremely nonlinear hidden layers To train: changing this weight needs to have a large, predictable e ff ect To defend: changing this input needs to have a small or unpredictable e ff ect (Goodfellow 2017)

  5. Idea: edit only the input layer Train only this part DEFENSE (Goodfellow 2017)

  6. (Goodfellow 2017)

  7. Observation: PixelRNN shows one-hot codes work Plot from “Pixel Recurrent Neural Networks”, van den Oord et al, 2016 (Goodfellow 2017)

  8. (Goodfellow 2017)

  9. Fast Improvement Early in Learning (Goodfellow 2017)

  10. Large improvements on SVHN white box attacks 5 years ago, this would have been SOTA on clean data (Goodfellow 2017)

  11. Large Improvements against CIFAR-10 white box attacks 6 years ago, this would have been SOTA on clean data (Goodfellow 2017)

  12. Other results • Improvement on CIFAR-100 • (Still very broken) • Improvement on MNIST • Please quit caring about MNIST (Goodfellow 2017)

  13. Caveats • Slight drop in accuracy on clean examples • Only small improvement on black-box adversarial examples (Goodfellow 2017)

  14. Get involved! https://github.com/tensorflow/cleverhans (Goodfellow 2017)

  15. g.co/airesidency (Goodfellow 2017)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend