DATA ANALYTICS USING DEEP LEARNING
GT 8803 // FALL 2019 // JOY ARULRAJ
L E C T U R E # 2 0 : A D V E R S A R I A L T R A I N I N G
DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2019 // JOY - - PowerPoint PPT Presentation
DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2019 // JOY ARULRAJ L E C T U R E # 2 0 : A D V E R S A R I A L T R A I N I N G administrivia Reminders Best project prize Quiz cancelled Guest lecture GT 8803 // Fall 2019
L E C T U R E # 2 0 : A D V E R S A R I A L T R A I N I N G
GT 8803 // Fall 2019
2
GT 8803 // Fall 2019
3
GT 8803 // Fall 2019
4
GT 8803 // Fall 2019
5
GT 8803 // Fall 2019 6 ...solving CAPTCHAS and reading addresses...
...recognizing
and faces….
(Szegedy et al, 2014) (Goodfellow et al, 2013) (Taigmen et al, 2013) (Goodfellow et al, 2013)
and other tasks... Since 2013, deep neural networks have matched human performance at...
GT 8803 // Fall 2019
7
Timeline: “Adversarial Classification” Dalvi et al 2004: fool spam filter “Evasion Attacks Against Machine Learning at Test Time” Biggio 2013: fool neural nets Szegedy et al 2013: fool ImageNet classifiers imperceptibly Goodfellow et al 2014: cheap, closed form attack
GT 8803 // Fall 2019
8
GT 8803 // Fall 2019
9
GT 8803 // Fall 2019
10
GT 8803 // Fall 2019
11
GT 8803 // Fall 2019
12
GT 8803 // Fall 2019
13
Rectified linear unit Carefully tuned sigmoid Maxout LSTM
GT 8803 // Fall 2019
14
GT 8803 // Fall 2019
15
Clean example Perturbation Corrupted example All three perturbations have L2 norm 3.96 This is actually small. We typically use 7! Perturbation changes the true class Random perturbation does not change the class Perturbation changes the input to “rubbish class”
GT 8803 // Fall 2019
16
GT 8803 // Fall 2019
17
(collaboration with David Warde-Farley and Nicolas Papernot)
GT 8803 // Fall 2019
18
GT 8803 // Fall 2019
19
(collaboration with David Warde-Farley and Nicolas Papernot)
GT 8803 // Fall 2019
20
GT 8803 // Fall 2019
21
(“Clever Hans, Clever Algorithms,” Bob Sturm)
GT 8803 // Fall 2019
22
GT 8803 // Fall 2019
23
(Huang et al., 2017)
GT 8803 // Fall 2019
24
Weights Signs of weights Clean examples Adversarial examples
GT 8803 // Fall 2019
25
(Andrej Karpathy, “Breaking Linear Classifiers on ImageNet”)
GT 8803 // Fall 2019
26
GT 8803 // Fall 2019
27
GT 8803 // Fall 2019
28
(Papernot 2016)
GT 8803 // Fall 2019
29
Train your own model Target model with unknown weights, machine learning algorithm, training set; maybe non-differentiable Substitute model mimicking target model with known, differentiable function Adversarial examples Adversarial crafting against substitute Deploy adversarial examples against the target; transferability property results in them succeeding
GT 8803 // Fall 2019 30
(Papernot 2016)
GT 8803 // Fall 2019
31
(Liu et al, 2016)
GT 8803 // Fall 2019
32
(Pinna and Gregory, 2002)
These are concentric circles, not intertwined spirals.
GT 8803 // Fall 2019
33
GT 8803 // Fall 2019
34
GT 8803 // Fall 2019
35
GT 8803 // Fall 2019
36
GT 8803 // Fall 2019
37
Neural nets can represent either function: Maximum likelihood doesn’t cause them to learn the right function. But we can fix that...
GT 8803 // Fall 2019
38
GT 8803 // Fall 2019
39
GT 8803 // Fall 2019
40
GT 8803 // Fall 2019
41
GT 8803 // Fall 2019
42
Labeled as bird Decrease probability
Still has same label (bird)
GT 8803 // Fall 2019
43
Unlabeled; model guesses it’s probably a bird, maybe a plane Adversarial perturbation intended to change the guess New guess should match old guess (probably bird, maybe plane)
GT 8803 // Fall 2019
44
7.70 7.20 7.40 7.12 7.05 6.97 6.68
6.00 6.50 7.00 7.50 8.00
Earlier SOTA Our baseline Virtual Adversarial Both + bidirectional model
Zoomed in for legibility
GT 8803 // Fall 2019 45
Training data Extrapolation Make new inventions by finding input that maximizes model’s predicted performance
GT 8803 // Fall 2019 46
Open-source library available at: https://github.com/openai/cleverhans Built on top of TensorFlow (Theano support anticipated) Standard implementation of attacks, for adversarial training and reproducible benchmarks
GT 8803 // Fall 2019
47