Robustness and geometry of deep neural networks
Alhussein Fawzi DeepMind May 23rd 2019 The Mathematics of Deep Learning and Data Science University of Cambridge
1
Robustness and geometry of deep neural networks Alhussein Fawzi - - PowerPoint PPT Presentation
Robustness and geometry of deep neural networks Alhussein Fawzi DeepMind May 23rd 2019 The Mathematics of Deep Learning and Data Science University of Cambridge 1 Recent advances in machine learning Error rate (%) He et., al., Delving
1
He et., al., “Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification” , 2015 Karpathy et., al, “Automated Image Captioning with ConvNets and Recurrent Nets” LeCun et. al.,, “Deep Learning” , 2015
Error rate (%)
DeepMind https://deepmind.com/research/alphago/
2
3
3
3
3
3
3
[Szegedy et. al. ICLR 2014], [Biggio et. al., PKDD 2013], ...
[Fawzi et. al., NIPS 2016], [Franceschi et. al., AISTATS 2018]
[Bruna et. al., TPAMI 2013], [Jaderberg et. al., NIPS 2015] , occlusions [Sharif et. al., CCS 2016] , etc...). 3
4
Class 1 Class 2
4
1 Fooling classifiers is easy: vulnerability to different perturbations. 2 Improving the robustness (i.e., “defending”) is difficult. 3 Geometric analysis of a successful defense: adversarial training. 5
6
6
6
Figure from [Szegedy et. al., ICLR 2014]. 6
Figure from [Szegedy et. al., ICLR 2014].
6
7
Flagpole Joystick Chihuahua L a b r a d
T e r r i e r Balloon
8
9
10
10
10
10
11
Adversarial accuracy (CIFAR-10): 11
Adversarial accuracy (CIFAR-10): [Madry et. al., 2017] 11
Adversarial accuracy (CIFAR-10): [Madry et. al., 2017]
11
Adversarial accuracy (CIFAR-10): [Madry et. al., 2017]
11
Random direction 12
Normal training
Random direction 12
Normal training Adversarial training
Random direction 12
Normal training Adversarial training
Random direction
12
Logit Label
13
Before adv. fine-tuning Logit Label
13
Before adv. fine-tuning After adv. fine-tuning Logit Label
13
14
15
1.5 1.0 0.5 0.0 3000 2500 2000 1500 1000 500
Original Adversarial
Eigenvalue profile
Eigenvalue number Value 15
1.5 1.0 0.5 0.0 3000 2500 2000 1500 1000 500
Original Adversarial
Eigenvalue profile
Eigenvalue number Value 15
16
16
16
16
16
16
Upper bound Lower bound
16
17
17
17
17
17
17
Accuracy on clean samples: Adversarial accuracy: 18
Accuracy on clean samples: Adversarial accuracy: 18
Accuracy on clean samples: Adversarial accuracy: 18
Accuracy on clean samples: Adversarial accuracy: 18
Accuracy on clean samples: Adversarial accuracy:
18
19
20
20
21
Decision boundary
r∗(x) x
22
Decision boundary
r∗(x) x
22
Decision boundary
r∗(x) x
22
Decision boundary
r∗(x) x
22
Decision boundary
r∗(x) x √ d r∗(x) 2
22
23
24
25