Certified Robustness to Adversarial Examples with Differential Privacy
Mathias Lécuyer, Vaggelis Atlidakis, Roxana Geambasu, Daniel Hsu, Suman Jana Columbia University
Code: https://github.com/columbia/pixeldp Contact: mathias@cs.columbia.edu
Certified Robustness to Adversarial Examples with Di ff erential - - PowerPoint PPT Presentation
Certified Robustness to Adversarial Examples with Di ff erential Privacy Mathias Lcuyer, Vaggelis Atlidakis, Roxana Geambasu, Daniel Hsu, Suman Jana Columbia University Code: https://github.com/columbia/pixeldp Contact:
Code: https://github.com/columbia/pixeldp Contact: mathias@cs.columbia.edu
2
By Sopan Deb, Natasha Singer - Dec. 13, 2018
3
ticket 2 ticket 3 ticket 1 no ticket … … …
input x layer 1 layer 2 layer 3 softmax 0.1 0.2 0.1 0.6
4
… … …
0.1 0.2 0.1 0.6 input x layer 1 layer 2 layer 3 softmax
argmax
5
… … …
0.1 0.2 0.1 0.6 input x layer 1 layer 2 layer 3 softmax
argmax 0.1 0.7 0.1 0.1
0.25 0.5 0.75 1
0.5 1 1.5 2 2.5 3
2
2
7
8
9
10
11
12
… … …
0.1 0.6 0.1 0.2 input x layer 1 layer 2 layer 3 softmax
0.1 0.7 0.1 0.1 argmax
2
=
13
… … …
0.1 0.6 0.1 0.2 input x layer 1 layer 2 layer 3 softmax
argmax
2
= 0.1 0.7 0.1 0.1
14
15
… … …
0.1 0.2 0.1 0.6 input x layer 1 layer 2 layer 3 softmax argmax
argmax 0.1 0.2 0.1 0.6
16
… … …
0.1 0.2 0.1 0.6 input x layer 1 layer 2 layer 3 softmax
argmax 0.1 0.2 0.1 0.6
17
… … …
0.1 0.2 0.1 0.6 input x layer 1 layer 2 layer 3 softmax
18
19
… …
input x layer 1 noise layer
+
layer 2 layer 3 softmax
…
0.2 0.1 0.1 0.6
… …
input x layer 1 noise layer
+
20
layer 2 layer 3 softmax
…
0.2 0.1 0.1 0.6
… …
input x layer 1 noise layer
+
21
layer 2 layer 3 softmax
…
0.2 0.1 0.1 0.6
… …
input x layer 1 0.1 0.2 0.1 0.6
layer 2 layer 3 softmax noise layer
+ …
0.2 0.1 0.1 0.6
22
… …
input x layer 1 0.1 0.2 0.1 0.6
…
0.2 0.1 0.1 0.6 layer 2 layer 3 softmax noise layer
+
23
… …
input x layer 1 0.1 0.2 0.1 0.6
…
0.2 0.1 0.1 0.6 layer 2 layer 3 softmax noise layer
+
24
25
26
… … …
input x
…
noise layer
+
input x
27
28
… …
… …
+
PixelDP auto-encoder
29
Dataset Image size Number of Classes ImageNet 299x299x3 1000 CIFAR-100 32x32x3 100 CIFAR-10 32x32x3 10 SVHN 32x32x3 10 MNIST 28x28x1 10 Dataset Number of Layers Number of Parameters Inception-v3 48 23M Wide ResNet 28 36M CNN 3 3M
Model Accuracy (%) Guaranteed accuracy (%) 0.05 0.1 0.2 Baseline 78
68 63 PixelDP: L=0.75 58 53 49 40
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.2 0.4 0.6 0.8 1 1.2 1.4
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.2 0.4 0.6 0.8 1 1.2 1.4
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.2 0.4 0.6 0.8 1 1.2 1.4
37
38
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.2 0.4 0.6 0.8 1 1.2 1.4
+ Provable guarantees:
Wong+ '18, Raghunathan+ '18, Wang+ '18].
computation [Wong-Kolter+ '18, Wong+ '18, Wang+ '18].
[Raghunathan+ '18].
guarantees [Sinha+ '17].
'18].
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5
2
2
46
47
48