Enhancing Privacy in Machine Learning
Mathias Humbert
INSA Toulouse/CNRS Toulouse, January 22, 2019
Enhancing Privacy in Machine Learning Mathias Humbert INSA - - PowerPoint PPT Presentation
Enhancing Privacy in Machine Learning Mathias Humbert INSA Toulouse/CNRS Toulouse, January 22, 2019 Enhancing Privacy in Machine Learning data ML What ML? What data? What threat? Mathias Humbert - Enhancing Privacy in Machine Learning
INSA Toulouse/CNRS Toulouse, January 22, 2019
Mathias Humbert - Enhancing Privacy in Machine Learning
2
Mathias Humbert - Enhancing Privacy in Machine Learning
3
Mathias Humbert - Enhancing Privacy in Machine Learning
4
Mathias Humbert - Enhancing Privacy in Machine Learning
5
Mathias Humbert - Enhancing Privacy in Machine Learning
6
Mathias Humbert - Enhancing Privacy in Machine Learning
7
Mathias Humbert - Enhancing Privacy in Machine Learning
8
Mathias Humbert - Enhancing Privacy in Machine Learning
9
Mathias Humbert - Enhancing Privacy in Machine Learning
10
Mathias Humbert - Enhancing Privacy in Machine Learning
11
k
k
Mathias Humbert - Enhancing Privacy in Machine Learning
12
rt1
k
rt2
i
{rt1
i }n i=1
{rt2
i }n i=1
rt2
i − ¯
rt1
k
σ n
i=1
σ(i) − ¯
i
Mathias Humbert - Enhancing Privacy in Machine Learning
13
rt1
k
rt2
i
{rt1
i }n i=1
{rt2
i }n i=1
rt2
i − ¯
rt1
k
σ n
i=1
σ(i) − ¯
i
Mathias Humbert - Enhancing Privacy in Machine Learning
14
Mathias Humbert - Enhancing Privacy in Machine Learning
15
Mathias Humbert - Enhancing Privacy in Machine Learning
16 number of PCA dimensions number of PCA dimensions
90% 48% 55% 29%
Mathias Humbert - Enhancing Privacy in Machine Learning
17
Mathias Humbert - Enhancing Privacy in Machine Learning
18
Mathias Humbert - Enhancing Privacy in Machine Learning
19
[1] Chatzikokolakis et al. Broadening the scope of differential privacy using metrics, PETS, 2013
Mathias Humbert - Enhancing Privacy in Machine Learning
20
Mathias Humbert - Enhancing Privacy in Machine Learning
21
Mathias Humbert - Enhancing Privacy in Machine Learning
22
Mathias Humbert - Enhancing Privacy in Machine Learning
23
<80% <100 miRNAs
Mathias Humbert - Enhancing Privacy in Machine Learning
24
Mathias Humbert - Enhancing Privacy in Machine Learning
25
Mathias Humbert - Enhancing Privacy in Machine Learning
26
Mathias Humbert - Enhancing Privacy in Machine Learning
27
Trade-off at 7 miRNAs Attack success decreased (relative to all) by 54% SVM accuracy decreased (relative to max) by only 1%
Mathias Humbert - Enhancing Privacy in Machine Learning
28
Mathias Humbert - Enhancing Privacy in Machine Learning
29
Mathias Humbert - Enhancing Privacy in Machine Learning
30
Mathias Humbert - Enhancing Privacy in Machine Learning
31
Mathias Humbert - Enhancing Privacy in Machine Learning
32
Mathias Humbert - Enhancing Privacy in Machine Learning
33
Mathias Humbert - Enhancing Privacy in Machine Learning
34
96,9%
Mathias Humbert - Enhancing Privacy in Machine Learning
35
Mathias Humbert - Enhancing Privacy in Machine Learning
36
[2] Gymrek et al., Identifying personal genomes by surname inference, Science, 2013 [3] Humbert et al., De-anonymizing genomic databases using phenotypic traits, PoPETS, 2015
Mathias Humbert - Enhancing Privacy in Machine Learning
37
j = gi j | M i j) =
j | Gi j = gi j) Pr(Gi j = gi j)
gi
j p(M i
j | Gi j = gi j) Pr(Gi j = gi j)
Mathias Humbert - Enhancing Privacy in Machine Learning
38
Mathias Humbert - Enhancing Privacy in Machine Learning
39
[4] Danielsson et al. MethPed: A DNA methylation classifier tool for the identification of pediatric brain tumor subtypes, Clinical Epigenetics, 2015
Mathias Humbert - Enhancing Privacy in Machine Learning
40
Mathias Humbert - Enhancing Privacy in Machine Learning
41
Mathias Humbert - Enhancing Privacy in Machine Learning
42
Objective: determine if v is part of the training dataset by using the
having access to a data sample
20 40 60 80 cat dog panda
Mathias Humbert - Enhancing Privacy in Machine Learning
43
Target Model
40 80 cat panda 20 40 cat panda
In or not in Same Distribution
40 80 cat panda
Shadow Models
. . .
Shadow Models Attack Models
. . .
Attack Models
Ground Truth? Target Dataset Local Dataset Multiple Attack Models Multiple Shadow Models
Shokri et al., Membership Inference Attacks against Machine Learning Models. IEEE S&P, 2017
Mathias Humbert - Enhancing Privacy in Machine Learning
44
Mathias Humbert - Enhancing Privacy in Machine Learning
45 Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.0 0.2 0.4 0.6 0.8 1.0
Precision
Shokri et al. Our approach
Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.0 0.2 0.4 0.6 0.8 1.0
Recall
Shokri et al. Our approach
95,95 94,95 88,89 83,85
Mathias Humbert - Enhancing Privacy in Machine Learning
46
Mathias Humbert - Enhancing Privacy in Machine Learning
47
A d u l t C I F A R
C I F A R
F a c e L
a t i
M N I S T N e w s P u r c h a s e
P u r c h a s e
P u r c h a s e
P u r c h a s e
P u r c h a s e
Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.50 0.25 0.75 0.87 0.25 0.25 0.78 0.25 0.24 0.25 0.77 0.82 0.50 0.87 0.90 0.85 0.65 0.74 0.92 0.77 0.79 0.80 0.78 0.82 0.50 0.83 0.95 0.87 0.75 0.75 0.89 0.77 0.78 0.79 0.83 0.87 0.50 0.83 0.95 0.88 0.79 0.75 0.88 0.77 0.78 0.79 0.82 0.87 0.50 0.81 0.92 0.83 0.88 0.75 0.85 0.76 0.77 0.78 0.80 0.83 0.50 0.86 0.72 0.55 0.68 0.65 0.92 0.54 0.51 0.54 0.84 0.67 0.50 0.84 0.95 0.87 0.77 0.75 0.88 0.77 0.78 0.79 0.83 0.88 0.50 0.87 0.88 0.80 0.65 0.71 0.90 0.73 0.77 0.60 0.73 0.73 0.50 0.87 0.84 0.77 0.66 0.73 0.93 0.71 0.77 0.75 0.78 0.86 0.50 0.87 0.89 0.84 0.66 0.74 0.92 0.76 0.79 0.80 0.82 0.83 0.50 0.86 0.93 0.87 0.67 0.75 0.92 0.77 0.79 0.81 0.85 0.86 0.50 0.85 0.95 0.88 0.69 0.75 0.91 0.77 0.79 0.80 0.84 0.89 0.0 0.2 0.4 0.6 0.8 1.0 A d u l t C I F A R
C I F A R
F a c e L
a t i
M N I S T N e w s P u r c h a s e
P u r c h a s e
P u r c h a s e
P u r c h a s e
P u r c h a s e
Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.50 0.50 0.52 0.83 0.50 0.50 0.69 0.50 0.47 0.50 0.57 0.73 0.50 0.82 0.89 0.84 0.54 0.53 0.92 0.59 0.66 0.69 0.76 0.82 0.50 0.75 0.95 0.82 0.72 0.52 0.88 0.57 0.62 0.64 0.73 0.83 0.50 0.75 0.95 0.87 0.78 0.52 0.86 0.56 0.61 0.64 0.73 0.82 0.50 0.68 0.91 0.75 0.86 0.51 0.82 0.54 0.57 0.60 0.66 0.75 0.49 0.84 0.55 0.52 0.51 0.53 0.92 0.53 0.51 0.54 0.79 0.62 0.50 0.76 0.95 0.83 0.74 0.52 0.86 0.57 0.62 0.65 0.74 0.84 0.50 0.82 0.86 0.80 0.54 0.53 0.90 0.59 0.66 0.60 0.73 0.71 0.50 0.84 0.80 0.76 0.55 0.53 0.92 0.59 0.66 0.68 0.76 0.85 0.50 0.83 0.88 0.83 0.53 0.53 0.92 0.59 0.66 0.69 0.78 0.83 0.50 0.81 0.92 0.85 0.57 0.53 0.91 0.59 0.65 0.69 0.78 0.85 0.50 0.79 0.95 0.85 0.61 0.53 0.90 0.58 0.64 0.67 0.77 0.86 0.0 0.2 0.4 0.6 0.8 1.0
Precision Recall 95 95 89 89
Mathias Humbert - Enhancing Privacy in Machine Learning
48
Mathias Humbert - Enhancing Privacy in Machine Learning
49
Mathias Humbert - Enhancing Privacy in Machine Learning
Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.0 0.2 0.4 0.6 0.8 1.0
Recall
Original Dropout Model stacking
50
Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.0 0.2 0.4 0.6 0.8 1.0
Precision
Original Dropout Model stacking Adult CIFAR-10 CIFAR-100 Face Location MNIST News Purchase-2 Purchase-10 Purchase-20 Purchase-50 Purchase-100 0.0 0.2 0.4 0.6 0.8 1.0
Accuracy
Original Dropout Model stacking
ML classifier Inference attack Inference attack
Mathias Humbert - Enhancing Privacy in Machine Learning
51