Algorithms for Differential Privacy: Exponential & Median Mechanism
CompSci 590.03 Instructor: Ashwin Machanavajjhala
1 Lecture 7 : 590.03 Fall 12
Algorithms for Differential Privacy: Exponential & Median - - PowerPoint PPT Presentation
Algorithms for Differential Privacy: Exponential & Median Mechanism CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 7 : 590.03 Fall 12 1 Recap: Differential Privacy For every pair of inputs For every output that differ in one
1 Lecture 7 : 590.03 Fall 12
2 Lecture 7 : 590.03 Fall 12
adversary should not be able to distinguish between D1 and D2.
Lecture 7 : 590.03 Fall 12 3
. . . Worst discrepancy in probabilities
Theorem (Composability): If algorithms A1, A2, …, Ak use independent randomness and each Ai satisfies εi-differential privacy, resp. Then, outputting all the answers together satisfies differential privacy with ε = ε1 + ε2 + … + εk
4 Lecture 7 : 590.03 Fall 12
Lecture 7 : 590.03 Fall 12 5
0.2 0.4 0.6
2 4 6 8 10
Laplace Distribution – Lap(λ)
Database
Researcher
Query q
True answer
q(d) q(d) + η η
6 Lecture 7 : 590.03 Fall 12
[Dwork et al., TCC 2006] Thm: If sensitivity of the query is S, then the following guarantees ε- differential privacy.
7 Lecture 7 : 590.03 Fall 12
[Dwork et al., TCC 2006] Smallest number s.t. for any d, d’ differing in one entry, || q(d) – q(d’) || ≤ S(q) Example 2: HISTOGRAM queries
Changing one entry in d from ci to cj
8 Lecture 7 : 590.03 Fall 12
Lecture 7 : 590.03 Fall 12 9
– “what is the most common nationality in this room”: Chinese/Indian/American… – Other examples?
– “Which price would bring the most money from a set of buyers?”
Lecture 7 : 590.03 Fall 12 10
Lecture 7 : 590.03 Fall 12 11
$100 $100 $100 $401
Lecture 7 : 590.03 Fall 12 12
Inputs Outputs
Examples:
possible.
Lecture 7 : 590.03 Fall 12 13
where D, D’ differ in one tuple
Lecture 7 : 590.03 Fall 12 14
Randomly sample an output O from Outputs with probability
Lecture 7 : 590.03 Fall 12 15
Lecture 7 : 590.03 Fall 12 16
best output.
“What is the most common nationality?” w(D,nationality) = # people in D having that nationality Sensitivity of w is 1.
Lecture 7 : 590.03 Fall 12 17
Theorem:
Lecture 7 : 590.03 Fall 12 18
Theorem: Suppose there are 4 nationalities Outputs = {Chinese, Indian, American, Greek} Exponential mechanism will output some nationality that is shared by at least K people with probability 1-e-3(=0.95), where K ≥ OPT – 2(log(4) + 3)/ε = OPT – 6.8/ε
Lecture 7 : 590.03 Fall 12 19
≤ maxD,D’ |f(D) – f(D’)| = sensitivity of f
probability proportional to
Lecture 7 : 590.03 Fall 12 20
Laplace noise with parameter 2Δ/ε
make sense.
the resulting distribution.
mechanism.
– By choosing the appropriate score function.
Lecture 7 : 590.03 Fall 12 21
– Can work well even if output space is exponential in the input
efficient if output space is large.
Lecture 7 : 590.03 Fall 12 22
Lecture 7 : 590.03 Fall 12 23
– Queries may be coming from different researchers – But they may collude …
Lecture 7 : 590.03 Fall 12 24
then we can’t answer any more queries.
reused the same answer for all the remaining queries.
– We can still answer k-1 more queries!
from previous queries?
Lecture 7 : 590.03 Fall 12 25
Lecture 7 : 590.03 Fall 12 26
– When more than half the databases D’ have |qi(D’) – qi(D)| < ε – Then the median of all the answers is close to the true answer ai = qi(D) – But this could leak information … – Solution: Compute a noisy version of …
Lecture 7 : 590.03 Fall 12 27
when range of algorithm is not a real number.
Lecture 7 : 590.03 Fall 12 28
Lecture 7 : 590.03 Fall 12 29