Post-processing outputs for better utility
CompSci 590.03 Instructor: Ashwin Machanavajjhala
1 Lecture 10 : 590.03 Fall 12
Post-processing outputs for better utility CompSci 590.03 - - PowerPoint PPT Presentation
Post-processing outputs for better utility CompSci 590.03 Instructor: Ashwin Machanavajjhala Lecture 10 : 590.03 Fall 12 1 Announcement Project proposal submission deadline is Fri, Oct 12 noon . Lecture 10 : 590.03 Fall 12 2 Recap:
CompSci 590.03 Instructor: Ashwin Machanavajjhala
1 Lecture 10 : 590.03 Fall 12
Lecture 10 : 590.03 Fall 12 2
3 Lecture 10 : 590.03 Fall 12
0.2 0.4 0.6
2 4 6 8 10
Laplace Distribution – Lap(λ)
Database
Researcher
Query q
True answer
q(d) q(d) + η η
4 Lecture 10 : 590.03 Fall 12
Thm: If sensitivity of the query is S, then the following guarantees ε- differential privacy.
Sensitivity: Smallest number s.t. for any d, d’ differing in one entry, || q(d) – q(d’) || ≤ S(q) Histogram query: Sensitivity = 2
5 Lecture 10 : 590.03 Fall 12
Lecture 10 : 590.03 Fall 12 6
– Value of x1 η1 = x1 + δ1 – Value of x2 η2 = x2 + δ2 – Value of x1 + x2 η3 = x1 + x2 + δ3
Lecture 10 : 590.03 Fall 12 7
– Ensure that the returned answers are consistent with each other.
– Answer a different set of strategy queries A – Answer original queries using A – Universal Histograms – Wavelet Mechanism – Matrix Mechanism
Lecture 10 : 590.03 Fall 12 8
– Ensure that the returned answers are consistent with each other.
– Answer a different set of strategy queries A – Answer original queries using A – Universal Histograms – Wavelet Mechanism – Matrix Mechanism
Lecture 10 : 590.03 Fall 12 9
Lecture 10 : 590.03 Fall 12 10
η2 and η3
y2 (for x2) from the noisy values.
min (y1-η1)2 + (y2 – η2)2 + (y3 – η3)2 s.t., y1 + y2 = y3
Lecture 10 : 590.03 Fall 12 11
Lecture 10 : 590.03 Fall 12 12
– (without associating a particular count to the corresponding disease)
– (without associating a degree to a particular node)
Lecture 10 : 590.03 Fall 12 13
True Values 20, 10, 8, 8, 8, 5, 3, 2 Noisy Values 25, 9, 13, 7, 10, 6, 3, 1 (noise from Lap(1/ε)) Proof:?
Lecture 10 : 590.03 Fall 12 14
Lecture 10 : 590.03 Fall 12 15
Lecture 10 : 590.03 Fall 12 16
– Ensure that the returned answers are consistent with each other.
– Answer a different set of strategy queries A – Answer original queries using A – Universal Histograms – Wavelet Mechanism – Matrix Mechanism
Lecture 10 : 590.03 Fall 12 17
Lecture 10 : 590.03 Fall 12 18
Private Data
Differential Privacy
Original Query Workload Strategy Query Workload Noisy Strategy Answers Noisy Workload Answers
Q: Suppose we want to answer all range queries? Strategy 1: Answer all range queries using Laplace mechanism
Lecture 10 : 590.03 Fall 12 19
Q: Suppose we want to answer all range queries? Strategy 1: Answer all range queries using Laplace mechanism
Lecture 10 : 590.03 Fall 12 20
Q: Suppose we want to answer all range queries? Strategy 2: Answer all xi queries using Laplace mechanism Answer range queries using noisy xi values.
Lecture 10 : 590.03 Fall 12 21
Strategy 3: Answer sufficient statistics using Laplace mechanism Answer range queries using noisy sufficient statistics.
Lecture 10 : 590.03 Fall 12 22
x1 x2 x3 x4 x5 x6 x7 x8 x12 x34 x56 x78 x1234 x5678 x1-8
[Hay et al VLDB 2010]
Error = 2 x 5log2n/ε2 = x2 + x34 + x56 Error = 2 x 3log2n/ε2
Lecture 10 : 590.03 Fall 12 23
x1 x2 x3 x4 x5 x6 x7 x8 x12 x34 x56 x78 x1234 x5678 x1-8
different noisy answers
Lecture 10 : 590.03 Fall 12 24
x1 x2 x3 x4 x5 x6 x7 x8 x12 x34 x56 x78 x1234 x5678 x1-8
x1234 = x12 + x34 = x1 + x2 + x3 + x4
Lecture 10 : 590.03 Fall 12 25
[Hay et al VLDB 2010]
Lecture 10 : 590.03 Fall 12 26
[Hay et al VLDB 2010]
– Have lower error than noisy counts (upto 10 times smaller in some cases) – Unbiased estimators – Have the least error amongst all unbiased estimators
Lecture 10 : 590.03 Fall 12 27
– Ensure that the returned answers are consistent with each other.
– Answer a different set of strategy queries A – Answer original queries using A – Universal Histograms – Wavelet Mechanism – Matrix Mechanism
Lecture 10 : 590.03 Fall 12 28