SLIDE 14 14 CISPA Center for IT Security, Privacy and Accountabiltiy Foundations of Cybersecurity 2016
Post-Sanitization and Differential Privacy
Post sanitization (deterministic and probabilistic) possible: As long as it only depends on the noisy output (not on the original dataset), every computation is possible and does not decrease privacy.
40 Introduction to Cybersecurity 2016 170 Students have passed, 82 have failed
Precise answer
167.3 Students have passed, 83.4 have failed
โฆ
Noisy answer
Name Age Gender Semester Grade Alice 19 Female 1 1.3 Bob 18 Male 1 failed Charlie 18 Male 1 1.7 Dave 18 Male 1 3.7 Eve 17 Female 1 1.0 Fritz 19 Male 3 1.3 Gerd 21 Male 3 2.3 Hans 23 Male 3 3.0 Isa 20 Female 3 failed
Arbitrary computation:
rounding, bounding, relations e.g., If (answer < 0) then answer = 0
167 Students have passed, 83 have failed. Twice as many students have passed than have failed
More details in future lectures!
Introduction to Cybersecurity 2016
Privacy-friendly Aggregation (Smart Metering)
41
1 2 3 4 5 6 7
aggregated
kW 2 4 6 time 00:00 12:00 06:00 18:00
2 4 6 8 10 12 14 16 18 20
๐๐๐ เต ฮ๐ ๐
Goal: Privacy Guarantess for aggregated data: It should be impossible to infer the energy consumption of any individual household
Probability density function: ๐๐๐๐ ๐ฆ =
1 2๐ ๐โ ๐ฆโฮ๐ ๐
kW 500 1000 1500 Zeit 00:00 12:00 06:00 18:00 ฮ๐: Sensitivity, i.e. a single userโs/householdโs impact
๐: Privacy parameter from Differential Privacy
Technische Definition: Differential Privacy: Pr ๐ฆ โ ๐; ๐ฆ โ ๐ ๐ธ1 โค ๐๐ Pr ๐ฆ โ ๐; ๐ฆ โ ๐ ๐ธ2 + ๐
Introduction to Cybersecurity 2016
Challenges: Sanitization of dynamic data(streaming) decentralized noising, learning with differential privacy Potential killer arguments: provided utility not sufficient for any practical use-case; interesting outliers are removed Privacy-friendly Aggregation (Smart Metering)
42
1 2 3 4 5 6 7
aggregated
kW 2 4 6 time 00:00 12:00 06:00 18:00
2 4 6 8 10 12 14 16 18 20
๐๐๐ เต ฮ๐ ๐
Probability density function: ๐๐๐๐ ๐ฆ =
1 2๐ ๐โ ๐ฆโฮ๐ ๐
kW 500 1000 1500 Zeit 00:00 12:00 06:00 18:00 ฮ๐: Sensitivity, i.e. a single userโs/householdโs impact
๐: Privacy parameter from Differential Privacy