Differential Privacy: An Economic Method for Choosing Epsilon Justin - PowerPoint PPT Presentation

Differential Privacy: An Economic Method for Choosing Epsilon Justin Hsu 1 Marco Gaboardi 2 Andreas Haeberlen 1 Sanjeev Khanna 1 Arjun Narayan 1 Benjamin C. Pierce 1 Aaron Roth 1 1 University of Pennsylvania 2 University of Dundee July 22, 2014

Problem: Privacy!

Differential privacy? History • Notion of privacy by Dwork, McSherry, Nissim, Smith • Many algorithms satisfying differential privacy now known

Differential privacy? History • Notion of privacy by Dwork, McSherry, Nissim, Smith • Many algorithms satisfying differential privacy now known Some key features • Rigorous: differential privacy must be formally proved • Randomized: property of a probabilistic algorithm • Quantitative: numeric measure of “privacy loss”

In pictures

In words The setting • Database: multiset of records (one per individual) • Neighboring databases D , D ′ : databases differing in one record • Randomized algorithm M mapping database to outputs R

In words The setting • Database: multiset of records (one per individual) • Neighboring databases D , D ′ : databases differing in one record • Randomized algorithm M mapping database to outputs R Definition Let ε > 0 be fixed. M is ε -differentially private if for all neighboring databases D , D ′ and sets of outputs S ⊆ R , Pr[ M ( D ) ∈ S ] ≤ e ε · Pr[ M ( D ′ ) ∈ S ] .

But what about ε ?

The challenge: How to set ε ? The equation ??? Pr[ M ( D ) ∈ S ] ≤ e ε · Pr[ M ( D ′ ) ∈ S ] .

The challenge: How to set ε ? The equation ??? Pr[ M ( D ) ∈ S ] ≤ e ε · Pr[ M ( D ′ ) ∈ S ] . Why do we need to set ε ? • Many private algorithms work for a range of ε , but performance highly dependent on particular choice • Experimental evaluations of private algorithms • Real-world uses of private algorithms

An easy question? Theorists say... • Set ε to be small constant, like 2 or 3 • Proper setting of ε depends on society

An easy question? Theorists say... • Set ε to be small constant, like 2 or 3 • Proper setting of ε depends on society Experimentalists say... • Try a range of values • Literature: ε = 0 . 01 to 100 e ε ∼ 2 . 69 · 10 48 e ε ∼ 1 . 01

We say Think about costs rather than privacy • ε measures privacy, too abstract • Monetary costs: more concrete way to measure privacy

We say Think about costs rather than privacy • ε measures privacy, too abstract • Monetary costs: more concrete way to measure privacy Add more parameters!(?) • Break ε down into more manageable parameters • More parameters, but more concrete • Set ε as function of new parameters

The plan today Model the central tradeoff • Stronger privacy for smaller ε , weaker privacy for larger ε • Better accuracy for larger ε , worse accuracy for smaller ε

The plan today Model the central tradeoff • Stronger privacy for smaller ε , weaker privacy for larger ε • Better accuracy for larger ε , worse accuracy for smaller ε Introduce parameters for two parties Individual: concerned about privacy • Analyst: concerned about accuracy •

The plan today Model the central tradeoff • Stronger privacy for smaller ε , weaker privacy for larger ε • Better accuracy for larger ε , worse accuracy for smaller ε Introduce parameters for two parties Individual: concerned about privacy • Analyst: concerned about accuracy • Combine the parties Balance accuracy against privacy guarantee •

What does ε mean for privacy?

Interpreting ε Participation • Private algorithm M is a study • Bob the individual has choice to participate in the study • Study will happen regardless of Bob’s choice

Interpreting ε Participation • Private algorithm M is a study • Bob the individual has choice to participate in the study • Study will happen regardless of Bob’s choice Bad events • Set of real-world bad events O • Bob wants to avoid these events

Outputs to events Thought experiment: two possible worlds • Identical, except Bob participates in first world and not in the second world • Rest of database, all public information is identical • All differences in two worlds due to the output of the study • Every output r ∈ R leads to an event in O or not

Outputs to events Don’t For all sets of outputs S . . . participate Pr[ M ( D ) ∈ S ] ≤ e ε · Pr[ M ( D ′ ) ∈ S ] . Participate

Outputs to events Don’t For all sets of outputs S . . . participate Pr[ M ( D ) ∈ S ] ≤ e ε · Pr[ M ( D ′ ) ∈ S ] . Participate Bad events interpretation of ε • Let S be set of outputs leading to events in O • Bob participating increases probability of bad event by at most e ε factor

Introducing cost Bad events not equally bad • Cost function on bad events f : O → R + (non-negative) • Insurance premiums, embarrassment, etc.

Introducing cost Bad events not equally bad • Cost function on bad events f : O → R + (non-negative) • Insurance premiums, embarrassment, etc. Our model Pay participants for their cost

How much to pay? Marginal increase in cost • Someone (society?) has decided the study is worth running • Non-participants may feel cost, but are not paid • Only pay participants for increase in expected cost

How much to pay? Marginal increase in cost • Someone (society?) has decided the study is worth running • Non-participants may feel cost, but are not paid • Only pay participants for increase in expected cost The cost of participation • Can show: under ε -differential privacy, expected cost increase is at most e ε factor when participating • Non-participants: expected cost P • Participants: expected cost at most e ε P • Compensate participants: e ε P − P

Summing up: the individual model Individuals • have an expected cost P if they do not participate, determined by their cost function; • can choose to participate in an ε -private study for fixed ε in exchange for fixed monetary payment; • participate if payment is larger than their increase in expected cost for participating: e ε P − P . Bigger for bigger ε

Summing up: the individual model Individuals • have an expected cost P if they do not participate, determined by their cost function; • can choose to participate in an ε -private study for fixed ε in exchange for fixed monetary payment; • participate if payment is larger than their increase in expected cost for participating: e ε P − P . Bigger for bigger ε How to set P ? • Depends on people’s perception of privacy costs • Derive empirically, surveys

The plan today Model the central tradeoff • Stronger privacy for smaller ε , weaker privacy for larger ε • Better accuracy for larger ε , worse accuracy for smaller ε Introduce parameters for two parties Individual: concerned about privacy • Analyst: concerned about accuracy • Combine the parties Balance accuracy against privacy guarantee •

Why not just take ε small?

Differential Privacy: An Economic Method for Choosing Epsilon Justin - PowerPoint PPT Presentation

Differential Privacy: An Economic Method for Choosing Epsilon Justin Hsu 1 Marco Gaboardi 2 Andreas Haeberlen 1 Sanjeev Khanna 1 Arjun Narayan 1 Benjamin C. Pierce 1 Aaron Roth 1 1 University of Pennsylvania 2 University of Dundee July 22, 2014

CS573 Data Privacy and Security Differential Privacy Real World Deployments Li Xiong

Toniann Pitassi Outline 1. Differential Privacy: The Basics 2. Differential Privacy in New

Differential Privacy Techniques Beyond Differential Privacy Steven Wu Assistant Professor

Differential Privacy Li Xiong Outline Differential Privacy Definition Basic techniques

CS573 Data Privacy and Security Local Differential Privacy Li Xiong Privacy at Scale: Local

Differential Privacy (Part III) Approximate (or ( , ))-differential privacy

Data privacy: Privacy models Vicen c Torra March, 2019 Hamilton Institute, Maynooth

Your Plan After High School Choosing a Career Choosing a College College Admissions

DIFFERENTIAL AROMA VOL DIFFERENTIAL AROMA VOL DIFFERENTIAL AROMA VOLATILES DIFFERENTIAL AROMA

$ Lesson Fourteen Consumer Privacy 04/09 privacy and information information privacy: privacy

$ Lesson Ten Consumer Privacy 04/09 privacy and information information privacy: privacy that

CS305 Topic Privacy Concept Evolution Rights to Privacy Privacy and Technologies

Privacy Protection privacy notions and metrics; privacy in RFID systems; location privacy in

Graph Analysis with Node Differential Privacy Node Differential Privacy Sofya Sofya

Mobile Data Collection and Analysis with Local Differential Privacy - Part 1 Ninghui Li (Purdue

Data Mining with Differential Privacy Arik Friedman and Assal Schuster by Slawomir Goryczka

Deep Learning With Differential Privacy Presenter: Xiaojun Xu Deep Learning Framework

JUST THE MATHS SLIDES NUMBER 15.1 ORDINARY DIFFERENTIAL EQUATIONS 1 (First order

Correlation Autoencoder Hashing for Supervised Cross-Modal Search . . . Yue Cao, Mingsheng

Massively Parallel Computation Philip Bille Sequential Computation Computation. Read and

Risky Traitor Tracing and New Differential Privacy Negative Results Rishab Goyal Venkata Koppula

A Differential Equation for Modeling Nesterovs Accelerated Gradient Method: Theory and Insights

LightDP: Towards Automating Differential Privacy Proofs Danfeng Zhang Daniel Kifer Penn

3.1 Classic Differential Geometry Hao Li http://cs599.hao-li.com 1 Spring 2014 CSCI 599:

Sambuz

Useful Links

Newsletter

Mail Us