Graphical-model based estimation and inference for differential - - PowerPoint PPT Presentation

graphical model based estimation and inference for
SMART_READER_LITE
LIVE PREVIEW

Graphical-model based estimation and inference for differential - - PowerPoint PPT Presentation

Graphical-model based estimation and inference for differential privacy Ryan McKenna 1 , Daniel Sheldon 1,2 , Gerome Miklau 1 1 University of Massachusetts, Amherst 2 Mount Holyoke College Inference in Privacy Mechanisms Randomized Algorithm


slide-1
SLIDE 1

Graphical-model based estimation and inference for differential privacy

Ryan McKenna1, Daniel Sheldon1,2, Gerome Miklau1

1University of Massachusetts, Amherst 2Mount Holyoke College

slide-2
SLIDE 2

Inference in Privacy Mechanisms

Sensitive Data Workload Answers

Randomized 
 Algorithm

slide-3
SLIDE 3

Inference in Privacy Mechanisms

Sensitive Data Workload Answers

Randomized 
 Algorithm

Private Observations

Inference 
 Algorithm

slide-4
SLIDE 4

Inference in Privacy Mechanisms

Sensitive Data Workload Answers

  • Existing techniques for inference either don’t scale or don’t

extract the most utility from the private observations

  • Proper inference has many benefits:
  • Resolves inconsistencies
  • Improves utility
  • Answers new queries
  • Supports synthetic data generation

Randomized 
 Algorithm

Private Observations

Inference 
 Algorithm

slide-5
SLIDE 5

Problem Statement

  • Given:


an unknown discrete data distribution p ∈ ℝn 
 a query matrix Q ∈ ℝm x n

  • Our observation model is:


  • We want to recover an estimate of p from y

y = Qp + ε

̂ p ∈ arg min

p∈S ∥Qp − y∥

Random Laplace or Gaussian noise Size of p is intractably large

slide-6
SLIDE 6

Approach

  • Reformulate problem to find a graphical model pθ instead 



 
 
 


  • If Q only depends on p though its marginals,
  • We can solve this problem efficiently
  • Solution to reformulated problem is the maximum

entropy solution to the original problem

̂ θ ∈ arg min

θ ∥Qpθ − y∥

Much smaller than p

slide-7
SLIDE 7

Scalability Improvements of PGM

101 102 103

AWWrLbuWes

0.0 0.5 1.0 1.5 2.0

TLPe Ser IWeraWLRn (s) 0W LS05 3G0

  • Graphical-model inference scales much better than

traditional approaches.

PGM scales to 1000 dimensions Traditional approaches fail at 10 dimensions

slide-8
SLIDE 8

WiWanic aGulW loans sWroke

0.0 0.5 1.0 1.5 2.0 2.5

WorkloaG Error PrivBayes PrivBayes+PG0

Dataset 6X 1.4X 1.5X 6X

Utility Improvements of PGM

  • Graphical-model inference improves the utility of several

state-of-the-art privacy mechanisms.

Error reduction up to 6X We offer similar improvements for DualQuery, HDMM, and MWEM as well (see poster)

slide-9
SLIDE 9

Graphical-model based estimation and inference for differential privacy

Poster #171

Code available on GitHub:

https://github.com/ryan112358/private-pgm