Privacy-Preserving Inference in Crowdsourcing Systems
Liyao Xiang Supervisor: Baochun Li
- Oct. 9, 2017
Privacy-Preserving Inference in Crowdsourcing Systems Liyao Xiang - - PowerPoint PPT Presentation
Privacy-Preserving Inference in Crowdsourcing Systems Liyao Xiang Supervisor: Baochun Li Oct. 9, 2017 University of Toronto Localization via Crowdsourcing ? A d AC d AB ? B C d BC In a crowd, some users know about their locations
2
B A C dAB dAC dBC
? ?
3
time t user i Prior estimate Z i,t time t user j Prior estimate Z j,t Upload Z i,t, Dij Upload Z j,t, Dji Run inference alg. Return Z* i,t Return Z* j,t
4
B dAB dAC dBC A C
? ?
5
B dAB dAC dBC A C
? ?
6
Z∗
t = arg max Zt
P(Zt|Zt−1, D).
7
8
9
10
ci,r = Y
j∈N (i)
Y
s∈{1,...,R}
Epk(ln wj,s) · Epk(d(zi,r, zj,s)2)−
1 2σ2
· Epk(Dij)
d(zi,r,zj,s) σ2
· Epk(D2
ij)−
1 2σ2
= Epk[ X
j∈N (i)
X
s∈{1,...,R}
(ln wj,s − (d(zi,r, zj,s) − Dij)2/2σ2)].
11
wk
i,r = wk−1 i,r
exp[Esk(ci,r)] = wk−1
i,r
exp[ X
j∈N (i)
X
s∈{1,...,R}
(ln wj,s (djs Dij)2/2σ2)] = wk−1
i,r
Y
j∈N (i)
Y
s∈{1,...,R}
exp(ln wj,s (djs Dij)2/2σ2) = wk−1
i,r
Y
j∈N (i)
Y
s∈{1,...,R}
wj,s · exp ⇣ (djs Dij)2 2σ2 ⌘ ' wk−1
i,r
Y
j∈N (i)
Y
s∈{1,...,R}
Pr(zi,r, zj,s|Dij,t).
12
Run inference. time t Prior Z i,t. U p l
d Z i , t, E( w) a n d E(D) . Decrypt and update prior with Z* i, t. D
n l
d C i , t . Upload Z i, t+1, E(w) and E(D). time t+1 Prior Z i,t+1. Download C i, t+1.
13
14
N(0, σ2)
15
A mechanism M satisfies (✏, )-differential privacy iff for all z, z0 that are d(z, z0) apart: Pr[M(z) ∈ Y ] ≤ e✏Pr[M(z0) ∈ Y ] + , and ✏ = ⇢d2(z, z0) + 2 p ⇢ log(1/)d(z, z0), where ρ is a constant specific to the perturbation mechanism we adopt.
16
✏
z
d ( z , z ’ )
z’ M(z)(Y) M(z’)(Y) M(z’’)(Y) z” d(z, z”) ϵ1 ϵ1 ϵ2 ϵ2 ϵ1 < ϵ2 ϵ1 < ϵ2
✏
17
If we report Z = (z1, ..., zR) as Y = (y1, ..., yR), then the probability of reporting Y given Z is: Pr[M(Z) ∈ Y] = Y
i
Pr[M(zi) ∈ Y ]. The user enjoys (✏0, )-differential privacy with ✏0 = ⇢Rd2(Z, Z0) + 2 p ⇢ log(1/)Rd2(Z, Z0).
18
˜ d(y, y0) = q ||y − y0||2
2 − 4σ2.
19
20
˜ d(y, y0) √ R d(z, z0)
21
20 40 60 80 0.2 0.4 0.6 0.8 Highest Particle Weight
Convergence of the Particle Distribution
10 20 30 40 50 2000 7000 12000 Number of Users Average Running Time (ms) Running Time of the MAP Inference R = 50 R = 75 R = 100
22
5 10 15 20 25 0.2 0.4 0.6 0.8 1 Position Error (m) CDF Position Error of 20 Users(R = 100) Unperturbed σ = 0.2 σ = 0.7 σ = 1.0 σ = 1.5 σ = 2.3 5 10 15 20 25 0.2 0.4 0.6 0.8 1 Position Error (m) CDF Position Error of 20 Users( σ = 0.5) R = 50 R = 75 R = 100 R = 125 R = 150
23
10 20 30 40 0.2 0.4 0.6 0.8 1 Position Error (m) CDF Comparison with Hilbert Curves on RWP Model Hilbert Curves (n = 64) Hilbert Curves (n = 512) Private Inference (σ = 5.0) Private Inference (σ = 10.0) 5 10 15 2 4 6 8 10 Sequence number Average Position Error(m) Average Position Error of 7 Users in Different Settings Unperturbed σ = 0.2, ε = 23.23 σ = 0.7, ε = 4.09 σ = 1.0, ε = 2.65 σ = 2.3, ε = 1.03
24