Maximum Likelihood Estimation for Learning Populations of Parameters Ramya Korlakai Vinayak
joint work with Weihao Kong, Gregory Valiant, Sham Kakade
Paul G. Allen School of CSE
1
Maximum Likelihood Estimation for Learning Populations of Parameters - - PowerPoint PPT Presentation
Maximum Likelihood Estimation for Learning Populations of Parameters Ramya Korlakai Vinayak Postdoctoral Researcher Paul G. Allen School of CSE joint work with Weihao Kong, Gregory Valiant, Sham Kakade Poster #189 ramya@cs.washington.edu 1
1
2
2
2
2
2
2
2
True Distribution P ?
3
True Distribution P ?
3
True Distribution P ?
3
i=1
True Distribution P ?
3
i=1
4
ˆ Pplug-in = histogram ⇢X1 t , ...Xi t , ..., XN t
✓ 1 √ t ◆
Number of coins
Number of tosses per coin
4
Paninski 2003, Valiant and Valiant 2011, Jiao et. al. 2015, Orlitsky et. al. 2016, Acharya et. al. 2017 ….
ˆ Pplug-in = histogram ⇢X1 t , ...Xi t , ..., XN t
✓ 1 √ t ◆
Number of coins
Number of tosses per coin
4
Paninski 2003, Valiant and Valiant 2011, Jiao et. al. 2015, Orlitsky et. al. 2016, Acharya et. al. 2017 ….
ˆ Pplug-in = histogram ⇢X1 t , ...Xi t , ..., XN t
✓ 1 √ t ◆
✓1 t ◆
Number of coins
Number of tosses per coin
4
Paninski 2003, Valiant and Valiant 2011, Jiao et. al. 2015, Orlitsky et. al. 2016, Acharya et. al. 2017 ….
ˆ Pplug-in = histogram ⇢X1 t , ...Xi t , ..., XN t
✓ 1 √ t ◆
✓1 t ◆
Number of coins
Number of tosses per coin
5
h = [h0, h1, ..hs, .., ht]
1 2 3 4 5
s
hs = # coins that show s heads N
s = 0, 1, ..., t
5
Q∈dist[0,1] KL
h = [h0, h1, ..hs, .., ht]
1 2 3 4 5
s
hs = # coins that show s heads N
s = 0, 1, ..., t
5
Q∈dist[0,1] KL
h = [h0, h1, ..hs, .., ht]
1 2 3 4 5
s
hs = # coins that show s heads N
s = 0, 1, ..., t
5
Q∈dist[0,1] KL
h = [h0, h1, ..hs, .., ht]
1 2 3 4 5
s
hs = # coins that show s heads N
s = 0, 1, ..., t
5
Q∈dist[0,1] KL
h = [h0, h1, ..hs, .., ht]
1 2 3 4 5
s
hs = # coins that show s heads N
s = 0, 1, ..., t
6
Number of coins
Number of tosses per coin
Sparse Regime
W1 ⇣ P ?, ˆ Pmle ⌘ = O ✓1 t ◆
W1 ⇣ P ?, ˆ Pmle ⌘ = O ✓ 1 √t log N ◆
6
inf
f sup P
E [W1(P, f(X))] > Ω ✓1 t ◆ ∨ Ω ✓ 1 √t log N ◆
Number of coins
Number of tosses per coin
Sparse Regime
W1 ⇣ P ?, ˆ Pmle ⌘ = O ✓1 t ◆
W1 ⇣ P ?, ˆ Pmle ⌘ = O ✓ 1 √t log N ◆
7
t
j=0
7
t
j=0
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1
CDF Political Leanings
MLE TVK17 Empirical
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1
CDF Flight Delays
MLE TVK17 Empirical
8
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1
CDF Political Leanings
MLE TVK17 Empirical
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1
CDF Flight Delays
MLE TVK17 Empirical