Policy Evaluation with Latent Confounders via Optimal Balance
Andrew Bennett1 Cornell University awb222@cornell.edu Nathan Kallus1 Cornell University kallus@cornell.edu
1Alphabetical order. 1 / 33
Policy Evaluation with Latent Confounders via Optimal Balance Andrew - - PowerPoint PPT Presentation
Policy Evaluation with Latent Confounders via Optimal Balance Andrew Bennett 1 Cornell University awb222@cornell.edu Nathan Kallus 1 Cornell University kallus@cornell.edu 1 Alphabetical order. 1 / 33 Policy Learning Problem Given some
1Alphabetical order. 1 / 33
2 / 33
3 / 33
4 / 33
5 / 33
6 / 33
7 / 33
8 / 33
9 / 33
10 / 33
11 / 33
12 / 33
13 / 33
14 / 33
15 / 33
16 / 33
17 / 33
18 / 33
19 / 33
20 / 33
21 / 33
1
2
22 / 33
23 / 33
24 / 33
25 / 33
26 / 33
i from the posterior ϕ(· ; Xi, Ti)
1 B2
b=1
c=1 K(Z b i , Z c i )
j=1 QijπTj (Xi)
27 / 33
28 / 33
1 OptZ Our method, using Γ = γ Identity(n) for
2 IPS IPS weights based on X using estimated ˆ
3 OptX The optimal weighting method of (Kallus 2018) with same
4 DirX Direct method by fitting ˆ
5 DirZ Direct method by first fitting ˆ
6 D:W Doubly robust estimation using direct estimator D and weighted
29 / 33
30 / 33
31 / 33
32 / 33
33 / 33