Efficient Policy Learning from Surrogate-Loss Classifications
Andrew Bennett (Cornell Tech)
Joint work with Nathan Kallus (Cornell Tech)
Andrew Bennett Efficient Policy Learning 1 / 25
This Talk
1 Introduction
2 Surrogate-Loss Reduction
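Equation fragments recovered from this part of the talk: $[Y(+1) - Y(-1)]$ (preceded by a factor involving 2), $\frac{TY}{P(T \mid X)}$, a second term with denominator $P(T \mid X)$, and a sum $\sum_{i=1}^{n}$ over a hatted (estimated) quantity.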
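The surviving fragment TY / P(T|X) has the form of an inverse-propensity-weighted score. As a rough illustration only, the Python sketch below computes that quantity per sample and then averages it against a policy's actions over i = 1, ..., n. The function names `ips_scores` and `policy_weighted_mean`, the coding of T and of the policy's actions in {-1, +1}, and the choice to average the action-weighted scores are all assumptions made for this sketch; the recovered text does not confirm them.

```python
from typing import List, Sequence


def ips_scores(
    t: Sequence[int], y: Sequence[float], p_observed: Sequence[float]
) -> List[float]:
    """Per-sample score T * Y / P(T | X).

    Assumption: t[i] is in {-1, +1} and p_observed[i] is the probability
    of the treatment that was actually observed for sample i.
    """
    if not (len(t) == len(y) == len(p_observed)):
        raise ValueError("t, y and p_observed must have the same length")
    scores = []
    for t_i, y_i, p_i in zip(t, y, p_observed):
        if t_i not in (-1, 1):
            raise ValueError("treatments must be coded as -1 or +1")
        if not 0.0 < p_i <= 1.0:
            raise ValueError("propensities must lie in (0, 1]")
        scores.append(t_i * y_i / p_i)
    return scores


def policy_weighted_mean(actions: Sequence[int], scores: Sequence[float]) -> float:
    """Average of action * score over the sample (hypothetical objective)."""
    if len(actions) != len(scores) or len(scores) == 0:
        raise ValueError("actions and scores must be non-empty and equal length")
    return sum(a * s for a, s in zip(actions, scores)) / len(scores)


# Toy data, invented purely for illustration.
toy_scores = ips_scores(
    t=[1, -1, 1, -1], y=[2.0, 1.0, 4.0, 3.0], p_observed=[0.5, 0.5, 0.25, 0.5]
)
toy_value = policy_weighted_mean([1, -1, 1, 1], toy_scores)
```

Dividing by the propensity of the observed treatment up-weights samples whose treatment was unlikely given X, which is why propensities at or near zero are rejected here rather than silently clipped.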
Equation fragment: an optimization with $g$ unconstrained.
Equation fragments: an optimization with $\pi$ unconstrained, and an optimization of $L(\theta)$ over $\theta \in \Theta$.
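Equation fragments: an empirical (sample size $n$) objective optimized over $\theta \in \Theta$ and $f \in \mathcal{F}$.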