Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu† Lihong Li‡ Ziyang Tang† Dengyong Zhou‡
† Department of Computer Science, The University of Texas at Austin ‡ Google Brain (KIR)
Liu et al. Breaking the Curse of Horizon 1 / 7