Brahma S. Pavse (UT Austin) Reducing Sampling Error in Batch Temporal Difference Learning
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse¹, Ishan Durugkar¹, Josiah Hanna², Peter Stone¹,³
¹The University of Texas at Austin, ²The University of Edinburgh, ³Sony AI
ICML, July 2020
brahmasp@cs.utexas.edu