Stochastic approximation for speeding up LSTD/LSPI (and least squares regression/LinUCB)
Prashanth L A†
Joint work with Nathaniel Korda♯ and Rémi Munos†
†INRIA Lille - Team SequeL ♯MLRG - Oxford University
November 24, 2014
Prashanth L A Fast LSTD using SA November 24, 2014 1 / 39