- R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction
1
Chapter 6: Temporal Difference Learning
Introduce Temporal Difference (TD) learning Focus first on policy evaluation, or prediction, methods Compare efficiency of TD learning with MC learning Then extend to control methods Objectives of this chapter: