Stochastic Optimal Control – part 2 discrete time, Markov Decision Processes, Reinforcement Learning
Marc Toussaint
Machine Learning & Robotics Group – TU Berlin mtoussai@cs.tu-berlin.de ICML 2008, Helsinki, July 5th, 2008
- Why stochasticity?
- Markov Decision Processes
- Bellman optimality equation, Dynamic Programming, Value Iteration
- Reinforcement Learning: learning from experience
1/21