Planning and Optimization
- G5. Monte-Carlo Tree Search: Framework
Gabriele R¨
- ger and Thomas Keller
Universit¨ at Basel
Planning and Optimization G5. Monte-Carlo Tree Search: Framework - - PowerPoint PPT Presentation
Planning and Optimization G5. Monte-Carlo Tree Search: Framework Gabriele R oger and Thomas Keller Universit at Basel December 10, 2018 Motivation MCTS Tree Framework Summary Content of this Course Tasks Progression/ Regression
Universit¨ at Basel
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Decision or OR nodes Chance or AND nodes
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
visit counter N(d) state-value estimate ˆ V (d) state s(d) probability p(d)
visit counter N(c) action-value (or Q-value) estimate ˆ Q(c) state s(c) action a(c)
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Taken from Browne et al., “A Survey of Monte Carlo Tree Search Methods”, 2012
Motivation MCTS Tree Framework Summary
1
an action is applicable that is not explicated, or
2
an outcome is sampled that is not explicated, or
3
a goal state is reached
extending average state-/action-values estimate with accumulated cost following the search node (both from simulation and decisions in the tree) increasing visit counter by 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 / 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 / 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 / 19 1 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 10 2 8 2 22 2 28 2 19/1 19 1 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
19 9 35/1 9/4 25/4 35 1 13 3 8 2 22 2 28 2 19/1 19 1 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
19 9 35/1 11/5 25/4 35 1 13 3 8 2 22 2 28 2 19/1 19 1 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
19 10 35/1 11/5 25/4 35 1 13 3 8 2 22 2 28 2 19/1 19 1 12/1 10/1 16/1 24/1 12 1 10 1 16 1 24 1 19
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary
V (d) N(d)+1 , N(d) := N(d) + 1
Motivation MCTS Tree Framework Summary
Q(c) N(c)+1 , N(c) := N(c) + 1
Motivation MCTS Tree Framework Summary
Motivation MCTS Tree Framework Summary