Lecture 16: MCTS 1
Emma Brunskill
CS234 Reinforcement Learning.
Winter 2020
1 With many slides from or derived from David Silver
Zoom Logistics
When listening, please set your video off and mute
$\{\ldots{}_{t+1}, \ldots, S^k_T\}_{k=1}^{K} \sim \mathcal{M}_v, \pi$
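The fragment above, ending in "∼ Mv, π", appears to describe K episodes sampled from a model under a simulation policy π. A minimal sketch of that sampling step follows. It rests on assumptions that are not in the slides: the names `sample_episode`, `sample_episodes`, `toy_model`, and `uniform_policy` are hypothetical, and the four-state chain is an invented toy model used only so that the code runs.

```python
import random

def sample_episode(model, policy, state, horizon, rng):
    """Roll out one episode from `state`; returns a list of (state, action, reward)."""
    episode = []
    for _ in range(horizon):
        action = policy(state, rng)
        next_state, reward, done = model(state, action, rng)
        episode.append((state, action, reward))
        state = next_state
        if done:
            break
    return episode

def sample_episodes(model, policy, start_state, K, horizon, seed=0):
    """Draw K episodes from the model while following the simulation policy."""
    rng = random.Random(seed)
    return [sample_episode(model, policy, start_state, horizon, rng)
            for _ in range(K)]

# Hypothetical toy model (an assumption, not from the lecture):
# states 0..3 on a chain, actions -1/+1, reward 1 on reaching state 3.
def toy_model(state, action, rng):
    next_state = max(0, min(3, state + action))
    reward = 1.0 if next_state == 3 else 0.0
    return next_state, reward, next_state == 3

# Hypothetical simulation policy: pick a direction uniformly at random.
def uniform_policy(state, rng):
    return rng.choice([-1, 1])

episodes = sample_episodes(toy_model, uniform_policy, start_state=1, K=5, horizon=10)
for k, ep in enumerate(episodes, start=1):
    print(k, len(ep), sum(r for _, _, r in ep))
```

Each printed row gives the episode index k, its length, and its undiscounted return. Every episode starts from the same state, which matches the idea of simulating forward from the current state.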
1 Relates to metalevel reasoning (for an example related to Go see "Selecting
AlphaGo trailer link