Lecture 14: MCTS 2
Emma Brunskill
CS234 Reinforcement Learning.
Winter 2018
2With many slides from or derived from David Silver Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 3 Winter 2018 1 / 57
Class Structure Last time: Batch RL This Time: MCTS Next time: - - PowerPoint PPT Presentation
Lecture 14: MCTS 2 Emma Brunskill CS234 Reinforcement Learning. Winter 2018 2 With many slides from or derived from David Silver Lecture 14: MCTS 3 Emma Brunskill (CS234 Reinforcement Learning. ) Winter 2018 1 / 57 Class Structure Last time:
2With many slides from or derived from David Silver Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 3 Winter 2018 1 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 4 Winter 2018 2 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 5 Winter 2018 3 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 6 Winter 2018 4 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 7 Winter 2018 5 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 8 Winter 2018 6 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 9 Winter 2018 7 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 10 Winter 2018 8 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 11 Winter 2018 9 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 12 Winter 2018 10 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 13 Winter 2018 11 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 14 Winter 2018 12 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 15 Winter 2018 13 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 16 Winter 2018 14 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 17 Winter 2018 15 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 18 Winter 2018 16 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 19 Winter 2018 17 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 20 Winter 2018 18 / 57
Sampled experience B, 1 B, 0 B, 1 A, 0 B, 1 B, 1 A, 0 B, 1 B, 1 B, 0
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 21 Winter 2018 19 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 22 Winter 2018 20 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 23 Winter 2018 21 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 24 Winter 2018 22 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 25 Winter 2018 23 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 26 Winter 2018 24 / 57
t+1, ..., Sk T}K k=1 ∼ Mv, π
K
P
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 27 Winter 2018 25 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 28 Winter 2018 26 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 29 Winter 2018 27 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 30 Winter 2018 28 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 31 Winter 2018 29 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 32 Winter 2018 30 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 33 Winter 2018 31 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 34 Winter 2018 32 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 35 Winter 2018 33 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 36 Winter 2018 34 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 37 Winter 2018 35 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 38 Winter 2018 36 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 39 Winter 2018 37 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 40 Winter 2018 38 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 41 Winter 2018 39 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 42 Winter 2018 40 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 43 Winter 2018 41 / 57
44Relates to metalevel reasoning (for an example related to Go see ”Selecting
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 45 Winter 2018 42 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 46 Winter 2018 43 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 47 Winter 2018 44 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 48 Winter 2018 45 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 49 Winter 2018 46 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 50 Winter 2018 47 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 51 Winter 2018 48 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 52 Winter 2018 49 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 53 Winter 2018 50 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 54 Winter 2018 51 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 55 Winter 2018 52 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 56 Winter 2018 53 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 57 Winter 2018 54 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 58 Winter 2018 55 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 59 Winter 2018 56 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 14: MCTS 60 Winter 2018 57 / 57