Lecture 16: MCTS 1
Emma Brunskill
CS234 Reinforcement Learning.
Winter 2018
1With many slides from or derived from David Silver Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 1 / 57
Class Structure Last time: Batch RL This Time: MCTS Next time: - - PowerPoint PPT Presentation
Lecture 16: MCTS 1 Emma Brunskill CS234 Reinforcement Learning. Winter 2018 1 With many slides from or derived from David Silver Lecture 16: MCTS 1 Emma Brunskill (CS234 Reinforcement Learning. ) Winter 2018 1 / 57 Class Structure Last time:
1With many slides from or derived from David Silver Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 1 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 2 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 3 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 4 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 5 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 6 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 7 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 8 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 9 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 10 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 11 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 12 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 13 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 14 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 15 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 16 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 17 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 18 / 57
Sampled experience B, 1 B, 0 B, 1 A, 0 B, 1 B, 1 A, 0 B, 1 B, 1 B, 0
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 19 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 20 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 21 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 22 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 23 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 24 / 57
t+1, ..., Sk T}K k=1 ∼ Mv, π
K
P
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 25 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 26 / 57
1With many slides from or derived from David Silver Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 26 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 27 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 28 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 29 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 30 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 31 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 32 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 33 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 34 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 35 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 36 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 37 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 38 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 39 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 40 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 41 / 57
1Relates to metalevel reasoning (for an example related to Go see ”Selecting
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 42 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 43 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 44 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 45 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 46 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 47 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 48 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 49 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 50 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 51 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 52 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 53 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 54 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 55 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 56 / 57
Emma Brunskill (CS234 Reinforcement Learning. ) Lecture 16: MCTS 1 Winter 2018 57 / 57