CS885 Reinforcement Learning Lecture 13c: June 13, 2018
Adversarial Search [RusNor] Sec. 5.1-5.4
CS885 Spring 2018 Pascal Poupart 1 University of Waterloo
CS885 Reinforcement Learning Lecture 13c: June 13, 2018 Adversarial - - PowerPoint PPT Presentation
CS885 Reinforcement Learning Lecture 13c: June 13, 2018 Adversarial Search [RusNor] Sec. 5.1-5.4 University of Waterloo CS885 Spring 2018 Pascal Poupart 1 Outline Minimax search Evaluation functions Alpha-beta pruning University
CS885 Spring 2018 Pascal Poupart 1 University of Waterloo
CS885 Spring 2018 Pascal Poupart 2
University of Waterloo
CS885 Spring 2018 Pascal Poupart 3
University of Waterloo
CS885 Spring 2018 Pascal Poupart 4
University of Waterloo
CS885 Spring 2018 Pascal Poupart 5
University of Waterloo
CS885 Spring 2018 Pascal Poupart 6
MINIMAX-VALUE(n) = Utility(n) if n is a terminal state Maxs Î Succ(n) MINIMAX-VALUE(s) if n is a MAX node Mins Î Succ(n) MINIMAX-VALUE(s) if n is a MIN node
University of Waterloo
CS885 Spring 2018 Pascal Poupart 7
University of Waterloo
CS885 Spring 2018 Pascal Poupart 8
University of Waterloo
CS885 Spring 2018 Pascal Poupart 9
University of Waterloo
CS885 Spring 2018 Pascal Poupart 10
University of Waterloo
CS885 Spring 2018 Pascal Poupart 11
University of Waterloo
CS885 Spring 2018 Pascal Poupart 12
University of Waterloo
CS885 Spring 2018 Pascal Poupart 13
MAX MIN [-inf, inf] 3 [-inf, 3]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 14
MAX MIN 3 12 [-inf,3] [-inf,inf]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 15
MAX MIN 3 12 8 [3,3] [3,inf]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 16
MAX MIN 3 12 8 [3,3] [3,inf] 2 [-inf,2]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 17
MAX MIN 3 12 8 [3,3] [3,inf] 2 [-inf,2] Prune remaining children
University of Waterloo
CS885 Spring 2018 Pascal Poupart 18
MAX MIN 3 12 8 [3,3] 2 [-inf,2] 14 [-inf,14] [3,14]
University of Waterloo
CS885 Spring 2018 Pascal Poupart 19
MAX MIN 3 12 8 [3,3] 2 [-inf,2] 14 [-inf,5] [3,5] 5
University of Waterloo
CS885 Spring 2018 Pascal Poupart 20
MAX MIN 3 12 8 [3,3] 2 [-inf,2] 14 [2,2] [3,3] 5 2
University of Waterloo
CS885 Spring 2018 Pascal Poupart 21
University of Waterloo
CS885 Spring 2018 Pascal Poupart 22
University of Waterloo
CS885 Spring 2018 Pascal Poupart 23
University of Waterloo
CS885 Spring 2018 Pascal Poupart 24
University of Waterloo
CS885 Spring 2018 Pascal Poupart 25
University of Waterloo
CS885 Spring 2018 Pascal Poupart 26
University of Waterloo