Games we will consider Deterministic Discrete states and decisions - PDF document

Games we will consider • Deterministic • Discrete states and decisions CS 331: Artificial Intelligence • Finite number of states and decisions Adversarial Search • Perfect information i.e. fully observable • Two agents whose actions alternate • Their utility values at the end of the game are equal and opposite (we call this zero-sum) “It’s not enough for me to win, I have to see my opponents lose” 1 2 Which of these games fit the What makes games hard? description? • Hard to solve e.g. Chess has a search graph Two-player, zero-sum, discrete, finite, deterministic games of perfect information with about 10 40 distinct nodes • Need to make a decision even though you can’t calculate the optimal decision • Need to make a decision with time limits 4 Formal Definition of a Game Nim A quintuplet (S, I, Succ(), T, U): Many different variations. We’ll do this one. Finite set of states. States include information on which player’s S • Start with 9 beaver logos turn it is to move. • In one player’s turn, that player can I Initial board position and which player is first to move remove 1, 2 or 3 beaver logos Succ() Takes a current state and returns a list of (move,state) pairs, each • The person who takes the last beaver logo indicating a legal move and the resulting state wins T Terminal test which determines when the game ends. Terminal states: subset of S in where the game has ended U Utility function (aka objective function or payoff function): maps from terminal state to real number 5 6 1

Nim Formal Definition of Nim Notation: Max(IIIII) Who’s move # matches left A quintuplet (S, I, Succ(), T, U): S Max(IIIII), Max(III), Max(II), Max(I) Min(IIII), Min(III), Min(II), Min(I) I Max(IIIII) Succ() Succ(Max(IIIII)) = {Min(IIII),Min(III),Min(II)} Succ(Min(IIII)) = {Max(III),Max(II),Max(I)} Succ(Max(III)) = {Min(II),Min(I)} Succ(Min(III)) = {Max(II),Max(I)} Succ(Max(II)) = {Min(I)} Succ(Min(II)) = {Max(I)} T Max(I), Max(II), Max(III), Min(I), Min(II), Min(III) U Utility(Max(I) or Max(II) or Max(III)) = +1, Utility(Min(I) or Min(II) or Min(III)) = -1 7 8 Nim Game Tree How to Use a Game Tree Max IIIII • Max wants to maximize his utility • Min wants to minimize Max’s utility Min IIII III II • Max’s strategy must take into account what Max III II I II I -1 I -1 Min does since they alternate moves • A move by Max or Min is called a ply Min II I +1 I +1 +1 I +1 +1 +1 Max I -1 -1 -1 -1 +1 Min We’ll call the players Max and Min, with Max starting first 10 Nim Game Tree The Minimax Value of a Node Max IIIII The minimax value of a node is the utility for MAX of being in the corresponding state, Min IIII III II assuming that both players play optimally from there to the end of the game Max III II I II I -1 I -1  MINIMAX - VALUE( n ) Min II I +1 I +1 +1 I +1 +1 +1 UTILITY( n ) If n is a terminal state max MINIMAX - VALUE( s ) Max If n is a MAX node I -1 -1 -1 -1 s  Successors ( n ) min MINIMAX - VALUE( s ) If n is a MIN node s  Successors ( n ) +1 Min Minimax value maximizes worst-case outcome for MAX 12 2

Minimax Values in Nim Game Tree Minimax Values in Nim Game Tree Max Max IIIII IIIII Min Min IIII III II IIII III II Max Max III II I II I -1 I -1 III II I II I -1 I -1 II I +1 I +1 +1 I +1 +1 +1 -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Min Min Max Max +1 I -1 -1 -1 -1 +1 I -1 -1 -1 -1 +1 +1 Min Min 13 14 Minimax Values in Nim Game Tree Minimax Values in Nim Game Tree Max Max IIIII IIIII Min Min IIII III II +1 IIII -1 III -1 II Max Max +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 Min -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Min -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Max Max +1 I -1 -1 -1 -1 +1 I -1 -1 -1 -1 +1 +1 Min Min 15 16 Minimax Values in Nim Game Tree Minimax Values in Nim Game Tree Minimax decision at the root: Max Max +1 IIIII +1 IIIII taking this action results in the successor with highest minimax value Min Min +1 IIII -1 III -1 II +1 IIII -1 III -1 II Max Max +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 Min -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Min -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Max Max +1 I -1 -1 -1 -1 +1 I -1 -1 -1 -1 +1 +1 Min Min 17 18 3

Another Example Another Example = Maximizing player MAX MAX A A = Minimizing player B C D B C D MIN MIN 3 2 2 3 12 8 2 4 6 14 5 2 3 12 8 2 4 6 14 5 2 19 20 The MINIMAX Algorithm Another Example function MINIMAX-DECISION( state ) returns an action inputs : state , current state in game v ← MAX -VALUE( state ) 3 MAX A return the action in SUCCESSORS( state ) with value v function MAX-VALUE( state ) returns a utility value if TERMINAL-TEST( state ) then return UTILITY( state ) v ← - Infinity B C D MIN 3 2 2 for a , s in SUCCESSORS( state ) do v ← MAX( v , MIN-VALUE( s )) return v function MIN-VALUE( state ) returns a utility value if TERMINAL-TEST( state ) then return UTILITY( state ) v ← Infinity for a , s in SUCCESSORS( state ) do v ← MIN( v , MAX-VALUE( s )) 3 12 8 2 4 6 14 5 2 return v 21 22 The MINIMAX algorithm Minimax With 3 Players • Computes minimax decision from the current state A • Depth-first exploration of the game tree • Time Complexity O(b m ) where b=# of legal B moves, m=maximum depth of tree • Space Complexity: C – O(bm) if all successors generated at once – O(m) if only one successor generated at a time (each A partially expanded node remembers which successor to generate next) (1,2,6) (4,2,3) (6,1,2) (7,4,1) (5,1,1) (1,5,2) (7,7,1) (5,4,5) Now have a vector of utilities for players (A,B,C). All players maximize their utilities. Note: In two-player, zero-sum games, we have a single value 23 24 because the values are always opposite. 4

Minimax With 3 Players Minimax With 3 Players A A B B (1,2,6) (1,5,2) C (1,2,6) (6,1,2) (1,5,2) (5,4,5) C (1,2,6) (6,1,2) (1,5,2) (5,4,5) (1,2,6) (4,2,3) (6,1,2) (7,4,1) (5,1,1) (1,5,2) (7,7,1) (5,4,5) (1,2,6) (4,2,3) (6,1,2) (7,4,1) (5,1,1) (1,5,2) (7,7,1) (5,4,5) 25 26 Minimax With 3 Players Subtleties With Multiplayer Games • Alliances can be made and broken A (1,2,6) • For example, if A and B are weaker than C, B (1,2,6) (1,5,2) they can gang up on C • But A and B can turn on each other once C C (1,2,6) (6,1,2) (1,5,2) (5,4,5) is weakened • But society considers the player that breaks the alliance to be dishonorable (1,2,6) (4,2,3) (6,1,2) (7,4,1) (5,1,1) (1,5,2) (7,7,1) (5,4,5) 27 28 Pruning in Nim Pruning Max +1 IIIII • Can we improve on the time complexity of O(b m )? Min +1 IIII -1 III -1 II • Yes if we prune away branches that cannot Max possibly influence the final decision +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 Min -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Max +1 I -1 -1 -1 -1 If we know that the only two outcomes are +1 and -1, +1 Min what branches do we not need to explore when minimax backtracks? 29 5

Pruning in Nim Pruning in Nim Max Max +1 IIIII +1 IIIII Min Min +1 IIII -1 III -1 II +1 IIII -1 III -1 II Max Max +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 +1 III +1 II +1 I +1 II +1 I -1 +1 I -1 -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 -1 II -1 I +1 -1 I +1 +1 -1 I +1 +1 +1 Min Min Max Max +1 I -1 -1 -1 -1 +1 I -1 -1 -1 -1 If we know that the only two outcomes are +1 and -1, What happens if we have more than just two +1 +1 Min Min what branches do we not need to explore when outcomes? minimax backtracks? 32 Pruning Intuition (General Case) Pruning Example MAX A MAX MIN B C D The max player will never ≤1 MIN 5 choose the right subtree once it knows that it is upper bounded by 1 3 12 8 2 x y 14 5 2 MINIMAX-VALUE(root) 5 10 1 = max(min(3,12,8),min(2,x,y),min(14,5,2)) Suppose we just went down this = max(3,min(2,x,y),2) branch. We know that the minimax = max(3,z,2) where z ≤ 2 value of its parent will be ≤ 1 = 3 33 34 ALPHA-BETA Pseudocode Pruning Intuition function ALPHA-BETA-SEARCH( state ) returns an action Remember that minimax search is DFS. inputs : state , current state in game At any one time, we only have to consider the nodes along a single path in the v ← MAX -VALUE( state , - ∞, +∞) tree return the action in SUCCESSORS( state ) with value v In general, let: function MAX-VALUE( state ,  ,  ) returns a utility value  = highest minimax value of all of the MAX player’s choices expanded on • inputs : state , current state in game current path (best score for MAX so far)  , the value of the best alternative for MAX along the path to state •  = lowest minimax value of all of the MIN player’s choices expanded on  , the value of the best alternative for MIN along the path to state current path (best score for MIN so far) If at a MIN player node, prune if minimax value of node ≤  • if TERMINAL-TEST( state ) then return UTILITY( state ) If at a MAX player node, prune if minimax value of node ≥  • v ← - ∞ for a , s in SUCCESSORS( state ) do v ← MAX( v , MIN-VALUE(s,  ,  )) if v ≥  then return v  ← MAX(  , v) return v 35 36 6

Games we will consider Deterministic Discrete states and decisions - PDF document

Games we will consider Deterministic Discrete states and decisions CS 331: Artificial Intelligence Finite number of states and decisions Adversarial Search Perfect information i.e. fully observable Two agents whose actions

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Tom Nichols VP PC Games, North America Aeria Games & Entertainment Agenda Aeria Games?

26 July | 4 August 2 3 2019 | European Masters Games | Torino 2019 | European Masters Games |

The Ancient Olympics The Olympic Games are thought to have started in 776BC in Greece. The Games

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian Games 1 So far Up to

Dynamic Games & Cartels Johan.Stennek@Economics.gu.se 1 Dynamic Games 2 Dynamic Games

CS325 Artificial Intelligence Ch. 5, Games! Cengiz Gnay, Emory Univ. vs. Spring 2013 Gnay

Welcome to the Course! Professor Chris Callison-Burch Welcome to CIS 421/521 o Professor Chris

GASLIGHT: LANTERN SLIDES FROM THE NINETEENTH CENTURY Download Free Author: Isabel Fargo Cole,

Articulated Meshes Eric Landreneau Scott Schaefer Texas A&M University Introduction

Retail Market Review Implementation Project launch 20 March 2018 C/18/5593 Sarah McDowell A/g

CS 486/686 Artificial Intelligence Lecture 1: May 1, 2017 University of Waterloo 1 CS486/686

Artificial Intelligence Chapter 1 Chapter 1 1 Outline What is AI? A brief history

Artificial Intelligence: Machine Learning and Pattern Recognition University of Venice, Italy

Cisco ASA Provider The Cisco ASA provider is used to interact with Cisco ASA hardware devices or

Games we will consider Deterministic Discrete states and decisions - PDF document

Games we will consider Deterministic Discrete states and decisions CS 331: Artificial Intelligence Finite number of states and decisions Adversarial Search Perfect information i.e. fully observable Two agents whose actions

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Tom Nichols VP PC Games, North America Aeria Games &amp; Entertainment Agenda Aeria Games?

26 July | 4 August 2 3 2019 | European Masters Games | Torino 2019 | European Masters Games |

The Ancient Olympics The Olympic Games are thought to have started in 776BC in Greece. The Games

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian Games 1 So far Up to

Dynamic Games &amp; Cartels Johan.Stennek@Economics.gu.se 1 Dynamic Games 2 Dynamic Games

CS325 Artificial Intelligence Ch. 5, Games! Cengiz Gnay, Emory Univ. vs. Spring 2013 Gnay

Welcome to the Course! Professor Chris Callison-Burch Welcome to CIS 421/521 o Professor Chris

GASLIGHT: LANTERN SLIDES FROM THE NINETEENTH CENTURY Download Free Author: Isabel Fargo Cole,

Articulated Meshes Eric Landreneau Scott Schaefer Texas A&amp;M University Introduction

Retail Market Review Implementation Project launch 20 March 2018 C/18/5593 Sarah McDowell A/g

CS 486/686 Artificial Intelligence Lecture 1: May 1, 2017 University of Waterloo 1 CS486/686

Artificial Intelligence Chapter 1 Chapter 1 1 Outline What is AI? A brief history

Artificial Intelligence: Machine Learning and Pattern Recognition University of Venice, Italy

Cisco ASA Provider The Cisco ASA provider is used to interact with Cisco ASA hardware devices or

Tom Nichols VP PC Games, North America Aeria Games & Entertainment Agenda Aeria Games?

Dynamic Games & Cartels Johan.Stennek@Economics.gu.se 1 Dynamic Games 2 Dynamic Games

Articulated Meshes Eric Landreneau Scott Schaefer Texas A&M University Introduction