CS 886: Game-theoretic methods for computer science Normal Form - PowerPoint PPT Presentation

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies CS 886: Game-theoretic methods for computer science Normal Form Games Kate Larson Computer Science University of Waterloo Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Outline Review 1 Normal Form Game Examples Strategies Nash Equilibria 2 Dominant and Dominated Strategies 3 Maxmin and Minmax Strategies 4 Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Normal Form A normal form game is defined by Finite set of agents (or players) N , | N | = n Each agent i has an action space A i A i is non-empty and finite Outcomes are defined by action profiles ( a = ( a 1 , . . . , a n ) where a i is the action taken by agent i Each agent has a utility function u i : A 1 × . . . × A n �→ R Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Examples Prisoners’ Dilemma Pure coordination game ∀ action profiles a ∈ A 1 × . . . × A n and ∀ i , j , C D u i ( a ) = u j ( a ) . C a,a b,c L R D c,b d,d L 1,1 0,0 R 0,0 1,1 c > a > d > b Agents do not have conflicting interests. There sole challenge is to coordinate on an action which is good for all. Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Zero-sum games ∀ a ∈ A 1 × A 2 , u 1 ( a ) + u 2 ( a ) = 0. That is, one player gains at the other player’s expense. Matching Pennies H T H T H 1 -1 H 1,-1 -1, 1 T -1 1 T -1,1 1,-1 Given the utility of one agent, the other’s utility is known. Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies More Examples Most games have elements of both cooperation and competition. BoS Hawk-Dove H S D H H 2,1 0,0 D 3,3 1,4 S 0,0 1,2 H 4,1 0,0 Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Strategies Notation: Given set X , let ∆ X be the set of all probability distributions over X . Definition Given a normal form game, the set of mixed strategies for agent i is S i = ∆ A i The set of mixed strategy profiles is S = S 1 × . . . × S n . Definition A strategy s i is a probability distribution over A i . s i ( a i ) is the probability action a i will be played by mixed strategy s i . Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Strategies Definition The support of a mixed strategy s i is { a i | s i ( a i ) > 0 } Definition A pure strategy s i is a strategy such that the support has size 1, i.e. |{ a i | s i ( a i ) > 0 }| = 1 A pure strategy plays a single action with probability 1. Kate Larson CS 886

Review Normal Form Game Nash Equilibria Examples Dominant and Dominated Strategies Strategies Maxmin and Minmax Strategies Expected Utility The expected utility of agent i given strategy profile s is � u i ( a )Π n u i ( s ) = j = 1 s j ( a j ) a ∈ A Example Given strategy profile s = (( 1 2 , 1 2 ) , ( 1 10 , 9 10 )) C D C -1,-1 -4,0 u 1 = − 1 ( 1 2 )( 1 10 ) − 4 ( 1 2 )( 9 10 ) − 3 ( 1 2 )( 9 10 ) = − 3 . 2 D 0, -4 -3,-3 u 2 = − 1 ( 1 2 )( 1 10 ) − 4 ( 1 2 )( 1 10 ) − 3 ( 1 2 )( 9 10 ) = − 1 . 6 Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Best-response Given a game, what strategy should an agent choose? We first consider only pure strategies. Definition Given a − i , the best-response for agent i is a i ∈ A i such that u i ( a ∗ i , a − i ) ≥ u i ( a ′ i , a − i ) ∀ a ′ i ∈ A i Note that the best response may not be unique. A best-response set is B i ( a − i ) = { a i ∈ A i | u i ( a i , a − i ) ≥ u i ( a ′ i , a − i ) ∀ a ′ i ∈ A i } Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Nash Equilibrium Definition A profile a ∗ is a Nash equilibrium if ∀ i, a ∗ i is a best response to a ∗ − i . That is ∀ iu i ( a ∗ i , a ∗ − i ) ≥ u i ( a ′ i , a ∗ − i ) ∀ a ′ i ∈ A i Equivalently, a ∗ is a Nash equilibrium if ∀ i a ∗ i ∈ B ( a ∗ − i ) Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Examples PD BoS Matching Pennies C D H T H T C -1,-1 -4,0 H 2,1 0,0 H 1,-1 -1,1 D 0,-4 -3,-3 T 0,0 1,2 T -1,1 1,-1 Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Nash Equilibria We need to extend the definition of a Nash equilibrium. Strategy profile s ∗ is a Nash equilibrium is for all i u i ( s ∗ i , s ∗ − i ) ≥ u i ( s ′ i , s ∗ − i ) ∀ s ′ i ∈ S i Similarly, a best-response set is B ( s − i ) = { s i ∈ S i | u i ( s i , s − i ) ≥ u i ( s ′ i , s − i ) ∀ s ′ i ∈ S i } Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Examples Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Characterization of Mixed Nash Equilibria s ∗ is a (mixed) Nash equilibrium if and only if the expected payoff, given s ∗ − i , to every action to which s ∗ i assigns positive probability is the same, and the expected payoff, given s ∗ − i to every action to which s ∗ i assigns zero probability is at most the expected payoff to any action to which s ∗ i assigns positive probability. Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Existence Theorem (Nash, 1950) Every finite normal form game has a Nash equilibrium. Proof: Beyond scope of course. Basic idea: Define set X to be all mixed strategy profiles. Show that it has nice properties (compact and convex). Define f : X �→ 2 X to be the best-response set function, i.e. given s , f ( s ) is the set all strategy profiles s ′ = ( s ′ 1 , . . . , s ′ n ) such that s ′ i is i ’s best response to s ′ − i . Show that f satisfies required properties of a fixed point theorem (Kakutani’s or Brouwer’s). Then, f has a fixed point, i.e. there exists s such that f ( s ) = s . This s is mutual best-response – NE! Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Interpretations of Nash Equilibria Consequence of rational inference Focal point Self-enforcing agreement Stable social convention ... Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Finding Nash Equilibria Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Dominant and Dominated Strategies For the time being, let us restrict ourselves to pure strategies. Definition Strategy s i is a strictly dominant strategy if for all s ′ i � = s i and for all s − i u i ( s i , s − i ) > u i ( s ′ i , s − i ) Prisoner’s Dilemma C D Dominant-strategy equilibria C -1,-1 -4,0 D 0, -4 -3,-3 Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Dominated Strategies Definition A strategy s i is strictly dominated if there exists another strategy s ′ i such that for all s − i u i ( s ′ i , s − i ) > u i ( s i , s − i ) Definition A strategy s i is weakly dominated if there exists another strategy s ′ i such that for all s − i u i ( s ′ i , s − i ) ≥ u i ( s i , s − i ) with strict inequality for some s − i . Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Example L R L R U 1,-1 -1,1 U 5,1 4,0 M -1,1 1,-1 M 6,0 3,1 D -2,5 -3,2 D 6,4 4,4 D is strictly dominated U and M are weakly dominated Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Iterated Deletion of Strictly Dominated Strategies Algorithm Let R i be the removed set of strategies for agent i R i = ∅ Loop Choose i and s i such that s i ∈ A i \ R i and there exists s ′ i such that u i ( s ′ i , s − i ) > u i ( s i , s − i ) ∀ s − i Add s i to R i Continue Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Example R C L U 3,-3 7,-7 15, -15 D 9,-9 8,-8 10,-10 Kate Larson CS 886

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies Some Results Theorem If a unique strategy profile s ∗ survives iterated deletion then it is a Nash equilibrium. Theorem If s ∗ is a Nash equilibrium then it survives iterated elimination. Weakly dominated strategies cause some problems. Kate Larson CS 886

CS 886: Game-theoretic methods for computer science Normal Form - PowerPoint PPT Presentation

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies CS 886: Game-theoretic methods for computer science Normal Form Games Kate Larson Computer Science University of Waterloo Kate Larson CS 886 Review

CS 886: Game-theoretic methods for computer science Extensive Form Games Kate Larson Computer

e-Bug Junior Game Junior Game Game Style Game Process Demo Game Mechanics and

e-Bug Senior Game Senior Game Game Style Game Process Demo Game Puzzles and

Game interoperability with functors functor AgsFun (structure Game : GAME) :> sig structure

Game Theoretic Pragmatics Michael Franke Preliminaries Game Theory Fundamentals Interpretation

Game Loops CIS 580 - Fundamentals of Game Programming Hangman Game Phases Game Loop

VIDEOGAMES ARE A MESS Ian Bogost WHAT IS A GAME? Is a game a system of rules, or is a game a

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Connect your device to application GAME ENGINE ON ANDROID Julian Chu Agenda We Love Game Why

Inductive general game playing Andrew Cropper, Richard Evans, and Mark Law General game playing

Incentives in Crowdsourcing: A Game-theoretic Approach ARPITA GHOSH Cornell University NIPS

A Game-Theoretic Approach to Network Security Mohammad Pirani and Henrik Sandberg Department of

Computing Game-Theoretic Solutions for Security Vincent Conitzer Dmytro Korzhyk Dmytro Korzhyk

Intoduction to the Fifth Workshop Game-Theoretic Probability and Related Topics Glenn Shafer 13

Derandomization in Game- Theoretic Probability Kenshi Miyabe, Meiji University, Japan (joint

Recursion Theoretic Results for the Game of Cops and Robbers on Graphs Shelley Stahl University

Preparing for the Worst but Hoping for the Best: Robust (Bayesian) Persuasion Piotr Dworczak

Bidding in First-Price Auctions Game Theory Course: Jackson, Leyton-Brown & Shoham Game

Can heavy neutrinos dominate Neutrinoless double beta decay? Jacobo Lpez-Pavn IPPP Durham

A Question of Loyalty: How to analyse loyalty rebates and discounts Moderator: Martin Mandorff,

Multiagent Systems: Spring 2006 Ulle Endriss Institute for Logic, Language and Computation

Avoiding Scheduler Subversion usin ing Scheduler-Cooperative Locks Yuvraj Patel, Leon Yang * ,

The Subspace Method for Diagnosing Network-Wide Traffic Anomalies Anukool Lakhina, Mark Crovella,

On Dominating Your Neighborhood Profitably

CS 886: Game-theoretic methods for computer science Normal Form - PowerPoint PPT Presentation

Review Nash Equilibria Dominant and Dominated Strategies Maxmin and Minmax Strategies CS 886: Game-theoretic methods for computer science Normal Form Games Kate Larson Computer Science University of Waterloo Kate Larson CS 886 Review

CS 886: Game-theoretic methods for computer science Extensive Form Games Kate Larson Computer

e-Bug Junior Game Junior Game Game Style Game Process Demo Game Mechanics and

e-Bug Senior Game Senior Game Game Style Game Process Demo Game Puzzles and

Game interoperability with functors functor AgsFun (structure Game : GAME) :&gt; sig structure

Game Theoretic Pragmatics Michael Franke Preliminaries Game Theory Fundamentals Interpretation

Game Loops CIS 580 - Fundamentals of Game Programming Hangman Game Phases Game Loop

VIDEOGAMES ARE A MESS Ian Bogost WHAT IS A GAME? Is a game a system of rules, or is a game a

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Connect your device to application GAME ENGINE ON ANDROID Julian Chu Agenda We Love Game Why

Inductive general game playing Andrew Cropper, Richard Evans, and Mark Law General game playing

Incentives in Crowdsourcing: A Game-theoretic Approach ARPITA GHOSH Cornell University NIPS

A Game-Theoretic Approach to Network Security Mohammad Pirani and Henrik Sandberg Department of

Computing Game-Theoretic Solutions for Security Vincent Conitzer Dmytro Korzhyk Dmytro Korzhyk

Intoduction to the Fifth Workshop Game-Theoretic Probability and Related Topics Glenn Shafer 13

Derandomization in Game- Theoretic Probability Kenshi Miyabe, Meiji University, Japan (joint

Recursion Theoretic Results for the Game of Cops and Robbers on Graphs Shelley Stahl University

Preparing for the Worst but Hoping for the Best: Robust (Bayesian) Persuasion Piotr Dworczak

Bidding in First-Price Auctions Game Theory Course: Jackson, Leyton-Brown &amp; Shoham Game

Can heavy neutrinos dominate Neutrinoless double beta decay? Jacobo Lpez-Pavn IPPP Durham

A Question of Loyalty: How to analyse loyalty rebates and discounts Moderator: Martin Mandorff,

Multiagent Systems: Spring 2006 Ulle Endriss Institute for Logic, Language and Computation

Avoiding Scheduler Subversion usin ing Scheduler-Cooperative Locks Yuvraj Patel, Leon Yang * ,

The Subspace Method for Diagnosing Network-Wide Traffic Anomalies Anukool Lakhina, Mark Crovella,

On Dominating Your Neighborhood Profitably

Game interoperability with functors functor AgsFun (structure Game : GAME) :> sig structure

Bidding in First-Price Auctions Game Theory Course: Jackson, Leyton-Brown & Shoham Game