extensive games with perfect information a mini tutorial
play

Extensive Games with Perfect Information A Mini Tutorial Krzysztof - PowerPoint PPT Presentation

Extensive Games with Perfect Information A Mini Tutorial Krzysztof R. Apt (so not Krzystof and definitely not Krystof) CWI, Amsterdam, the Netherlands , University of Amsterdam Extensive Games withPerfect InformationA Mini Tutorial p. 1/29


  1. Extensive Games with Perfect Information A Mini Tutorial Krzysztof R. Apt (so not Krzystof and definitely not Krystof) CWI, Amsterdam, the Netherlands , University of Amsterdam Extensive Games withPerfect InformationA Mini Tutorial – p. 1/29

  2. Strategic Games: Definition Strategic game for n ≥ 2 players: (possibly infinite) set S i of strategies, payoff function p i : S 1 × · · · × S n → R , for each player i . Basic assumptions: players choose their strategies simultaneously, each player is rational: his objective is to maximize his payoff, players have common knowledge of the game and of each others’ rationality. Extensive Games withPerfect InformationA Mini Tutorial – p. 2/29

  3. Three Examples Prisoner’s Dilemma C D 2 , 2 0 , 3 C 3 , 0 1 , 1 D The Battle of the Sexes F B 2 , 1 0 , 0 F 0 , 0 1 , 2 B Matching Pennies H T 1 , − 1 − 1 , 1 H − 1 , 1 1 , − 1 T Extensive Games with Perfect Information A Mini Tutorial – p. 3/29

  4. Nash Equilibrium Notation: G := ( S 1 , . . ., S n , p 1 , . . ., p n ) . Notation: s i , s ′ i ∈ S i , s, s ′ , ( s i , s − i ) ∈ S 1 × · · · × S n . s i is a best response to s − i if ∀ s ′ i ∈ S i p i ( s i , s − i ) ≥ p i ( s ′ i , s − i ) . s is a Nash equilibrium if ∀ i s i is a best response to s − i : ∀ i ∈ { 1 , . . ., n } ∀ s ′ i ∈ S i p i ( s i , s − i ) ≥ p i ( s ′ i , s − i ) . Intuition: In a Nash equilibrium no player can gain by unilaterally switching to another strategy. Extensive Games with Perfect Information A Mini Tutorial – p. 4/29

  5. Nash Equilibrium 1 Nash equilibrium Prisoner’s Dilemma: C D 2 , 2 0 , 3 C 3 , 0 1 , 1 D 2 Nash equilibria The Battle of the Sexes: F B 2 , 1 0 , 0 F 0 , 0 1 , 2 B no Nash equlibrium Matching Pennies: H T 1 , − 1 − 1 , 1 H − 1 , 1 1 , − 1 T Extensive Games with Perfect Information A Mini Tutorial – p. 5/29

  6. Example 1: Prisoner’s Dilemma 1 C D 2 2 C D C D (2,2) (0,3) (3,0) (1,1) Extensive Games with Perfect Information A Mini Tutorial – p. 6/29

  7. Example 2: Battle of the Sexes 1 F B 2 2 F B F B (2,1) (0,0) (0,0) (1,2) Extensive Games with Perfect Information A Mini Tutorial – p. 7/29

  8. Example 3: Matching Pennies 1 H T 2 2 H T H T (1,-1) (-1,1) (-1,1) (1,-1) Extensive Games with Perfect Information A Mini Tutorial – p. 8/29

  9. Discussion These are examples of two-player games with two stages. In general there may be more players and more stages. We limit ourselves to the games with finitely many stages (games with finite horizon) and such that at each stage exactly one player proceeds. Note. At each stage a player can have infinitely many choices. We assume here perfect information: each player knows the previous moves. Extensive Games with Perfect Information A Mini Tutorial – p. 9/29

  10. Extensive Game: Definition Extensive game for n ≥ 1 players: game tree: a finite depth tree T := ( V, E ) with a turn function D : V \ Z → { 1 , . . ., n } , where Z is the set of leaves of T , outcome function o i : Z → R , for each player i . We denote it by ( T, D, o 1 , . . ., o n ) . Given v ∈ V \ Z we call { w | ( v, w ) ∈ E } the set of actions available to player D ( v ) at node v . Sometimes we identify the actions with the labels put on the edges. Extensive Games with Perfect Information A Mini Tutorial – p. 10/29

  11. Strategies Consider an extensive game EG := ( T, D, o 1 , . . ., o n ) . Let N i := { v ∈ V | D ( v ) = i } . N i is the set of nodes at which player i takes an action. Strategy for player i : s i : N i → V , such that for all v ∈ N i , ( v, s i ( v )) ∈ E . Joint strategy: s = ( s 1 , . . ., s n ) . It assigns a unique edge to every node in V \ Z . To each joint strategy s there corresponds a finite path path ( s ) := ( v 1 , . . ., v h ) in T defined inductively: v 1 is the root of T , if v k �∈ Z , then v k +1 := s i ( v k ) , where D ( v k ) = i . When each player i selects s i we call ( o 1 ( z ) , . . ., o n ( z )) , where z is the last element of path ( s ) , the outcome of EG . Extensive Games with Perfect Information A Mini Tutorial – p. 11/29

  12. Example of Strategies: Matching Pennies 1 H T 2 2 H T H T (1,-1) (-1,1) (-1,1) (1,-1) Strategies for player 1: H, T. Strategies for player 2: HH, HT, TH, TT. Thick lines correspond with the joint strategy (T,HH). Extensive Games with Perfect Information A Mini Tutorial – p. 12/29

  13. Strategic Forms With each extensive game EG := ( T, D, o 1 , . . ., o n ) we associate a strategic game G := ( S 1 , . . ., S n , p 1 , . . ., p n ) defined as follows: S i is the set of strategies of player i in EG , p i ( s ) := o i ( z ) , where z is the last element of path ( s ) . G is called the strategic form of EG . s is called a Nash equilibrium of EG if it is a Nash equilibrium of G . Extensive Games with Perfect Information A Mini Tutorial – p. 13/29

  14. Example: Matching Pennies 1 H T 2 2 H T H T (1,-1) (-1,1) (-1,1) (1,-1) Strategic form HH HT TH TT 1 , − 1 1 , − 1 − 1 , 1 − 1 , 1 H − 1 , 1 1 , − 1 − 1 , 1 1 , − 1 T Note. Two Nash equilibria: ( H, TH ) and ( T, TH ) . Extensive Games with Perfect Information A Mini Tutorial – p. 14/29

  15. Win or Lose Games A two-player extensive game is called a win or lose game if the only possible outcomes are (1 , − 1) and ( − 1 , 1) . s i is called a winning strategy of player i in a win or lose game EG if ∀ s − i ∈ S − i p i ( s i , s − i ) = 1 , where ( S 1 , S 2 , p 1 , p 2 ) is the strategic form of EG . Theorem (Zermelo, 1913) In every win or lose game one of the players has a winning strategy. Extensive Games with Perfect Information A Mini Tutorial – p. 15/29

  16. Proof of Zermelo’s Theorem Theorem In every win or lose game one of the players has a winning strategy. We can assume that the players alternate their moves. We can extend all the paths in the game so that all paths in T are of the same depth, say 2 k . Let W denote the sentence “player 1 wins after 2 k stages”. Then the formula φ 1 := ∃ x 1 ∀ y 1 . . . ∃ x k ∀ y k W denotes “player 1 has a winning strategy” and φ 2 := ∀ x 1 ∃ y 1 . . . ∀ x k ∃ y k ¬ W denotes “player 2 has a winning strategy”. But ¬ φ 1 ≡ φ 2 , i.e., φ 1 ∨ φ 2 holds. Extensive Games with Perfect Information A Mini Tutorial – p. 16/29

  17. Example: Ultimatum Game Player 1 claims x ∈ { 0 , 1 , . . ., 100 } . Player 2 accepts - the outcome is then ( x, 100 − x ) , or rejects - the outcome is then (0 , 0) . For each x ∈ { 0 , 1 , . . ., 100 } the root 1 has the subtree 1 x 2 A R ( x, 100 − x ) (0 , 0) Extensive Games with Perfect Information A Mini Tutorial – p. 17/29

  18. Nash Equilibria in the Ultimatum Game 1 x 2 A R ( x, 100 − x ) (0 , 0) Note. For each x ∈ { 0 , 1 , . . ., 100 } there is a Nash equilibrium with the outcome ( x, 100 − x ) . Proof. Take x ∗ ∈ { 0 , 1 , . . ., 100 } . Strategy for player 1: x ∗ . Strategy for player 2: if x ≤ x ∗ then A else R fi . This is a Nash equilibrium with the outcome ( x ∗ , 100 − x ∗ ) . Extensive Games with Perfect Information A Mini Tutorial – p. 18/29

  19. Example: Ultimatum Game, ctd Illustration. Strategy for player 1: x ∗ . Strategy for player 2: s 2 := if x ≤ x ∗ then A else R fi . Consider two deviations of player 1, x 1 < x ∗ < x 2 . s 2 . . . . . . . . . x 1 , 100 − x 1 x 1 . . . . . . x ∗ x ∗ 1 , 100 − x ∗ . . . . . . 1 0 , 0 x 2 . . . . . . . . . . . . . . . Conclusion. The notion of a Nash equilibrium is not informative here. Extensive Games with Perfect Information A Mini Tutorial – p. 19/29

  20. Subgames Consider EG := ( T, D, o 1 , . . ., o n ) . We define the subgame of EG rooted at node v of T , EG v , as expected. Note. Each strategy s i of player i in EG uniquely determines his strategy s v i in EG v . ( s 1 , . . ., s n ) is called a subgame perfect equilibrium in EG if for each node v of T ( s v 1 , . . ., s v n ) is a Nash equilibrium in EG v . Informally: s is subgame perfect equilibrium in EG if it induces a Nash equilibrium in every subgame of EG . Extensive Games with Perfect Information A Mini Tutorial – p. 20/29

  21. Backward Induction Given a tree ( V, E ) and v ∈ V , let desc ( v ) := { w | ( v, w ) ∈ E } . Fix a finite extensive game EG := (( V, E ) , D, o 1 , . . ., o n ) . Backward induction algorithm while | V | > 1 do choose v ∈ V such that all its descendants are leaves; i := D ( v ) ; choose w ∈ desc ( v ) such that o i ( w ) is maximal; s i ( v ) := w ; for j ∈ { 1 , . . ., n } do o j ( v ) := o j ( w ) od ; V := V \ desc ( v ) ; E := E ∩ ( V × V ) ; od Note. This process generates a set of joint strategies. Multiple joint strategies may arise due to the second choose statement. Extensive Games with Perfect Information A Mini Tutorial – p. 21/29

  22. Kuhn and Selten Theorems Theorem (Kuhn, 1950) Every finite extensive game (with perfect information) has a Nash equilibrium. Theorem (Selten, 1965) Every finite extensive game (with perfect information) has a subgame perfect equilibrium. Proof. A stronger claim holds: A joint strategy is a subgame perfect equilibrium iff it can be generated by the backward induction algorithm. Extensive Games with Perfect Information A Mini Tutorial – p. 22/29

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend