
This Segment: Computational game theory
Lecture 1: Game representations, solution concepts and complexity
Tuomas Sandholm
Computer Science Department, Carnegie Mellon University

The heart of the problem
• In a 1-agent setting, the agent’s expected-utility-maximizing strategy is well-defined
• But in a multiagent system, the outcome may also depend on others’ strategies

Terminology
• Agent = player
• Action = move = choice that an agent can make at a point in the game
• Strategy s_i = mapping from history (to the extent that agent i can distinguish it) to actions
• Strategy set S_i = strategies available to agent i
• Strategy profile (s_1, s_2, ..., s_|A|) = one strategy for each agent
• An agent’s utility is determined after each agent (including nature, which is used to model uncertainty) has chosen its strategy and the game has been played: u_i = u_i(s_1, s_2, ..., s_|A|)

Game representations
• Extensive form: player 1 moves first (Up or Down), then player 2 moves (Left or Right)
  – Up, Left: 1, 2
  – Up, Right: 3, 4
  – Down, Left: 5, 6
  – Down, Right: 7, 8
• Matrix form (aka normal form aka strategic form): each of player 2’s strategies gives its reply to Up, then its reply to Down

                      player 2’s strategy
player 1’s strategy   Left, Left   Left, Right   Right, Left   Right, Right
Up                    1, 2         1, 2          3, 4          3, 4
Down                  5, 6         7, 8          5, 6          7, 8

• Potential combinatorial explosion
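The conversion from extensive form to matrix form can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `leaves` and `to_matrix_form` are hypothetical, and the tree is assumed to be the two-stage game from the slide, where player 2 observes player 1’s move before replying. Each strategy of player 2 is one reply per player 1 move, which is where the combinatorial explosion comes from.

```python
from itertools import product

# Leaf payoffs of the slide's two-stage game:
# (player 1 move, player 2 reply) -> (u1, u2)
leaves = {
    ("Up", "Left"): (1, 2),
    ("Up", "Right"): (3, 4),
    ("Down", "Left"): (5, 6),
    ("Down", "Right"): (7, 8),
}
p1_moves = ["Up", "Down"]
p2_moves = ["Left", "Right"]


def to_matrix_form(leaves, p1_moves, p2_moves):
    """Return {(s1, s2): payoffs}; s2 holds player 2's reply to each player 1 move."""
    matrix = {}
    # A strategy for player 2 is one reply per player 1 move,
    # so there are len(p2_moves) ** len(p1_moves) of them.
    for s2 in product(p2_moves, repeat=len(p1_moves)):
        reply = dict(zip(p1_moves, s2))
        for s1 in p1_moves:
            matrix[(s1, s2)] = leaves[(s1, reply[s1])]
    return matrix


matrix = to_matrix_form(leaves, p1_moves, p2_moves)
p2_strategies = list(product(p2_moves, repeat=len(p1_moves)))
for s1 in p1_moves:
    print(s1, [matrix[(s1, s2)] for s2 in p2_strategies])
```

With two moves for each player the matrix has only 2 × 4 cells, but the number of columns grows exponentially in the number of decision points of player 2.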

Dominant strategy equilibrium
• Best response s_i*: for all s_i’, u_i(s_i*, s_-i) ≥ u_i(s_i’, s_-i)
• Dominant strategy s_i*: s_i* is a best response for all s_-i
  – Does not always exist
  – Inferior strategies are called “dominated”
• Dominant strategy equilibrium is a strategy profile where each agent has picked its dominant strategy
  – Does not always exist
  – Requires no counterspeculation
• Example (Prisoner’s Dilemma): Pareto optimal? Social welfare maximizing?

             cooperate   defect
cooperate    3, 3        0, 5
defect       5, 0        1, 1
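The definitions above can be checked mechanically on the Prisoner’s Dilemma payoffs from the slide. The following is a minimal Python sketch; the helper names (`utility`, `dominant_strategies`, `is_pareto_optimal`, `social_welfare`) are hypothetical and chosen only for illustration, and social welfare is assumed to mean the sum of the two utilities.

```python
# Prisoner's Dilemma payoffs from the slide:
# (row action, column action) -> (u_row, u_col)
ACTIONS = ["cooperate", "defect"]
PD = {
    ("cooperate", "cooperate"): (3, 3),
    ("cooperate", "defect"): (0, 5),
    ("defect", "cooperate"): (5, 0),
    ("defect", "defect"): (1, 1),
}


def utility(game, player, own, other):
    """Utility of `player` (0 = row, 1 = column) when it plays `own`."""
    profile = (own, other) if player == 0 else (other, own)
    return game[profile][player]


def dominant_strategies(game, actions, player):
    """Actions that are a best response to every action of the opponent."""
    return [
        a for a in actions
        if all(utility(game, player, a, o) >= utility(game, player, b, o)
               for o in actions for b in actions)
    ]


def is_pareto_optimal(game, profile):
    """True if no other outcome is weakly better for both and different."""
    u = game[profile]
    return not any(v[0] >= u[0] and v[1] >= u[1] and v != u
                   for v in game.values())


def social_welfare(game, profile):
    """Assumption: welfare is the plain sum of the two utilities."""
    return sum(game[profile])


equilibrium = tuple(dominant_strategies(PD, ACTIONS, p)[0] for p in (0, 1))
print(equilibrium, is_pareto_optimal(PD, equilibrium))
print(max(PD, key=lambda prof: social_welfare(PD, prof)))
```

Under these assumptions the sketch answers the two questions on the slide: the dominant strategy equilibrium (defect, defect) is neither Pareto optimal nor social welfare maximizing, while (cooperate, cooperate) is both.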

Nash equilibrium [Nash50]
• Sometimes an agent’s best response depends on others’ strategies: a dominant strategy does not exist
• A strategy profile is a Nash equilibrium if no player has an incentive to deviate from his strategy given that the others do not deviate: for every agent i, u_i(s_i*, s_-i*) ≥ u_i(s_i’, s_-i*) for all s_i’
  – Dominant strategy equilibria are Nash equilibria, but not vice versa
  – Defect-defect is the only Nash eq. in Prisoner’s Dilemma
  – Battle of the Sexes game
    • Has no dominant strategy equilibria

                Woman
                boxing   ballet
Man   boxing    2, 1     0, 0
      ballet    0, 0     1, 2
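For small matrix games, pure strategy Nash equilibria can be found by checking every profile for profitable unilateral deviations. The sketch below does this for the two games named on the slide; the function name `pure_nash_equilibria` is hypothetical, and brute-force enumeration is just one simple approach, not a method the lecture prescribes.

```python
from itertools import product


def pure_nash_equilibria(game, row_actions, col_actions):
    """Profiles where neither player gains by deviating unilaterally."""
    equilibria = []
    for r, c in product(row_actions, col_actions):
        u_row, u_col = game[(r, c)]
        row_ok = all(game[(r2, c)][0] <= u_row for r2 in row_actions)
        col_ok = all(game[(r, c2)][1] <= u_col for c2 in col_actions)
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria


# Battle of the Sexes: (Man's action, Woman's action) -> (u_man, u_woman)
BOS = {
    ("boxing", "boxing"): (2, 1),
    ("boxing", "ballet"): (0, 0),
    ("ballet", "boxing"): (0, 0),
    ("ballet", "ballet"): (1, 2),
}
# Prisoner's Dilemma: (row action, column action) -> (u_row, u_col)
PD = {
    ("cooperate", "cooperate"): (3, 3),
    ("cooperate", "defect"): (0, 5),
    ("defect", "cooperate"): (5, 0),
    ("defect", "defect"): (1, 1),
}

bos_eq = pure_nash_equilibria(BOS, ["boxing", "ballet"], ["boxing", "ballet"])
pd_eq = pure_nash_equilibria(PD, ["cooperate", "defect"], ["cooperate", "defect"])
print(bos_eq)
print(pd_eq)
```

Battle of the Sexes yields two pure equilibria, which illustrates the non-uniqueness problem, whereas the Prisoner’s Dilemma yields only defect-defect.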

Criticisms of Nash equilibrium
• Not unique in all games, e.g. Battle of the Sexes
  – Approaches for addressing this problem
    • Refinements of the equilibrium concept
      – Choose the Nash equilibrium with highest welfare
      – Subgame perfection
      – …
    • Focal points
    • Mediation
    • Communication
    • Convention
    • Learning
• Does not exist in all games
  [Figure: 2×2 payoff matrix with entries 1, 0 and 0, 1]
• May be hard to compute

Existence of (pure strategy) Nash equilibria
• IF a game is finite
  – and at every point in the game, the agent whose turn it is to move knows what moves have been played so far
• THEN the game has a (pure strategy) Nash equilibrium
• (solvable by minimax search, at least as long as ties are ruled out)

Rock-scissors-paper game Sequential moves

Rock-scissors-paper game Simultaneous moves

Mixed strategy Nash equilibrium
• Mixed strategy = agent’s chosen probability distribution over pure strategies from its strategy set
• Each agent has a best response strategy and beliefs (consistent with each other)
• Symmetric mixed strategy Nash eq: each player plays each pure strategy with probability 1/3
• In mixed strategy equilibrium, each strategy that occurs in the mix of agent i has equal expected utility to i
• Information set: the mover does not know which node of the set she is in

Payoffs (agent 1, agent 2) at the leaves of the game tree:

                        move of agent 2
move of agent 1   rock     scissors   paper
rock              0, 0     1, -1      -1, 1
scissors          -1, 1    0, 0       1, -1
paper             1, -1    -1, 1      0, 0
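The indifference property of a mixed strategy equilibrium can be verified numerically for rock-scissors-paper. The Python sketch below uses exact fractions; the names `u1` and `expected_utility`, and the lopsided test mix, are assumptions introduced for illustration only.

```python
from fractions import Fraction

MOVES = ["rock", "scissors", "paper"]
# Each move beats exactly one other move (payoff 1 to the winner, -1 to the loser).
BEATS = {"rock": "scissors", "scissors": "paper", "paper": "rock"}


def u1(a, b):
    """Payoff to the player choosing `a` against an opponent choosing `b`."""
    if a == b:
        return 0
    return 1 if BEATS[a] == b else -1


def expected_utility(a, opponent_mix):
    """Expected payoff of pure strategy `a` against a mixed strategy."""
    return sum(p * u1(a, b) for b, p in opponent_mix.items())


# Against the uniform mix, every pure strategy earns the same expected payoff,
# so mixing uniformly is itself a best response.
uniform = {m: Fraction(1, 3) for m in MOVES}
values = {a: expected_utility(a, uniform) for a in MOVES}
print(values)

# Against a lopsided (hypothetical) mix the indifference breaks down.
lopsided = {"rock": Fraction(1, 2), "scissors": Fraction(1, 4), "paper": Fraction(1, 4)}
best = max(MOVES, key=lambda a: expected_utility(a, lopsided))
print(best, expected_utility(best, lopsided))
```

Because the game is symmetric, the same check applies to the other player, so the uniform profile is a Nash equilibrium; against the lopsided mix a single pure strategy does strictly better than the rest.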

Existence of mixed strategy Nash equilibria
• Every game with finitely many players and finitely many strategies has at least one Nash equilibrium if we admit mixed strategy equilibria as well as pure strategy equilibria [Nash 50]
  – (Proof is based on Kakutani’s fixed point theorem)

Subgame perfect equilibrium & credible threats [Selten 72]
• Proper subgame = subtree (of the game tree) whose root is alone in its information set
• Subgame perfect equilibrium = strategy profile that is in Nash equilibrium in every proper subgame (including the root), whether or not that subgame is reached along the equilibrium path of play
• E.g. Cuban missile crisis; payoffs are (Khrushchev, Kennedy)
  – Khrushchev: Arm or Retract
    • Retract: -1, 1
    • Arm, then Kennedy: Nuke or Fold
      – Nuke: -100, -100
      – Fold: 10, -10
• Pure strategy Nash equilibria: (Arm, Fold), (Retract, Nuke)
• Pure strategy subgame perfect equilibria: (Arm, Fold)
• Conclusion: Kennedy’s Nuke threat was not credible
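In a finite game of perfect information, a subgame perfect equilibrium can be computed by backward induction: solve the deepest subgames first and propagate the resulting payoffs upward. The sketch below applies this to the Cuban missile crisis tree. The tree encoding, the node-naming scheme, and the function `backward_induction` are hypothetical illustration choices, and ties are broken by keeping the first action seen (the example has no ties).

```python
# A leaf is a payoff tuple (Khrushchev, Kennedy).
# An internal node is (index of the player to move, {action: subtree}).
TREE = (0, {
    "Arm": (1, {"Nuke": (-100, -100), "Fold": (10, -10)}),
    "Retract": (-1, 1),
})


def is_leaf(node):
    """Internal nodes carry a dict of children in position 1; leaves carry a number."""
    return not isinstance(node[1], dict)


def backward_induction(node, name="root", plan=None):
    """Return (payoffs reached, {node name: chosen action}) for the subtree."""
    plan = {} if plan is None else plan
    if is_leaf(node):
        return node, plan
    player, children = node
    best_action, best_payoffs = None, None
    for action, child in children.items():
        payoffs, _ = backward_induction(child, name + "/" + action, plan)
        # The mover picks the action that maximizes its own payoff.
        if best_payoffs is None or payoffs[player] > best_payoffs[player]:
            best_action, best_payoffs = action, payoffs
    plan[name] = best_action
    return best_payoffs, plan


payoffs, plan = backward_induction(TREE)
print(payoffs, plan)
```

The computed plan has Kennedy folding in the subgame after Arm, so Khrushchev arms. The Nash equilibrium (Retract, Nuke) is excluded because Nuke is not optimal in the subgame that follows Arm.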

Different solution concepts
[Figure: diagram relating solution concepts along two axes, “Strength” and “Strength against collusion”. Concepts shown: Nash eq, Dominant strategy eq, Subgame perfect eq, Sequential eq, Bayes-Nash eq, Perfect Bayesian eq, Coalition-Proof Nash eq, Strong Nash eq.]
There are other equilibrium refinements too (see, e.g., Wikipedia).

Definition of a Bayesian game
• N is the set of players.
• Ω is the set of the states of nature.
  – For instance, in a card game, it can be any order of the cards.
• A_i is the set of actions for player i. A = A_1 × A_2 × … × A_n
• T_i is the type set of player i. For each state of nature, the game will have different types of players (one type per player).
  – For instance, in a car selling game, it will be how much the player values the car
• C_i ⊆ A_i × T_i defines the available actions for player i of some type in T_i.
• u_i : Ω × A → R is the payoff function for player i.
• p_i is the probability distribution over Ω for each player i; that is to say, each player has a different view of the probability distribution over the states of nature. In the game, they never know the exact state of nature.

Solution concepts for Bayesian games
• A (Bayesian) Nash equilibrium is a strategy profile, together with beliefs specified for each player about the types of the other players, that maximizes the expected utility for each player given their beliefs about the other players’ types and given the strategies played by the other players.
• Perfect Bayesian equilibrium (PBE)
  – Players place beliefs on nodes occurring in their information sets
  – A belief system is consistent for a given strategy profile if the probability assigned by the system to every node is computed as the probability of that node being reached given the strategy profile, i.e., by Bayes’ rule.
  – A strategy profile is sequentially rational at a particular information set for a particular belief system if the expected utility of the player whose information set it is is maximal given the strategies played by the other players.
    • A strategy profile is sequentially rational for a particular belief system if it satisfies the above for every information set.
  – A PBE is a strategy profile and a belief system such that the strategies are sequentially rational given the belief system and the belief system is consistent, wherever possible, given the strategy profile.
    • The “wherever possible” clause is necessary: some information sets might be reached with zero probability given the strategy profile; hence Bayes’ rule cannot be employed to calculate the probability of nodes in those sets. Such information sets are said to be off the equilibrium path, and any beliefs can be assigned to them.
  – Sequential equilibrium is a refinement of PBE that specifies constraints on the beliefs in such zero-probability information sets. Strategies and beliefs should be a limit point of a sequence of totally mixed strategy profiles and associated sensible (in the PBE sense) beliefs.
