Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian - PowerPoint PPT Presentation

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall’08 Bayesian Games 1

So far Up to this point, we have assumed that players know all relevant information about each other. Such games are known as games with complete information. CS286r Fall’08 Bayesian Games 2

Games with Incomplete Information Bayesian Games = Games with Incomplete Information Incomplete Information: Players have private information about something relevant to his decision making. ◮ Incomplete information introduces uncertainty about the game being played. Imperfect Information: Players do not perfectly observe the actions of other players or forget their own actions. We will see that Bayesian games can be represented as extensive-form games with imperfect information. CS286r Fall’08 Bayesian Games 3

Example 1: A Modified Prisoner’s Dilemma Game With probability λ , player 2 has the normal preferences as before (type I), while with probability (1 − λ ), player 2 hates to rat on his accomplice and pays a psychic penalty equal to 6 years in prison for confessing (type II). 1 − λ λ C D C D 5, 5 0, 8 5, 5 0, 2 C C 8, 0 1, 1 8, 0 1, -5 D D Type I Type II CS286r Fall’08 Bayesian Games 4

Simultaneous-Move Bayesian Games A simultaneous-move Bayesian game is ( N , A , Θ , F , u ) ◮ N = { 1 , ..., n } is the set of players ◮ A = { A 1 , A 2 , ..., A n } is the set of actions A i = { Cooperation, Defection } . ◮ Θ = { Θ 1 , Θ 2 , ..., Θ n } is the set of types. θ i ∈ Θ i is a realization of types for player i . Θ 2 = { I, II } . ◮ F : Θ → [0 , 1] is a joint probability distribution, according to which types of players are drawn p( θ 2 = type I) = λ ◮ u = { u 1 , u 2 , ..., u n } where u i : A × Θ → R is the utility function of player i Two assumptions ◮ All possible games have the same number of agents and the same action spaces for each agent ◮ Agents have common prior. The different beliefs of agents are posteriors. CS286r Fall’08 Bayesian Games 5

Imperfect-Information Extensive-Form Representation of Bayesian Games Add a player Nature who has a unique strategy of randomizing in a commonly known way. Nature 1 − λ λ 1 C D C D 2 2 C D C D C D C D (5, 5) (0, 8) (8,0) (1, 1) (5, 5) (0, 2) (8, 0) (1, -5) CS286r Fall’08 Bayesian Games 6

Strategies in Bayesian Games A pure strategy s i : Θ i → A i of player i is a mapping from every type player i could have to the action he would play if he had that type. Denote the set of pure strategies of player i as S i . S 1 = {{ C } , { D }} S 2 = {{ C if type I, C if type II } , { C if type I, D if type II } , { D if type I, C if type II } , { D if type I, D if type II }} A mixed strategy σ i : S i → [0 , 1] of player i is a distribution over his pure strategies. CS286r Fall’08 Bayesian Games 7

Best Response and Bayesian Nash Equilibrium We use pure strategies to illustrate the concepts. But they hold the same for mixed strategies. Player i ’s ex ante expected utility is � E θ [ u i ( s ( θ ) , θ )] = p( θ i ) E θ − i [ u i ( s ( θ ) , θ ) | θ i ] θ i ∈ Θ i Player i ’s best responses to s − i ( θ − i ) is BR i = arg max s i ( θ i ) ∈ S i E θ [ u i ( s i ( θ i ) , s − i ( θ − i ) , θ )] � � � = p( θ i ) arg max s i ( θ i ) ∈ S i E θ − i [ u i ( s i ( θ i ) , s − i ( θ − i ) , θ ) | θ i ] θ i ∈ Θ i A strategy profile s i ( θ i ) is a Bayesian Nash Equilibrium iif ∀ i s i ( θ i ) ∈ BR i . CS286r Fall’08 Bayesian Games 8

Bayesian Nash Equilibrium: Example 1 Playing D is a dominant strategy for type I player 2; playing C is a dominant strategy for type II player 2. Player 1’s expected utility by playing C is λ × 0+(1 − λ ) × 5 = 5 − 5 λ . Player 1’s expected utility by playing D is λ × 1 + (1 − λ ) × 8 = 8 − 7 λ > 5 − 5 λ . (D, (D if type I, C if type II)) is a BNE of the game. λ 1 − λ C D C D C 5, 5 0, 8 5, 5 0, 2 C 8, 0 1, 1 D D 8, 0 1, -5 Type I Type II CS286r Fall’08 Bayesian Games 9

Example 2: An Exchange Game Each of two players receives a ticket t on which there is a number in [0,1]. The number on a player’s ticket is the size of a prize that he may receive. The two prizes are identically and independently distributed according to a uniform distribution. Each player is asked independently and simultaneously whether he wants to exchange his price for the other player’s prize. If both players agree than the prizes are exchanges; otherwise each player receives his own prize. CS286r Fall’08 Bayesian Games 10

A Bayesian Nash Equilibrium for Example 2 Strategies of player 1 can be describe as “Exchange if t 1 ≤ k ” Given player 1 plays such a strategy, what is the best response of player 2? ◮ If t 2 ≥ k , no exchange ◮ If t 2 < k , exchange when t 2 ≤ k / 2 Since players are symmetric, player 1’s best response is of the same form. Hence, at a Bayesian Nash equilibrium, both players are willing to exchange only when t i = 0. CS286r Fall’08 Bayesian Games 11

Signaling (Sender-Receiver Games) There are two types of works, bright and dull. Before entering the job market a worker can choose to get an education (i.e. go to college), or enjoy life (i.e. go to beach). The employer can observe the educational level of the worker but not his type. The employer can hire or reject the worker. CS286r Fall’08 Bayesian Games 12

Example 3: Signaling Nature 1 − λ λ Bright Dull Worker Worker C B C B Employer Employer H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) CS286r Fall’08 Bayesian Games 13

Bayesian Extensive Games with Observable Actions A Bayesian extensive game with observable actions is ( N , H , P , Θ , p , u ) ◮ ( N , H , P ) is the same as those in an extensive-form game with perfect information ◮ Θ = { Θ 1 , Θ 2 , ..., Θ n } is the set of types. θ i ∈ Θ i is a realization of types for player i . Θ 1 = { Bright, Dull } . ◮ F : Θ → [0 , 1] is a joint probability distribution, according to which types of players are drawn p( θ 1 = Bright) = λ ◮ u = { u 1 , u 2 , ..., u n } where u i : Z × Θ → R is the utility function of player i . Z ∈ H is the set of terminal histories. CS286r Fall’08 Bayesian Games 14

Best Responses for Example 3 E.g. If the employer always plays H, then the best response for the worker is B. But how to define best responses for the employer? ◮ Beliefs on information sets ◮ Beliefs derived from strategies Nature 1 − λ λ Bright Dull Worker Worker C B C B Employer Employer H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) CS286r Fall’08 Bayesian Games 15

A Bayesian Nash Equilibrium of Example 3 Nature 1 − λ λ Bright Dull Worker Worker C B C B Employer Employer λ 1 − λ H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) p(Bright | Beach) = p(Bright) σ (Beach | Bright) λ · 1 p(Bright) σ (Beach | Bright)+p(Dull) σ (Beach | Dull) = λ · 1+(1 − λ ) · 1 = λ CS286r Fall’08 Bayesian Games 16

“Subgame Perfection” The previous Bayesian Nash Equilibrium is not “subgame perfect”. When the information set College is reached, the employer should choose to hire no matter what belief he has. We need to require sequential rationality even for off-equilibrium-path information sets. Then, beliefs on off-equilibrium-path information sets matter. CS286r Fall’08 Bayesian Games 17

Perfect Bayesian Equilibrium A strategy-belief pair, ( σ, µ ) is a perfect Bayesian equilibrium if (Beliefs) At every information set of player i , the player has beliefs about the node that he is located given that the information set is reached. (Sequential Rationality) At any information set of player i , the restriction of ( σ, µ ) to the continuation game must be a Bayesian Nash equilibrium. (On-the-path beliefs) The beliefs for any on-the-equilibrium-path information set must be derived from the strategy profile using Bayes’ Rule. (Off-the-path beliefs) The beliefs at any off-the-equilibrium-path information set must be determined from the strategy profile according to Bayes Rule whenever possible. CS286r Fall’08 Bayesian Games 18

Perfect Bayesian Equilibrium Perfect Bayesian equilibrium is a similar concept to sequential equilibrium, both trying to achieve some sort of “subgame perfection”. Perfect Bayesian equilibrium is defined for all extensive-form games with imperfect information, not just for Bayesian extensive games with observable actions. Thm : For Bayesian extensive games with observable actions, every sequential equilibrium is a Perfect Bayesian equilibrium. CS286r Fall’08 Bayesian Games 19

A Perfect Bayesian Equilibrium of Example 3 Nature λ 1 − λ Bright Dull Worker Worker C B C B Employer Employer β 1 − λ 1 − β λ H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) β ∈ [0 , 1] CS286r Fall’08 Bayesian Games 20

Summary of Non-Cooperative Game Theory Normal-Form Games ◮ Nash Equilibrium (pure strategy and mixed strategy) Extensive-Form Game with Perfect Information ◮ Subgame Perfect Nash Equilibrium Extensive-Form Game with Imperfect Information ◮ Sequential Equilibrium Bayesian games ◮ Bayesian Nash Equilibrium ◮ Perfect Bayesian Equilibrium CS286r Fall’08 Bayesian Games 21

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian - PowerPoint PPT Presentation

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian Games 1 So far Up to this point, we have assumed that players know all relevant information about each other. Such games are known as games with complete information.

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

CS440/ECE448 Lecture 15: Bayesian Inference and Bayesian Learning Slides by Svetlana Lazebnik,

Bayesian Learning 1 Outline MLE, MAP vs. Bayesian Learning Bayesian Linear Regression

CS 331: Bayesian Networks 2 1 Bayesian Networks Youve heard about how Bayesian networks

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Tom Nichols VP PC Games, North America Aeria Games & Entertainment Agenda Aeria Games?

August 7 7, , 2018 Firstly, be assured that we share the same goal Its about the students

LANXESS Q2 2020 results Tackling the crisis Matthias Zachert, CEO Michael Pontzen, CFO

Housing First in Finland Helsinki 8.2.2018 Juha Kaakinen, CEO 1 What we wanted to do and achieve

new w co comput putational ational asp spects ects Hoang Van-Long, Nguyen Dang-Hung

Latent Transition Analysis Exercises We are going to use Mplus to conduct some analyses in a

The Effects of LOCA Dose Estimation by Spray Droplet Surface Area Seung Chan LEE* Korea Hydro

CEM IBE Student Guidance and Performance Monitoring for the IB Diploma Key aims After this

Welcome to the World Handicap System Rules of Handicapping Education Seminar INSERT DATE Seminar

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian - PowerPoint PPT Presentation

Bayesian Games Yiling Chen October 1, 2008 CS286r Fall08 Bayesian Games 1 So far Up to this point, we have assumed that players know all relevant information about each other. Such games are known as games with complete information.

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

CS440/ECE448 Lecture 15: Bayesian Inference and Bayesian Learning Slides by Svetlana Lazebnik,

Bayesian Learning 1 Outline MLE, MAP vs. Bayesian Learning Bayesian Linear Regression

CS 331: Bayesian Networks 2 1 Bayesian Networks Youve heard about how Bayesian networks

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Tom Nichols VP PC Games, North America Aeria Games &amp; Entertainment Agenda Aeria Games?

August 7 7, , 2018 Firstly, be assured that we share the same goal Its about the students

LANXESS Q2 2020 results Tackling the crisis Matthias Zachert, CEO Michael Pontzen, CFO

Housing First in Finland Helsinki 8.2.2018 Juha Kaakinen, CEO 1 What we wanted to do and achieve

new w co comput putational ational asp spects ects Hoang Van-Long, Nguyen Dang-Hung

Latent Transition Analysis Exercises We are going to use Mplus to conduct some analyses in a

The Effects of LOCA Dose Estimation by Spray Droplet Surface Area Seung Chan LEE* Korea Hydro

CEM IBE Student Guidance and Performance Monitoring for the IB Diploma Key aims After this

Welcome to the World Handicap System Rules of Handicapping Education Seminar INSERT DATE Seminar

Tom Nichols VP PC Games, North America Aeria Games & Entertainment Agenda Aeria Games?