Previously in Game Theory

◮ decision makers:
  ◮ choices
  ◮ preferences
◮ solution concepts:
  ◮ best response
  ◮ Nash equilibrium
Rock, paper, scissors

      R      P      S
R   0, 0  −1, 1  1, −1
P  1, −1   0, 0  −1, 1
S  −1, 1  1, −1   0, 0
Learning in games
Repeated games
Best Response learning

1. Guess what the opponent(s) will play
2. Play a Best Response to that guess
3. Observe the play
4. Update the guess
BR learning: Cournot dynamics

Guess = last action played

Prisoner's Dilemma:
      C      D
C   2, 2  −1, 3
D  3, −1   0, 0

Rock, paper, scissors:
      R      P      S
R   0, 0  −1, 1  1, −1
P  1, −1   0, 0  −1, 1
S  −1, 1  1, −1   0, 0
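A minimal sketch of Cournot dynamics on these two games (action encodings and helper names are my own): in the Prisoner's Dilemma, simultaneous best responses settle at (D, D) at once; in Rock-Paper-Scissors, play cycles forever.

```python
# Cournot (best-response) dynamics: each player best-responds to the
# opponent's last action. Payoff matrices are the slides' Prisoner's
# Dilemma and Rock-Paper-Scissors tables, indexed payoff[own][opp].

def best_response(payoff, opp_action):
    """Action maximizing the row player's payoff against opp_action."""
    return max(range(len(payoff)), key=lambda a: payoff[a][opp_action])

def cournot(payoff1, payoff2, start, steps):
    """Iterate simultaneous best responses from a starting profile."""
    a, b = start
    history = [(a, b)]
    for _ in range(steps):
        a, b = best_response(payoff1, b), best_response(payoff2, a)
        history.append((a, b))
    return history

# Prisoner's Dilemma (0 = C, 1 = D): converges to (D, D) immediately.
PD = [[2, -1], [3, 0]]
print(cournot(PD, PD, (0, 0), 4)[-1])        # (1, 1)

# Rock-Paper-Scissors (0 = R, 1 = P, 2 = S): play cycles with period 3.
RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
print(cournot(RPS, RPS, (0, 0), 6))
```

Both games are symmetric, so the same matrix serves both players.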
BR learning: Fictitious play

Guess = empirical distribution of play

Rock, paper, scissors:
      R      P      S
R   0, 0  −1, 1  1, −1
P  1, −1   0, 0  −1, 1
S  −1, 1  1, −1   0, 0

Shapley's game:
      L      C      R
U   0, 0   0, 1   1, 0
M   1, 0   0, 0   0, 1
D   0, 1   1, 0   0, 0
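A sketch of fictitious play on Rock-Paper-Scissors (the initial fictitious counts and tie-breaking rule are my own choices): since RPS is zero-sum, the empirical frequencies approach the mixed equilibrium (1/3, 1/3, 1/3).

```python
# Fictitious play: each player best-responds to the empirical
# distribution of the opponent's past actions.

RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]  # payoff[own][opp]

def br_to_mixture(payoff, counts):
    """Best response to the empirical mixture given by action counts."""
    return max(range(len(payoff)),
               key=lambda a: sum(payoff[a][b] * counts[b]
                                 for b in range(len(counts))))

def fictitious_play(payoff, steps):
    counts1 = [1, 0, 0]   # fictitious prior: each player observed once at R
    counts2 = [1, 0, 0]
    for _ in range(steps):
        a = br_to_mixture(payoff, counts2)   # player 1 vs player 2's history
        b = br_to_mixture(payoff, counts1)
        counts1[a] += 1
        counts2[b] += 1
    total = sum(counts1)
    return [c / total for c in counts1]

freqs = fictitious_play(RPS, 3000)
print(freqs)   # each frequency close to 1/3
```

Note that the actions themselves keep cycling in ever-longer runs; only the empirical frequencies converge.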
Evolutionary learning

Action set A, utility function u.

Replicator dynamics: for p ∈ ∆(A) and k ∈ A,

    ṗk = pk ( u(k, p) − u(p, p) )
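The replicator equation above can be integrated with a simple Euler scheme; here is a sketch on Rock-Paper-Scissors, with u(k, p) = (Ap)k (the step size and starting point are arbitrary choices of mine).

```python
# Replicator dynamics  ṗk = pk (u(k, p) − u(p, p))  via Euler steps.
# The multiplicative update keeps p inside the probability simplex.

A = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]   # RPS payoff matrix

def replicator_step(p, dt):
    u = [sum(A[k][j] * p[j] for j in range(3)) for k in range(3)]  # u(k, p)
    avg = sum(p[k] * u[k] for k in range(3))                       # u(p, p)
    return [p[k] * (1 + dt * (u[k] - avg)) for k in range(3)]

p = [0.5, 0.3, 0.2]
for _ in range(10000):
    p = replicator_step(p, 0.01)
print(p)   # still a probability vector
```

In continuous time the interior RPS orbits circle the rest point (1/3, 1/3, 1/3); the discrete scheme follows them approximately.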
Battle of the Sexes

      O     F
O  3, 2  0, 0
F  0, 0  2, 3
Correlated equilibrium (CE)

a∗ ∈ A = ∏i Ai is a NE:
∀i, ∀a′i,  ui(a∗i, a∗−i) ≥ ui(a′i, a∗−i)

α ∈ ∏i ∆(Ai) is a NE: ∀i, ∀ai, ∀a′i,
Σa−i ui(ai, a−i) α(a) ≥ Σa−i ui(a′i, a−i) α(a)

π ∈ ∆(A) is a CE: ∀i, ∀ai, ∀a′i,
Σa−i ui(ai, a−i) π(a) ≥ Σa−i ui(a′i, a−i) π(a)
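The CE condition is a finite set of linear inequalities, so it can be checked directly. A sketch for two-player games (the function name and dict representation of π are my own):

```python
# π ∈ ∆(A) is a CE iff for every player i and actions ai, a'i:
#   Σ_{a−i} ui(ai, a−i) π(a)  ≥  Σ_{a−i} ui(a'i, a−i) π(a).

def is_correlated_eq(u1, u2, pi, tol=1e-9):
    n, m = len(u1), len(u1[0])
    for a in range(n):             # player 1: recommended a, deviation a2
        for a2 in range(n):
            if sum((u1[a][b] - u1[a2][b]) * pi[a, b] for b in range(m)) < -tol:
                return False
    for b in range(m):             # player 2: recommended b, deviation b2
        for b2 in range(m):
            if sum((u2[a][b] - u2[a][b2]) * pi[a, b] for a in range(n)) < -tol:
                return False
    return True

# Rock-paper-scissors: u2 is the transpose of u1 (symmetric zero-sum).
RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
RPS2 = [[RPS[b][a] for b in range(3)] for a in range(3)]
uniform = {(a, b): 1 / 9 for a in range(3) for b in range(3)}
point = {(a, b): float((a, b) == (0, 0)) for a in range(3) for b in range(3)}
print(is_correlated_eq(RPS, RPS2, uniform))   # True: uniform product is a CE
print(is_correlated_eq(RPS, RPS2, point))     # False: (R, R) is not
```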
No regret learning

Regret of player i for having played j rather than k:

    ui(k, a−i) − ui(j, a−i)

Cumulative regret up to time t:

    R^i_jk(t) = Σ{τ=0,…,t : ai(τ)=j} [ ui(k, a−i(τ)) − ui(j, a−i(τ)) ]

Under regret matching, the empirical distribution of play converges to the set of correlated equilibria.
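A sketch of regret matching (Hart & Mas-Colell) in self-play on Rock-Paper-Scissors; the class structure, constant μ, and seed are my own choices. After playing j, the player switches to k with probability proportional to the positive part of the average regret R^i_jk(t)/t.

```python
# Regret matching: switching probabilities follow positive average
# regrets; μ is a constant larger than any achievable average regret.
import random

RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]  # payoff[own][opp], symmetric

class RegretMatcher:
    def __init__(self, payoff, mu=8.0):
        self.payoff, self.mu = payoff, mu
        self.n = len(payoff)
        self.R = [[0.0] * self.n for _ in range(self.n)]  # cumulative R_jk
        self.t, self.last = 0, 0

    def act(self):
        if self.t == 0:
            return self.last                   # arbitrary first action
        j = self.last
        p = [max(self.R[j][k], 0.0) / (self.t * self.mu) for k in range(self.n)]
        p[j] = 1.0 - sum(p[k] for k in range(self.n) if k != j)
        return random.choices(range(self.n), weights=p)[0]

    def observe(self, own, opp):
        for k in range(self.n):                # regret for k instead of own
            self.R[own][k] += self.payoff[k][opp] - self.payoff[own][opp]
        self.last, self.t = own, self.t + 1

random.seed(0)
p1, p2 = RegretMatcher(RPS), RegretMatcher(RPS)   # RPS is symmetric
for _ in range(20000):
    a, b = p1.act(), p2.act()
    p1.observe(a, b)
    p2.observe(b, a)

max_avg_regret = max(r / p1.t for row in p1.R for r in row)
print(max_avg_regret)   # shrinks as t grows: play approaches the CE set
```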
Learning in games

◮ Best response
◮ Replicator dynamics
◮ No regret
Repeated games
Markov Decision Process (MDP)

state space X
action space U
transition P : X × U → ∆(X)
reward r : X × U → R
discount factor δ ∈ [0, 1)

U(x(·), u(·)) = Σ_{t=0}^{+∞} δ^t r(x(t), u(t))
MDP (continued)

history h ∈ H = (X × U)∗
policy π : H → ∆(U)

V^π(x0) = Eπ [ U(x(·), u(·)) ]
V(x0) = max_π V^π(x0)
Principle of Optimality

Bellman's equation:
V(x0) = max_{u0} [ r(x0, u0) + δ E_{x1 ∼ P(x0, u0)} V(x1) ]
Dynamic Programming

Solving the MDP:
◮ knowing P: value iteration
◮ not knowing P: online learning
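Value iteration simply applies the Bellman operator until it reaches its fixed point; a minimal sketch on a tiny two-state MDP whose transitions and rewards are made up for illustration.

```python
# Value iteration:  V(x) ← max_u [ r(x, u) + δ Σ_y P(y | x, u) V(y) ].
# Converges because the Bellman operator is a δ-contraction (δ < 1).

delta = 0.9                          # discount factor δ
# P[x][u] = distribution over next states; r[x][u] = reward (made up)
P = [[[0.8, 0.2], [0.1, 0.9]],
     [[0.5, 0.5], [0.0, 1.0]]]
r = [[1.0, 0.0],
     [0.0, 2.0]]

def bellman_update(V):
    return [max(r[x][u] + delta * sum(P[x][u][y] * V[y] for y in range(2))
                for u in range(2))
            for x in range(2)]

V = [0.0, 0.0]
for _ in range(500):
    V_next = bellman_update(V)
    if max(abs(a - b) for a, b in zip(V, V_next)) < 1e-10:
        break
    V = V_next
print(V)   # fixed point of the Bellman operator
```

Here the optimal policy parks in state 1 (reward 2 forever), so V(1) = 2/(1 − δ) = 20.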
Repeated game

Game (I, ∏i Ai, (ui)i∈I)
Discount factor δ

Ui(a(·)) = Σ_{t=0}^{+∞} δ^t ui(a(t))

Strategy σ : H → ∏i ∆(Ai)
Vi(σ) = Eσ [ Ui(a(·)) ]
Nash equilibrium

Player i:
◮ choices σi
◮ utility Vi

Nash equilibrium is not strong enough: off the equilibrium path it can rest on non-credible threats, so sequential rationality is needed. (Explanation on the whiteboard ⇒)
Information structure

◮ perfect
◮ imperfect
◮ public
◮ private (beliefs)
Folk theorem

Any feasible, strictly individually rational payoff can be sustained by a sequentially rational equilibrium.

Holy grail for repeated games.
[Figure: feasible payoff set of the Prisoner's Dilemma in the (u1, u2) plane, spanned by the payoff points CC, DD, DC, CD.]
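A minimal folk-theorem-style computation on the Prisoner's Dilemma above, using the discounted utility Ui = Σ δ^t ui(a(t)): under the standard grim-trigger strategy (cooperate until the first defection, then defect forever), cooperation is sustainable iff 2/(1 − δ) ≥ 3, i.e. δ ≥ 1/3. The function names are my own.

```python
# Grim trigger in the Prisoner's Dilemma: (C, C) pays 2 each period;
# a deviation pays 3 once, then (D, D) pays 0 forever.

def cooperate_value(delta):
    """Discounted value of cooperating forever: 2 + 2δ + 2δ² + ..."""
    return 2 / (1 - delta)

def deviate_value(delta):
    """Best one-shot deviation: 3 today, 0 in every later period."""
    return 3.0

def sustainable(delta):
    return cooperate_value(delta) >= deviate_value(delta)

print(sustainable(0.2), sustainable(0.5))   # False True
```

So sufficiently patient players (δ ≥ 1/3 here) can sustain the cooperative payoff, as the folk theorem promises.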
Research

◮ Weakly belief-free equilibria
◮ Characterization of repeated games with correlated equilibria
Repeated games

◮ Dynamic programming
◮ Repeated games
◮ Folk theorem