CSCI 3210: Computational Game Theory www.mtirfan.com/CSCI-3210 - - PDF document

csci 3210 computational game theory mtirfan com csci 3210
SMART_READER_LITE
LIVE PREVIEW

CSCI 3210: Computational Game Theory www.mtirfan.com/CSCI-3210 - - PDF document

9/28/20 CSCI 3210: Computational Game Theory www.mtirfan.com/CSCI-3210 Mohammad T . Irfan Email: mirfan@bowdoin.edu Website: www.mtirfan.com 1 Syllabus and required background u Course website u www.mtirfan.com/CSCI-3210 10 1 9/28/20


slide-1
SLIDE 1

9/28/20 1

CSCI 3210: Computational Game Theory

Mohammad T . Irfan Email: mirfan@bowdoin.edu Website: www.mtirfan.com

www.mtirfan.com/CSCI-3210

1

Syllabus and required background

u Course website

u www.mtirfan.com/CSCI-3210

10

slide-2
SLIDE 2

9/28/20 2

You said you are here because:

u Be intellectually challenged in a class that is

  • utside of my comfort zone!

u The game theory concept! u new algorithms! u I’ve been fascinated by game theory in the

past, so I’m excited to take a look at it through a cs lens.

u Learning something new 😂 u The difference between human decisions and

rational thinking

u Don’t know a lot about game theory, but

most curious about application to human choice

11

Games and game theory: A brief introduction

Reading:

  • 1. Ch 6 of Easley-Kleinberg (pdf on class

website)

  • 2. [EGT] Ch 1-3

12

slide-3
SLIDE 3

9/28/20 3

Game Theory

u “Game”

u Ernst Zermelo (1913): In any chess game that

does not end in a draw, a player has a winning strategy u Mathematical theory of strategic

decision making

u John von Neumann (1944)

13

Example: Split or Steal

u https://www.youtube.com/watch?v=yM38mR

HY150

u Rules of the game u Outcome

14

slide-4
SLIDE 4

9/28/20 4

One possible model

u What will happen? Payoff matrix

Split Steal Split

$33K, $33K $0+fr., $66K

Steal

$66K, $0+fr. $0, $0

15

Payoff matrix

Split Steal Split

$33K, $33K $0+fr., $66K

Steal

$66K, $0+fr. $0, $0

Why did they end up with 0?

Lucy Tony

Nash Equilibrium

Everyone plays their best response to others simultaneously

John F . Nash Nobel Prize, 1994

16

slide-5
SLIDE 5

9/28/20 5

Nash equilibrium

Practical scenarios = Stable outcome = Nash equilibrium

17

Applications

u Application: market equilibria

u Predict where the market is heading to

u Mechanism design and auctions

u Google and Yahoo apply game-theoretic techniques

u Keyword search auction

u Spectrum allocation among wireless companies

18

18

slide-6
SLIDE 6

9/28/20 6

Applications

u Understanding the Internet

u Selfish routing is a constant-factor off from

  • ptimal

19

19

Other applications

u Load balancing and resource allocation u p2p and file sharing systems u Cryptography and security u Social and economic networks, etc.

20

slide-7
SLIDE 7

9/28/20 7

Next: Formal discussion

(Without math)

21

Game

u One-shot games (simultaneous move) u 3 components

u Players u Strategies/actions u Payoffs

Payoff matrix

Split Steal Split

$33K, $33K $0+fr., $66K

Steal

$66K, $0+fr. $0, $0

Tony Lucy

Call these pure strategies Pure-strategy NE

22

slide-8
SLIDE 8

9/28/20 8

Nash equilibrium (NE)

u A joint strategy (one strategy/player) s.t.

every player plays their best response to

  • thers simultaneously

u (Equiv.) A joint strategy s.t. no player gains

by deviating unilaterally

u Useful for checking whether a cell is NE

23

Famous example: prisoner's dilemma

u What will they do? Payoff matrix

Not Confess Confess Not Confess

  • 1, -1
  • 10, 0

Confess

0, -10

  • 5, -5

Suspect 1 Suspect 2

24

slide-9
SLIDE 9

9/28/20 9

Assumptions

u Payoffs reflect player’s preference u Payoffs are known to all u Actions are known to all (different players

could have different actions– but everyone knows everyone’s actions)

u Each player wants to maximize own payoff

subject to others' actions

25

Commonly Used Terms

26

slide-10
SLIDE 10

9/28/20 10

Best response

u Best strategy of a player, given the other

players’ strategies

u Always exists!

27

(Strictly/weakly) dominant strategy

u A strategy of a player that is

(strictly/weakly) better than any of their

  • ther strategies, no matter what the other

players do

u Does not always exist

28

slide-11
SLIDE 11

9/28/20 11

Pure-strategy Nash equilibrium (PSNE)

u Players do not use any probability in

choosing strategies as they do in "mixed- strategy" (to be covered later)

u Every player plays their best response to others

simultaneously

29

Checkpoint

u What is the difference between a dominant

strategy and a best response?

u What is the difference between best

response and PSNE?

u What is the difference between weakly and

strictly dominant strategies? Will a player always have one?

30

slide-12
SLIDE 12

9/28/20 12

Checkpoint: “Generalization”

u Watch the following clip from the movie A

Beautiful Mind that tries to portray John Nash’s discovery of Nash equilibrium.

u Is this actually a Nash equilibrium?

32 33

slide-13
SLIDE 13

9/28/20 13

Misconceptions

u Equilibrium signifies a tie/draw/balance u Equilibrium outcome is the best possible

  • utcome for all players (A Beautiful Mind)

u Self-interested players want to hurt each

  • ther

34

Questions

u Does NE always exist? (Answer later ...) u If it exists, is it unique?

35

slide-14
SLIDE 14

9/28/20 14

Games with multiple NE

  • 1. Battle of the sexes (Coordination)
  • 2. Hawk-dove game (anti-coordination)

36

Does NE always exist? Mixed-strategy NE

44

slide-15
SLIDE 15

9/28/20 15

Penalty kick game

45

45

Penalty kick game (continued)

46

46

slide-16
SLIDE 16

9/28/20 16

Penalty kick game (continued)

47

47

Penalty kick game- first model

u E[GK plays Left] = p(1) + (1-p)(-1) = 2p – 1 u E[GK plays Right] = p(-1) + (1-p)(1) = 1 – 2p u 2p – 1 = 1 – 2p è p = ½ u Similarly, q = ½

48

Left (q) Right (1- q) Left (p) -1, +1 +1, -1 Right (1- p)

+1, -1 -1, +1

Goalkeeper Shooter

Zero-sum Game

48

slide-17
SLIDE 17

9/28/20 17

Penalty kick game (real-world)

u “Professionals Play Minimax”- Ignacio

Palacios-Huerta

Left (0.42) Right (0.58) Left (0.38)

0.58, 0.42 0.95, 0.05

Right (0.62)

0.93, 0.07 0.70, 0.30

Shooter Goalkeeper

Equilibrium probabilities (computed by solving equations) match real-world probabilities from data! From real- world data

49

What does mixed strategy mean?

u Active randomization – tennis, soccer u Proportion interpretation – evolutionary

biology

u Probabilities of player 1 are the beliefs of

player 2 about what player 1 is doing (Bob Aumann)

u Misconception

u Players just choose probabilities

u Correct

u players play pure strategies chosen according to

these probabilities

50

slide-18
SLIDE 18

9/28/20 18

Von Neumann’s Theorem (1928)

u Every finite 2-person zero-sum game has a

mixed equilibrium John von Neumann (1903 – 1957)

51

51

Theorem of Nash (1950)

u Every finite game has an equilibrium in

mixed strategies

u Reading

John F . Nash (1928 – 2015) Nobel Prize, 1994

52

52

slide-19
SLIDE 19

9/28/20 19

Pure-strategy vs. mixed-strategy NE

Hawk-dove game

53

Key take-away messages

u Players act simultaneously, but NE outcome

is stable in the sense that there is no incentive for unilateral deviation.

u There is always at least one mixed-strategy

  • NE. A pure-strategy NE is not guaranteed.

u The concept of NE doesn't say how NE

happens.

u NE is not a balance or tie. It is often times a

socially-inefficient outcome.

59

slide-20
SLIDE 20

9/28/20 20

Formal (Mathematical) Definitions

61

Background: discrete math

u Set theory

u Sets u Representation: list, describe properties u Belongs to u Subset u Empty set u Power set u Operations: union, intersection, difference,

product

u Sum, product

62

slide-21
SLIDE 21

9/28/20 21

Normal form games

63

Example

u Idea: Connor Marrs u Assume: Cars don’t want to wait; all turns/S same One way St

L/R L/R/S

Photo modified from topdriver.com

Car1 Car2

64

slide-22
SLIDE 22

9/28/20 22

  • 1. Who are the players?

u N = {1, 2}

  • 2. What can each player do?

u A1 = {L, R, S}; A2 = {L, R} u Set of action profiles

A= A1 x A2 = {(L, L), (L, R), (R, L), (R, R), (S, L), (S, R)}

u Each element (e.g., (L,R)) is called an action profile

  • 3. What are the payoffs?

65

L R L 1, 1 0, 0 R 0, 0 1, 1 S 0, 0 1, 1

One way St

L/R L/R/S Car 1 Car 2

66

slide-23
SLIDE 23

9/28/20 23

Zero-sum/constant-sum game

u Is constant sum the same as zero sum?

action profile

67

Example (zero-sum game)

u You and opponent flip two coins u Same parity (both heads up or tails up)=> you

win (keep both coins)

u Otherwise, opponent wins

You Opponent

68

slide-24
SLIDE 24

9/28/20 24

Mixed-strategy NE

u Mixed strategy of a player and mixed-

strategy profile of all players

u Expected utility u Best response u Nash equilibrium

69

Mixed strategy

  • A mixed strategy is an element of Si

70

slide-25
SLIDE 25

9/28/20 25

Expected utility

71

Best response

72

slide-26
SLIDE 26

9/28/20 26

Nash equilibrium

73