Nash Equilibrium in Tullock Contests Aidas Masiliunas 1 1 - - PowerPoint PPT Presentation

nash equilibrium in tullock contests
SMART_READER_LITE
LIVE PREVIEW

Nash Equilibrium in Tullock Contests Aidas Masiliunas 1 1 - - PowerPoint PPT Presentation

Nash equilibrium Non-standard preferences Experimental design Results Other projects Nash Equilibrium in Tullock Contests Aidas Masiliunas 1 1 Aix-Marseille School of Economics Controversies in Game Theory III, ETH Zurich 2 June, 2016 Nash


slide-1
SLIDE 1

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Nash Equilibrium in Tullock Contests

Aidas Masiliunas1

1Aix-Marseille School of Economics

Controversies in Game Theory III, ETH Zurich 2 June, 2016

slide-2
SLIDE 2

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rent-seeking (Tullock) contest

Two players compete for a prize (16 ECU) by making costly investments (x1, x2 ≤ 16) Higher investments increase the probability to win the prize Probability that player i receives the prize:

xi xi+xj

slide-3
SLIDE 3

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rent-seeking (Tullock) contest

Two players compete for a prize (16 ECU) by making costly investments (x1, x2 ≤ 16) Higher investments increase the probability to win the prize Probability that player i receives the prize:

xi xi+xj

Applications:

Competition for monopoly rents Investments in R&D Competition for a promotion/bonus Political contests

slide-4
SLIDE 4

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Theory

E(π) =

xi xi+xj · 16 + 16 − xi

BRi(xj) : x∗

i = 16xj − xj

RNNE : x∗

i = 4, dominance solvable in three steps.

5 10 15 5 10 15

Standard preferences

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 1 2 3 4 5 6 7 8 9 10 12 14 16

slide-5
SLIDE 5

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Explanatory power of Nash equilibrium in experiments

7.04% of choices are exactly Nash 60.19% of choices are strictly dominated Investments are spread across the whole strategy space Experience does not help Less stability compared to auctions

slide-6
SLIDE 6

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Comparative statics of Nash equilibrium

An alternative to point predictions is comparative statics Is behaviour sensitive to changes in the Nash prediction?

slide-7
SLIDE 7

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Comparative statics of Nash equilibrium

An alternative to point predictions is comparative statics Is behaviour sensitive to changes in the Nash prediction? Players Nash Mean investment 2 250 325 3 222 283 4 188 302 5 160 322 9 99 326

Source: Lim, Matros & Turocy, 2014

slide-8
SLIDE 8

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Why should players choose Nash equilibrium?

Interpretation #1: Nash equilibrium is the unique action profile that can be justified by common knowledge of rationality. Rationality = maximization of expected payoff given some belief.

slide-9
SLIDE 9

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rationalizable strategies

xi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 BR(xi) 3 4 4 4 4 4 4 3 3 3 2 2 1 1 1 1

slide-10
SLIDE 10

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rationalizable strategies

xi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 BR(xi) 3 4 4 4 4 4 4 3 3 3 2 2 1 1 1 1

Rationality

Rationalizable: 3, 4, 2, 1

slide-11
SLIDE 11

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rationalizable strategies

xi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 BR(xi) 3 4 4 4 4 4 4 3 3 3 2 2 1 1 1 1 BR(BR(xi)) 4 4 4 4 4 4 4 4 4 4 4 4 3 3 3 3

Rationality

Rationalizable: 3, 4, 2, 1

Rationality + belief that the opponent is rational

Rationalizable: 3, 4

slide-12
SLIDE 12

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rationalizable strategies

xi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 BR(xi) 3 4 4 4 4 4 4 3 3 3 2 2 1 1 1 1 BR(BR(xi)) 4 4 4 4 4 4 4 4 4 4 4 4 3 3 3 3 BR(BR(BR(xi))) 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4

Rationality

Rationalizable: 3, 4, 2, 1

Rationality + belief that the opponent is rational

Rationalizable: 3, 4

Rationality + belief that the opponent is rational + belief that the opponent believes in my rationality

Rationalizable: 4

slide-13
SLIDE 13

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Rationalizable strategies

xi 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 BR(xi) 3 4 4 4 4 4 4 3 3 3 2 2 1 1 1 1 BR(BR(xi)) 4 4 4 4 4 4 4 4 4 4 4 4 3 3 3 3 BR(BR(BR(xi))) 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4

Rationality

Rationalizable: 3, 4, 2, 1

Rationality + belief that the opponent is rational

Rationalizable: 3, 4

Rationality + belief that the opponent is rational + belief that the opponent believes in my rationality

Rationalizable: 4

Epistemic definition of Nash equilibrium: common belief in rationality + simple belief hierarchy

slide-14
SLIDE 14

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Why should players choose Nash equilibrium?

Nash equilibrium is the unique action profile that cannot be ruled out by common knowledge of rationality.

1

Players care about expected payoffs

2

Players have the ability to calculate expected payoffs and identify dominated strategies

3

Players believe that other players satisfy 1-2, and believe that they believe that they satisfy 1-2...

slide-15
SLIDE 15

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Why should players choose Nash equilibrium?

Nash equilibrium is the unique action profile that cannot be ruled out by common knowledge of rationality.

1

Players care about expected payoffs

2

Players have the ability to calculate expected payoffs and identify dominated strategies

3

Players believe that other players satisfy 1-2, and believe that they believe that they satisfy 1-2...

Nash equilibrium is the rest point of various learning dynamics

Belief-based learning, e.g. Cournot best-response, fictitious play

Assumption 3 is not necessary

slide-16
SLIDE 16

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Why should players choose Nash equilibrium?

Nash equilibrium is the unique action profile that cannot be ruled out by common knowledge of rationality.

1

Players care about expected payoffs

2

Players have the ability to calculate expected payoffs and identify dominated strategies

3

Players believe that other players satisfy 1-2, and believe that they believe that they satisfy 1-2...

Nash equilibrium is the rest point of various learning dynamics

Belief-based learning, e.g. Cournot best-response, fictitious play

Assumption 3 is not necessary

Payoff-based learning, e.g. reinforcement learning

Players must be willing to explore, remember past payoffs, receive accurate feedback.

slide-17
SLIDE 17

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Which assumptions are violated?

slide-18
SLIDE 18

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Preference-based explanations: joy of winning

Participants receive non-monetary utility from winning (Parco et al, 2005, Sheremeta, 2011) or lose utility after losing (Delgado et al., 2008). Sheremeta (2011) elicits joy of winning by implementing a contest where prize has no value.

5 10 15 5 10 15

Joy of winning with w=3

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 1 2 3 4 5 6 7 8 9 10 12 14 16 5 10 15 5 10 15

Joy of winning with w = 8

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 1 2 3 4 5 6 7 8 9 10 12 14 16

slide-19
SLIDE 19

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Preference-based explanations: risk preferences

CRRA untility function: u(πi) = π1−ρ

i

1−ρ

Risk aversion if ρ = 0.5, risk seeking if ρ = −0.5

5 10 15 5 10 15

Risk aversion

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 1 2 3 4 5 6 7 8 9 10 12 14 16 5 10 15 5 10 15

Risk seeking

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 1 2 3 4 5 6 7 8 9 10 12 14 16

slide-20
SLIDE 20

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Preference-based explanations: social preferences

Fehr & Schmidt (1999) inequality aversion: u(πi, πj) = πi − α(πj − πi) if πi ≤ πj πi − β(πi − πj) if πi > πj

5 10 15 5 10 15

Fehr and Schmidt (1999) inequality aversion

Other plays Best Response 1 2 3 4 5 6 7 8 9 10 12 14 16 1 2 3 4 5 6 7 8 9 11 13 15 a=0, b=0 a=0.5, b=0 a=1, b=0

slide-21
SLIDE 21

Nash equilibrium Non-standard preferences Experimental design Results Other projects

All preferences from Sheremeta (2015)

slide-22
SLIDE 22

Nash equilibrium Non-standard preferences Experimental design Results Other projects

”Behavioral Variation in Tullock Contests”, joint with F. Mengel and Ph. Reiss

Deviations from NE could be a result of bounded rationality Players optimize given the feedback in previous rounds. Noisy feedback prevents players from discovering optimal actions

slide-23
SLIDE 23

Nash equilibrium Non-standard preferences Experimental design Results Other projects

”Behavioral Variation in Tullock Contests”, joint with F. Mengel and Ph. Reiss

Deviations from NE could be a result of bounded rationality Players optimize given the feedback in previous rounds. Noisy feedback prevents players from discovering optimal actions Research questions:

Can we identify whether deviations from NE are a result of bounded rationality or of preferences? Is behavioral variability lower and choices closer to theoretical predictions when feedback is more informative?

slide-24
SLIDE 24

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How informative is the feedback that players observe?

Reinforcement learning converges to NE as t → ∞ In experiments players rely on small samples of experience

slide-25
SLIDE 25

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How informative is the feedback that players observe?

Reinforcement learning converges to NE as t → ∞ In experiments players rely on small samples of experience Suppose that players always choose the action that yielded highest average payoff in the past.

slide-26
SLIDE 26

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How informative is the feedback that players observe?

Reinforcement learning converges to NE as t → ∞ In experiments players rely on small samples of experience Suppose that players always choose the action that yielded highest average payoff in the past.

slide-27
SLIDE 27

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Feedback depends on other’s choices and lottery outcomes

slide-28
SLIDE 28

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Treatment 1: eliminate lottery allocation

slide-29
SLIDE 29

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Treatment 2: eliminate variability of opponent’s choices

slide-30
SLIDE 30

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Treatment 3: eliminate both

slide-31
SLIDE 31

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How easy is it to learn in different treatments?

Estimate the likelihood that action 4 will yield a higher average payoff than action 6.

Π(4) > Π(6)

Memory length % of iterations 10 20 30 40 50 25 50 75 100

  • Shared prize, fixed actions

Shared prize, changing actions Lottery, fixed actions Lottery, changing actions

slide-32
SLIDE 32

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How easy is it to learn in different treatments?

Estimate the likelihood that action 4 will yield a higher average payoff than action 6.

Π(4) > Π(6)

Memory length % of iterations 10 20 30 40 50 25 50 75 100

  • Shared prize, fixed actions

Shared prize, changing actions Lottery, fixed actions Lottery, changing actions

slide-33
SLIDE 33

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How easy is it to learn in different treatments?

Estimate the likelihood that action 4 will yield a higher average payoff than action 6.

Π(4) > Π(6)

Memory length % of iterations 10 20 30 40 50 25 50 75 100

  • Shared prize, fixed actions

Shared prize, changing actions Lottery, fixed actions Lottery, changing actions

slide-34
SLIDE 34

Nash equilibrium Non-standard preferences Experimental design Results Other projects

How easy is it to learn in different treatments?

Estimate the likelihood that action 4 will yield a higher average payoff than action 6.

Π(4) > Π(6)

Memory length % of iterations 10 20 30 40 50 25 50 75 100

  • Shared prize, fixed actions

Shared prize, changing actions Lottery, fixed actions Lottery, changing actions

slide-35
SLIDE 35

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Procedure

40 rounds, divided into 4 blocks of 10 rounds Each block divided into experimentation phase (rounds 1-5) and incentivized phase (rounds 6-10)

1 5 10 6 11 15 16 20 21 26 30 25 35 31 36 40

Non-incentivized Non-incentivized Non-incentivized Non-incentivized Incentivized Incentivized Incentivized Incentivized

Block 4 Block 3 Block 2 Block 1

One round from each block randomly chosen for payment Incentivized numeracy test at the end of the experiment Average earnings 15.15 euro, duration 60 minutes

slide-36
SLIDE 36

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Explanatory power of Nash equilibrium

Changing actions Fixed actions Lottery EV Lottery EV P(x = NE) 7.04% 13.33%

  • P(x = BR)
  • 22.50%

65.23% P(|x − NE| ≤ 1) 25.74% 32.78%

  • P(|x − BR| ≤ 1)
  • 47.95%

83.64% P(x > 4) 60.19% 62.78% 51.36% 16.14%

Absolute value of deviation from equilibrium significantly different between EV/Fixed treatment and the other three treatments, but not in

  • ther comparisons.
slide-37
SLIDE 37

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Behavioral variation

Is the distribution of choices more concentrated? (not necessarily around NE) Entropy measures the stochastic variation of a random variable (0 = one strategy always chosen, 4 = all strategies chosen with equal frequency): H = −

  • i=1...16

pi log(pi)

slide-38
SLIDE 38

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Behavioral variation

Is the distribution of choices more concentrated? (not necessarily around NE) Entropy measures the stochastic variation of a random variable (0 = one strategy always chosen, 4 = all strategies chosen with equal frequency): H = −

  • i=1...16

pi log(pi)

Changing actions Fixed actions Lottery EV Lottery EV Entropy 3.22 2.79 2.45 1.50

  • Std. Dev.

3.28 2.56 3.15 1.16

slide-39
SLIDE 39

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Best-response curves in Fixed treatments

slide-40
SLIDE 40

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Stability of choices and convergence

slide-41
SLIDE 41

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Stability of choices and convergence

Changing strategies between rounds in experimentation and incentivized rounds.

slide-42
SLIDE 42

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Replacing humans by computers

Playing against a computer player is different than playing against a human player: no social preferences, lower joy of winning (?)

slide-43
SLIDE 43

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Replacing humans by computers

Playing against a computer player is different than playing against a human player: no social preferences, lower joy of winning (?) Additional treatment replacing computers by human players. All effects replicate if Fixed/EV treatment is replaced by this treatment.

Changing actions Fixed actions Lottery EV Lottery EV EV-Human P(x = NE) 7.04% 13.33%

  • P(x = BR)
  • 22.50%

65.23% 50.42% P(|x − NE| ≤ 1) 25.74% 32.78%

  • P(|x − BR| ≤ 1)
  • 47.95%

83.64% 74.58% P(x > 4) 60.19% 62.78% 51.36% 16.14% 23.33% Entropy 3.22 2.79 2.45 1.50 1.13

  • Std. Dev.

3.28 2.56 3.15 1.16 0.91

slide-44
SLIDE 44

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Strategic uncertainty vs stability

Matching players to computers has two effects:

The action of the other party is stable over time, hence it is easier to learn. Players face no strategic uncertainty, hence it is easier to

  • ptimize
slide-45
SLIDE 45

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Strategic uncertainty vs stability

Matching players to computers has two effects:

The action of the other party is stable over time, hence it is easier to learn. Players face no strategic uncertainty, hence it is easier to

  • ptimize

Is stability of choices necessary in addition to the removal of strategic uncertainty? Design: computer plays actions from the baseline contest, players know these actions.

slide-46
SLIDE 46

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Strategic uncertainty vs stability

Matching players to computers has two effects:

The action of the other party is stable over time, hence it is easier to learn. Players face no strategic uncertainty, hence it is easier to

  • ptimize

Is stability of choices necessary in addition to the removal of strategic uncertainty? Design: computer plays actions from the baseline contest, players know these actions.

Changing actions Changing but known Fixed actions Lottery EV Lottery EV Lottery EV P(a = NE) 7.04% 13.33%

  • P(a = BR)
  • 7.59%

25.37% 22.50% 65.23% P(|a − NE| ≤ 1) 25.74% 32.78%

  • P(|a − BR| ≤ 1)
  • 25.00%

51.85% 47.95% 83.64% P(a > 4) 60.19% 62.78% 62.96% 47.04% 51.36% 16.14%

slide-47
SLIDE 47

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Strategic uncertainty vs stability

slide-48
SLIDE 48

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Contests with forgone payoff information

Conclusion from the first paper: when feedback is more informative about the quality of actions, players make better choices. Can we improve the quality of feedback without changing the nature of the game?

slide-49
SLIDE 49

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Contests with forgone payoff information

Conclusion from the first paper: when feedback is more informative about the quality of actions, players make better choices. Can we improve the quality of feedback without changing the nature of the game? Hypothesis: more information and higher quality of information increases the rate of learning Design: 10 rounds of standard contest, 20 rounds of contest with foregone payoff information, 10 rounds of standard contest

slide-50
SLIDE 50

Nash equilibrium Non-standard preferences Experimental design Results Other projects

”Contests with foregone payoff information”

slide-51
SLIDE 51

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Hypotheses: reinforcement learning simulation

Π(2) > Π(4)

Memory length % of iterations 10 20 30 40 50 25 50 75 100

  • Same actions, same random numbers

Different actions, same random numbers Same actions, different random numbers Different actions, different random numbers

slide-52
SLIDE 52

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Results: average investments

slide-53
SLIDE 53

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Results: dominated strategies

slide-54
SLIDE 54

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Payoff based learning, joint with H. Nax

Calculating expected values is very complicated Convergence is much higher when players can use a payoff table/calculator and with neutral framing

200 400 600 800 invest 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

slide-55
SLIDE 55

Nash equilibrium Non-standard preferences Experimental design Results Other projects

Summary

Nash equilibrium has a very low explanatory power in Tullock contests Explanatory power is much higher when actions have direct payoff consequences Providing additional feedback about foregone payoff information does not improve the explanatory power Paying the expected payoffs does not improve learning, unless players know these payoffs.