Mixed Strategies 4/24/17 Recall: Pursuit/Evasion Game - PowerPoint PPT Presentation

Mixed Strategies 4/24/17

Recall: Pursuit/Evasion Game

Pursuit/Evasion Payoff Matrix L R L 0,1 5,-1 R 3,-1 0,1 • None of the outcomes is a Nash equilibrium. Key idea: randomize your action so that it can’t be guessed.

Mixed Strategies Players can choose a probability distribution over their actions. For example, could go left with probability 0.4, and right with probability 0.6. 0.4 0.6 Mixed strategy: 〈 0.4, 0.6 〉

Responding to Mixed Strategies The best responses to a mixed strategy are the pure strategies with the highest expected value. 2 R P S R 0,0 -1,1 1,-1 1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 Consider the strategy 〈 ½, ¼, ¼ 〉 in Rock-Paper-Scissors. • U 1 (R, 〈 ½, ¼, ¼ 〉 ) denotes P1’s expected value for playing R against P2’s mixed strategy 〈 ½, ¼, ¼ 〉 .

Expected Value in Mixed Strategies 2 R P S R 0,0 -1,1 1,-1 1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 ✓ ⌧ 1 2 , 1 4 , 1 �◆ = 1 2 U 1 ( R, R ) + 1 4 U 1 ( R, P ) + 1 4 U 1 ( R, S ) U 1 R, 4 , = 1 2(0) + 1 4( − 1) + 1 4(1) = 0 Paper is the best ✓ ⌧ 1 2 , 1 4 , 1 �◆ = 1 2(1) + 1 4(0) + 1 4( − 1) = 1 response U 1 P, 4 , 4 ✓ ⌧ 1 �◆ 2 , 1 4 , 1 = 1 2( − 1) + 1 4(1) + 1 4(0) = − 1 U 1 S, 4 , 4

Mixed-Strategy Nash Equilibrium A Nash equilibrium is a mixed strategy for each player, where every player’s strategy is a best response to the others’ strategies. How can a mixed strategy be a best response? • Only possible if all of the actions with non-zero probability are best responses.

Rock-Paper-Scissors Nash Equilibrium 2 First verify that there are no R P S dominated strategies and no R 0,0 -1,1 1,-1 pure-strategy equilibria. 1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 ✓ ⌧ 1 3 , 1 3 , 1 �◆ = 1 3(0) + 1 3( − 1) + 1 3(1) = 0 U 1 R, 3 ✓ ⌧ 1 3 , 1 3 , 1 �◆ = 1 3(1) + 1 3(0) + 1 3( − 1) = 0 U 1 P, 3 ✓ ⌧ 1 3 , 1 3 , 1 �◆ = 1 3( − 1) + 1 3(1) + 1 U 1 S, 3(0) = 0 3 R, P, and S are all best responses to 〈 ⅓, ⅓, ⅓ 〉 for P1.

Rock-Paper-Scissors Nash Equilibrium 2 R P S By essentially the same calculations, R, P, and S are all best responses to R 0,0 -1,1 1,-1 1 〈 ⅓, ⅓, ⅓ 〉 for P2. P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 ✓⌧ 1 3 , 1 3 , 1 � ◆ = 1 3(0) + 1 3( − 1) + 1 3(1) = 0 U 2 , R 3 ✓⌧ 1 3 , 1 3 , 1 � ◆ = 1 3(1) + 1 3(0) + 1 3( − 1) = 0 U 2 , P 3 ✓⌧ 1 � ◆ 3 , 1 3 , 1 = 1 3( − 1) + 1 3(1) + 1 3(0) = 0 U 2 , S 3 Therefore, both players playing mixed strategy 〈 ⅓, ⅓, ⅓ 〉 is a Nash equilibrium.

A Tougher Example 2 R P S Suppose winning with R rocks! R 0,0 -1,1 2 ,-1 1 Should you play R more often, less P 1,-1 0,0 -1,1 often, or equally often than ⅓? S -1, 2 1,-1 0,0 Key insight: solve for the probabilities that make the other player(s) indifferent. P(R) = 4/12 P(P) = 5/12 P(S) = 3/12

Exercise: Find the Mixed-Strategy NE Step 1: find the probabilities can play to make indifferent between L and R. Step 2: find the probabilities can play to make indifferent between L and R. L R L 0,1 5,-1 R 3,-1 0,1

Mixed-Strategy Support The support of a mixed strategy is the set of actions that are played with non-zero probability. In all of the examples so far, all players have used full-support mixed strategies in equilibrium. Once we know the right support for every player, finding the probabilities requires solving a system of linear equations (linear programming). Finding the right supports is actually the hard part.

General Algorithm for Nash Equilibria eliminate dominated strategies search for pure strategy equilibria for each possible combination of supports: NE = find equilibrium with given supports for each player: Linear program BR = best response to NE if BR ∉ player’s support: NE is not an equilibrium There are exponentially many supports, so this algorithm takes exponential time. • It is an open problem whether a non-exponential algorithm exists.

Example: Hearthstone Meta-Game • Hearthstone is a collectable card game. • Players build a deck and then play against each other. These are the • The meta-game is the choice of which decks not in deck to play. the support. • A website called VS collects data on the win-rate of popular decks. • From those win-rates, a Nash equilibrium can be computed. This is the mixed-strategy Nash equilibrium

Exercise: construct and solve the game 1. Construct a payoff matrix that describes these agents’ incentives. 2. Find all Nash equilibria of the payoff matrix.

Mixed Strategies 4/24/17 Recall: Pursuit/Evasion Game - PowerPoint PPT Presentation

Mixed Strategies 4/24/17 Recall: Pursuit/Evasion Game Pursuit/Evasion Payoff Matrix L R L 0,1 5,-1 R 3,-1 0,1 None of the outcomes is a Nash equilibrium. Key idea: randomize your action so that it cant be guessed. Mixed

Mixed Strategies Krzysztof R. Apt CWI, Amsterdam, the Netherlands , University of Amsterdam

Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in

Mixed Precision Training PAI Overview What is mixed-precision

Mixed Methodological Analysis David F. Feldon Utah State University May 8, 2018 Mixed Methods

Regression 2: Mixed Models Marco Baroni Practical Statistics in R Outline Mixed models with

Mixing it up with random effects Joshua Loftus Mixed models Intro to mixed models What is a

EFFECTIVE USE OF MIXED PRECISION FOR HPC Kate Clark, Smoky Mountain Conference 2019 Why Mixed

MIXED PRECISION TRAINING OF DEEP NEURAL NETWORKS Carl Case, NVIDIA OUTLINE 1. What is mixed

MIXED PRECISION TRAINING Michael OConnor MIXED PRECISION What is the benefit? Using mixed

ECO 199 B GAMES OF STRATEGY Spring Term 2004 B February 26 MIXED STRATEGIES B ZERO-SUM GAMES MIXED

Finding Optimal Mixed Finding Optimal Mixed Strategies to Commit to in g Security Games

A Framework for Teaching Mixed classes Warm up How many of you teach mixed classes? What

Interchange Level 2 Presentation Plus (Mixed media Interchange Level 2 Presentation Plus (Mixed

Interchange Level 2 Presentation Plus (Mixed media Interchange Level 2 Presentation Plus (Mixed

CHEVREUL Simultaneous Contrast Successive Contrast Successive Contrast Mixed Contrast look

Town of East Fishkill Proposed Mixed-Use Zoning October 25, 2018 What is Mixed-Use Zoning?

Lessons from Fukushima August 7, 2012 David Lochbaum Director, Nuclear Safety Project Union of

W HAT IS P LAIN E NGLISH ? According to the Palin English Campaign in Britain, it is the

Reading engines for Visual Narratives by Laurent Le Meur / EDRLab 18 September 2018 EDRLab

Jackstraws : Picking Command and Control Connections from Bot Traffic egoire Jacob 1 , Ralf Hund 2

CMU 15-896 Noncooperative games 4: Stackelberg games Teacher: Ariel Procaccia A curious game

CSC304 Lecture 3 Game Theory (More examples, Computation of Mixed Nash Equilibria, Indifference

ECE700.07: Game Theory with Engineering Applications Le Lecture 3: Ga Games in Normal Form

Part II: Strategic Interaction Introduction of competition Three instruments to compete in

Sambuz

Useful Links

Newsletter

Mail Us