simultaneous move games
play

Simultaneous-Move Games 4/21/17 Recall: Game Trees 1 3,0 0,3 - PowerPoint PPT Presentation

Simultaneous-Move Games 4/21/17 Recall: Game Trees 1 3,0 0,3 2,1 1,2 2 2 2 2 A R A R A R A R 3,-.5 2,.5 0,0 -.5,3 0,0 0,0 .5,2 0,0 Agents make decisions sequentially. Outcomes (leaves) have a utility for each


  1. Simultaneous-Move Games 4/21/17

  2. Recall: Game Trees 1 3,0 0,3 2,1 1,2 2 2 2 2 A R A R A R A R 3,-.5 2,.5 0,0 -.5,3 0,0 0,0 .5,2 0,0 • Agents make decisions sequentially. • Outcomes (leaves) have a utility for each agent. • Solve by backward induction. Good for modeling: • Classic board games (connect four, hex, etc.) • Discrete time interactions (resource sharing, deterrence)

  3. Simultaneous Moves What if agents make decisions at the same time? • Rock-paper-scissors 2 R P S R 0,0 -1,1 1,-1 1 P 1,-1 0,0 -1,1 S -1,1 1,-1 0,0 normal form game • payoff matrix •

  4. Navigation Example (sequential) A B

  5. Exercise: Construct a game tree. Available actions: Up, Right Up, Left Timing: short path: 30s 45s long path: 40s 60s conflict: adds 20 seconds to both

  6. Navigation Example (simultaneous) A B

  7. Exercise: Fill in the payoff matrix. Convention: Row player’s utility first. U L U R Bonus question: what if a conflict only costs 5s?

  8. Nash Equilibrium • An outcome where no agent can gain by unilateral deviation. U L U L U U -60,-80 -40,-45 -45,-65 -40,-45 R R -30,-60 -50,-65 -30,-60 -35,-50

  9. Identifying Nash Equilibria for each cell in the payoff matrix: for each player: for each deviation: if deviation > strategy: cell is not a NE if no beneficial deviations: cell is a NE D E F A 9,3 1,4 7,3 B 4,1 3,3 6,2 C -1,9 2,8 8,-1

  10. Dominated Strategies If strategy X is always better than strategy Y , then Y can be eliminated from the game. D E F Exercise: A 9,3 1,4 7,3 Iteratively eliminate dominated strategies until no more dominated B 4,1 3,3 6,2 strategies remain. C -1,9 2,8 8,-1 After one strategy is eliminated, others may be dominated in the game that remains.

  11. Pursuit/Evasion Example

  12. Exercise: Construct a payoff matrix. Available actions: Left, Right Left, Right Incentives: +5 catch:+1 +3 miss: −1 0

  13. Exercise: Identify all Nash equilibria. L R L 0,1 5,-1 R 3,-1 0,1 • There are no pure-strategy equilibria! • We need mixed strategies • Agents should deliberately randomize their actions

  14. More than two agents • Add more dimensions to the array. • Add more utilities to each outcome tuple. Player 1 u 1 ,u 2 ,u 3 Player 2 In a game with N agents that each have S actions, how many utilities does the payoff matrix contain?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend