CSC304 Lecture 6 Game Theory : Security games, Applications to security
CSC304 - Nisarg Shah 1
CSC304 Lecture 6 Game Theory : Security games, Applications to - - PowerPoint PPT Presentation
CSC304 Lecture 6 Game Theory : Security games, Applications to security CSC304 - Nisarg Shah 1 Recap Last lecture Zero-sum games The minimax theorem Assignment 1 posted Might add one or two questions (more if you think
CSC304 - Nisarg Shah 1
CSC304 - Nisarg Shah 2
➢ Zero-sum games ➢ The minimax theorem
➢ Might add one or two questions (more if you think it’s a
piece of cake)
➢ Kept my promise (approximately) ➢ Due: October 11 by 3pm
CSC304 - Nisarg Shah 3
CSC304 - Nisarg Shah 4
➢ Cannot later backtrack
➢ Follower must believe leader’s commitment is credible
➢ Can assume to be a pure strategy
CSC304 - Nisarg Shah 5
➢ Can’t I, as player 1, commit to playing 𝑦1 in a
➢ Player 2 wouldn’t believe you. I’ll play 𝑦1. No you won’t. I’m playing 𝑦2; 𝑦1 is not a best response. Doesn’t
committing. Yeah right.
CSC304 - Nisarg Shah 6
CSC304 - Nisarg Shah 7
➢ Can also represent “information sets”, multiple moves, …
Player 1 Player 2 Player 2 (1,1) (3,0) (0,0) (2,1)
CSC304 - Nisarg Shah 8
➢ Continuous spectrum of possible actions
0.5 Up, 0.5 Down Player 1 Player 2 Player 2 (1,1) (3,0) (0,0) (2,1)
Player 2
CSC304 - Nisarg Shah 9
P1 P2 Left Right Up (1 , 1) (3 , 0) Down (0 , 0) (2 , 1)
CSC304 - Nisarg Shah 10
P1 P2 Left Right Up (1 , 1) (3 , 0) Down (0 , 0) (2 , 1)
CSC304 - Nisarg Shah 11
P1 P2 Left Right Up (1 , 1) (3 , 0) Down (0 , 0) (2 , 1)
CSC304 - Nisarg Shah 12
➢ If P1 commits to playing Up and Down with probabilities
0.49 and 0.51, respectively…
➢ P2 is still better off playing Right than Left, in expectation ➢ 𝔽[Reward] for P1 increases to ~2.5
P1 P2 Left Right Up (1 , 1) (3 , 0) Down (0 , 0) (2 , 1)
CSC304 - Nisarg Shah 13
➢ In Stackelberg, he must commit in advance, while in
➢ A: No. The optimal reward for the leader in the
Stackelberg game is always greater than or equal to his maximum reward under any Nash equilibrium of the simultaneous-move version.
CSC304 - Nisarg Shah 14
➢ the leader can commit to mixed strategies ➢ the follower knows (and trusts) the leader’s commitment ➢ the leader knows the follower’s reward structure
CSC304 - Nisarg Shah 15
𝑦1 min 𝑦2
𝑦2 max 𝑦1
CSC304 - Nisarg Shah 16
𝑦1
𝑦2
CSC304 - Nisarg Shah 17
max Σ𝑡1∈𝑇1𝑦1 𝑡1 ⋅ 𝜌1(𝑡1, 𝑡2
∗)
subject to ∀𝑡2 ∈ 𝑇2, Σ𝑡1∈𝑇1 𝑦1 𝑡1 ⋅ 𝜌2 𝑡1, 𝑡2
∗
≥ Σ𝑡1∈𝑇1𝑦1 𝑡1 ⋅ 𝜌2 𝑡1, 𝑡2 Σ𝑡1∈𝑇1𝑦1 𝑡1 = 1 ∀𝑡1 ∈ 𝑇1, 𝑦1 𝑡1 ≥ 0
∗,
take the maximum
to 𝑡2
∗ optimizes over
all 𝑦1 for which 𝑡2
∗ is
the best response
CSC304 - Nisarg Shah 18
➢ Defender (leader) has 𝑙 identical
patrol units
➢ Defender wants to defend a set of 𝑜
targets 𝑈
➢ In a pure strategy, each resource can
protect a subset of targets 𝑇 ⊆ 𝑈 from a given collection 𝒯
➢ A target is covered if it is protected by
at least one resource
➢ Attacker wants to select a target to
attack
CSC304 - Nisarg Shah 19
➢ For each target, the defender and the
attacker have two utilities: one if the target is covered, one if it is not.
➢ Defender commits to a mixed
strategy; attacker follows by choosing a target to attack.
CSC304 - Nisarg Shah 20
➢ In security games, this is 𝒯 𝑙 ➢ Exponential in 𝑙
CSC304 - Nisarg Shah 21
CSC304 - Nisarg Shah 22
➢ Must return home
➢ Continuous-time strategies
➢ Bathroom breaks !!!
➢ Poachers are not fully rational