Multi-agent learning
Satis ing pla yGerard Vreeswijk, Intelligent Software Systems, Computer Science Department, Faculty of Sciences, Utrecht University, The Netherlands.
Tuesday 16th June, 2020
Multi-agent learning Satising pla y Gerard Vreeswijk , Intelligent - - PowerPoint PPT Presentation
Multi-agent learning Satising pla y Gerard Vreeswijk , Intelligent Software Systems, Computer Science Department, Faculty of Sciences, Utrecht University, The Netherlands. Tuesday 16 th June, 2020 Assumptions in game playing Author: Gerard
Gerard Vreeswijk, Intelligent Software Systems, Computer Science Department, Faculty of Sciences, Utrecht University, The Netherlands.
Tuesday 16th June, 2020
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
■ Players can observe other
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
■ Players can observe other
■ . . . other player’s payoffs.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 2
■ Players know the the
■ Players can observe other
■ . . . other player’s payoffs. ■ Players are aware that they
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 3
■ Players don’t know the
■ Players can’t observe other
■ Players can’t observe other
■ Players aren’t aware that they
■ Reinforcement learning.
■ Satisficing learning.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
reasons, Vol. 3. MIT Press, 1997.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
reasons, Vol. 3. MIT Press, 1997.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
reasons, Vol. 3. MIT Press, 1997.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 4
reasons, Vol. 3. MIT Press, 1997.
American Economic Review, Vol. 69(4), pp. 493-513.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 5
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
aspiration level p ersisten e rateAuthor: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
aspiration level p ersisten e rateAuthor: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
■ Satisficing algorithm:
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 6
■ At any time, t, the agent’s state is a tuple (At, αt).
■ Satisficing algorithm:
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 7
2 4 6 8 10 t 1 2 3 4 5 Α
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 8
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
■ Take a 2-player 3 × 3 game in
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
■ Take a 2-player 3 × 3 game in
■ Plot all 9 pure payoff profiles
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
■ Take a 2-player 3 × 3 game in
■ Plot all 9 pure payoff profiles
■ Initialize, say, 100 profiles.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
■ Take a 2-player 3 × 3 game in
■ Plot all 9 pure payoff profiles
■ Initialize, say, 100 profiles.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 9
■ Take a 2-player 3 × 3 game in
■ Plot all 9 pure payoff profiles
■ Initialize, say, 100 profiles.
■ Execute satisficing play for
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 10
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 11
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
■ Use Karandikar et al.’s algorithm.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
■ Use Karandikar et al.’s algorithm.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
■ Use Karandikar et al.’s algorithm.
◆ (At, αt) for the row player.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
■ Use Karandikar et al.’s algorithm.
◆ (At, αt) for the row player. ◆ (Bt, βt) for the column player.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 12
■ Generalised payoff matrix
■ Use Karandikar et al.’s algorithm.
◆ (At, αt) for the row player. ◆ (Bt, βt) for the column player.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 13
t
t
t
t .
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 14
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 15
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 16
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 17
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 18
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 19
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 20
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 21
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 22
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 23
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 23
■ Initial aspiration of
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 23
■ Initial aspiration of
■ White: convergence to
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 23
■ Initial aspiration of
■ White: convergence to
■ (A0, B0) = (D, D),
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 23
■ Initial aspiration of
■ White: convergence to
■ (A0, B0) = (D, D),
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 24
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 25
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 26
■ Initial aspiration of
■ White: convergence to
■ (A0, B0) = (C, C),
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 27
■ Initial aspiration of
■ White: convergence to
■ (A0, B0) = (D, C),
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 28
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 29
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 30
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 31
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 32
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
aspiration level reinfo r ement in rement HypAuthor: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
■ Define the
reinfo r ement in rement for every action x inx =Def u(x, yt) − ¯
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
■ Define the
reinfo r ement in rement for every action x inx =Def u(x, yt) − ¯
■ Define the propensities in
x
s=1
x
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
■ Define the
reinfo r ement in rement for every action x inx =Def u(x, yt) − ¯
■ Define the propensities in
x
s=1
x
■ This is like standard
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
■ Define the
reinfo r ement in rement for every action x inx =Def u(x, yt) − ¯
■ Define the propensities in
x
s=1
x
■ This is like standard
■
HypAuthor: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 33
■ Regret matching can be cast in
■ Define the
reinfo r ement in rement for every action x inx =Def u(x, yt) − ¯
■ Define the propensities in
x
s=1
x
■ This is like standard
■
HypAuthor: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 34
■ Agents should have high enough initial aspirations. ■ Agents should learn, but slowly. ■ The difference between payoffs for mutual defection and mutual
■ Agents should start out with similar behavior.
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
single mixed strategy p robabilit y distribution■ Like fictitious play, players model (or
■ Strategies are not played, only
■ Due to CKR (common knowledge of
■ Players gradually adapt their mixed
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ Like fictitious play, players model (or
■ Strategies are not played, only
■ Due to CKR (common knowledge of
■ Players gradually adapt their mixed
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ Like fictitious play, players model (or
■ Strategies are not played, only
■ Due to CKR (common knowledge of
■ Players gradually adapt their mixed
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
■ Like fictitious play, players model (or
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
■ Like fictitious play, players model (or
■ Strategies are not played, only
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
■ Like fictitious play, players model (or
■ Strategies are not played, only
■ Due to CKR (common knowledge of
Author: Gerard Vreeswijk. Slides last modified on June 16th, 2020 at 12:21 Multi-agent learning: Satisficing play, slide 35
■ With fictitious play,
■ With Bayesian play,
■ Like fictitious play, players model (or
■ Strategies are not played, only
■ Due to CKR (common knowledge of
■ Players gradually adapt their mixed