SLIDE 1

Single agent or multiple agents

Many domains are characterized by multiple agents rather than a single agent. Game theory studies what agents should do in a multi-agent setting. Agents can be cooperative, competitive, or somewhere in between. Agents that are strategic cannot be modeled as nature.

© D. Poole and A. Mackworth 2017. Artificial Intelligence, Lecture 11.1.

SLIDE 6

Multi-agent framework

• Each agent can have its own utility.
• Agents select actions autonomously.
• Agents can have different information.
• The outcome can depend on the actions of all of the agents.
• Each agent’s value depends on the outcome.

SLIDE 11

Fully Observable + Multiple Agents

If agents act sequentially and can observe the state before acting: Perfect Information Games.

• Can do dynamic programming or search: each agent maximizes for itself.
• Multi-agent MDPs: a value function for each agent; each agent maximizes its own value function.
• Multi-agent reinforcement learning: each agent has its own Q function.
• Two-person, competitive (zero-sum) ⟹ minimax.
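For the two-person zero-sum case, minimax can be sketched as a depth-first recursion over the game tree. This is a minimal illustration; the nested-list tree encoding is an assumption of this sketch, not from the slides.

```python
def minimax(node, maximizing):
    """Minimax value of a game tree given as nested lists.

    Leaves are payoffs to the maximizing player; in a zero-sum game the
    minimizing player's payoff is the negation, so one number suffices.
    """
    if not isinstance(node, list):
        return node  # leaf: payoff to the maximizer
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

# Maximizer moves first; each inner list is the minimizer's choice point.
tree = [[3, 12], [2, 8]]
```

Here `minimax(tree, True)` picks the branch whose worst-case leaf is best: min(3, 12) = 3 beats min(2, 8) = 2.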

SLIDE 14

Normal Form of a Game

The strategic form of a game, or normal-form game, consists of:
• a finite set I of agents, {1, . . . , n};
• a set of actions Ai for each agent i ∈ I;
• an action profile σ, a tuple ⟨a1, . . . , an⟩ meaning agent i carries out ai;
• a utility function utility(σ, i) for action profile σ and agent i ∈ I, giving the expected utility for agent i when all agents follow action profile σ.

SLIDE 15

Rock-Paper-Scissors

                     Bob
              rock    paper   scissors
Alice rock     0,0    −1,1     1,−1
      paper    1,−1    0,0    −1,1
      scissors −1,1    1,−1    0,0
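As a concrete instance of the normal-form definition, rock-paper-scissors fits in a few lines. This encoding (action names, agent indices 0 for Alice and 1 for Bob) is a sketch of mine, not the authors' code.

```python
# Rock-paper-scissors in strategic form: two agents, the same action set
# for each, and a utility function over action profiles.
ACTIONS = ['rock', 'paper', 'scissors']
BEATS = {'rock': 'scissors', 'paper': 'rock', 'scissors': 'paper'}

def utility(profile, agent):
    """Utility of profile (alice_action, bob_action) for agent 0 or 1."""
    alice, bob = profile
    mine, theirs = (alice, bob) if agent == 0 else (bob, alice)
    if mine == theirs:
        return 0
    return 1 if BEATS[mine] == theirs else -1
```

For example, `utility(('paper', 'rock'), 0)` is 1 and `utility(('paper', 'rock'), 1)` is −1; the game is zero sum, so the two utilities always cancel.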

SLIDE 16

Extensive Form of a Game

Game tree: Andy first chooses keep, share, or give; Barb then answers yes or no at each of her three nodes. Payoffs (Andy, Barb):

  keep:  yes 2,0   no 0,0
  share: yes 1,1   no 0,0
  give:  yes 0,2   no 0,0

SLIDE 17

Extensive Form of an imperfect-information Game

Game tree: Alice chooses rock, paper, or scissors; Bob then chooses rock, paper, or scissors without observing Alice’s choice, so his three nodes form a single information set. The payoffs at the leaves are those of the rock-paper-scissors matrix. Bob cannot distinguish the nodes in an information set.

SLIDE 18

Multiagent Decision Networks

[Decision network with nodes: Fire; Alarm1, Alarm2; decision nodes Call1, Call2; Call Works; Fire Dept Comes; utility nodes U1, U2.]

Each decision node is owned by an agent; there is a utility (value) node for each agent.

SLIDE 19

Multiple Agents, shared value

[Diagram: a decision network in which multiple agents share a single value node.]

SLIDE 20

Complexity of Multi-agent decision theory

It can be exponentially harder to find an optimal multi-agent policy, even with shared values. Why? Because dynamic programming doesn’t work:

◮ If a decision node has n binary parents, dynamic programming lets us solve 2^n decision problems.
◮ This is much better than searching over the d^(2^n) policies (where d is the number of decision alternatives).

Multiple agents with shared values are equivalent to a single forgetful agent.
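The gap between the two counts is easy to appreciate numerically. Illustrative figures only, with d = 2 decision alternatives and n = 5 binary parents:

```python
d, n = 2, 5
# Dynamic programming solves one decision problem per parent assignment:
contexts = 2 ** n
# A naive search considers every policy, i.e. a choice of one of d
# actions for each of the 2^n parent assignments:
policies = d ** (2 ** n)

print(contexts)  # 32
print(policies)  # 4294967296, i.e. 2**32
```

Even for this tiny decision node, 32 subproblems versus over four billion candidate policies.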

SLIDE 21

Partial Observability and Competition

Probability of a goal:

               goalie
               left   right
kicker left    0.6    0.2
       right   0.3    0.9

SLIDE 22

Stochastic Policies

[Plot: P(goal) as a function of the kicker’s randomization probability pk, shown for the goalie’s pure strategies pj = 1 and pj = 0.]
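The two lines in the plot can be reproduced directly from the goal-probability table. A sketch, where pk is the kicker's probability of kicking left and pj the goalie's probability of jumping left (my parameter names):

```python
def p_goal(pk, pj):
    """P(goal) under stochastic policies, using the table above."""
    return (pk * pj * 0.6 + pk * (1 - pj) * 0.2
            + (1 - pk) * pj * 0.3 + (1 - pk) * (1 - pj) * 0.9)

# pj = 1 (goalie always left):  P(goal) = 0.3 + 0.3 * pk, increasing
# pj = 0 (goalie always right): P(goal) = 0.9 - 0.7 * pk, decreasing
```

The lines cross at pk = 0.6, where the kicker's goal probability (0.48) no longer depends on what the goalie does.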

SLIDE 23

Strategy Profiles

Assume a general n-player game.

• A strategy for an agent is a probability distribution over the actions for that agent.
• A strategy profile is an assignment of a strategy to each agent.
• A strategy profile σ has a utility for each agent. Let utility(σ, i) be the utility of strategy profile σ for agent i.
• If σ is a strategy profile: σi is the strategy of agent i in σ, and σ−i is the set of strategies of the other agents. Thus σ is σiσ−i.

SLIDE 24

Nash Equilibria

σi is a best response to σ−i if, for all other strategies σ′i for agent i, utility(σiσ−i, i) ≥ utility(σ′iσ−i, i).

A strategy profile σ is a Nash equilibrium if, for each agent i, strategy σi is a best response to σ−i. That is, a Nash equilibrium is a strategy profile such that no agent can do better by unilaterally deviating from that profile.

Theorem [Nash, 1950]: Every finite game has at least one Nash equilibrium.
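A best-response check makes the definition concrete. The sketch below verifies that the uniform mixed strategy in rock-paper-scissors is a Nash equilibrium; the payoff encoding is mine, taken from the earlier matrix.

```python
from fractions import Fraction

ACTIONS = ['rock', 'paper', 'scissors']
# (Alice, Bob) utilities from the rock-paper-scissors matrix.
PAYOFF = {('rock', 'rock'): (0, 0), ('rock', 'paper'): (-1, 1),
          ('rock', 'scissors'): (1, -1), ('paper', 'rock'): (1, -1),
          ('paper', 'paper'): (0, 0), ('paper', 'scissors'): (-1, 1),
          ('scissors', 'rock'): (-1, 1), ('scissors', 'paper'): (1, -1),
          ('scissors', 'scissors'): (0, 0)}
uniform = {a: Fraction(1, 3) for a in ACTIONS}

def expected(action, opp_mix, agent):
    """Expected utility of a pure action against the opponent's mix."""
    if agent == 0:  # Alice
        return sum(p * PAYOFF[(action, b)][0] for b, p in opp_mix.items())
    return sum(p * PAYOFF[(a, action)][1] for a, p in opp_mix.items())

# Against a uniform opponent every pure action earns exactly 0, so no
# unilateral deviation from uniform play helps either agent.
assert all(expected(a, uniform, i) == 0 for a in ACTIONS for i in (0, 1))
```

Since every pure deviation (and hence every mixture of them) yields the same expected utility, uniform play is a best response for both agents.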

SLIDE 25

Multiple Equilibria

Hawk-Dove Game:

                    Agent 2
                dove        hawk
Agent 1 dove    R/2,R/2     0,R
        hawk    R,0         −D,−D

D and R are both positive, with D ≫ R.
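Enumerating best responses exhibits the multiple equilibria directly. A sketch with illustrative values R = 2, D = 100 (any D ≫ R > 0 behaves the same):

```python
R, D = 2, 100  # illustrative values with D >> R
ACTS = ['dove', 'hawk']
U = {('dove', 'dove'): (R / 2, R / 2), ('dove', 'hawk'): (0, R),
     ('hawk', 'dove'): (R, 0),         ('hawk', 'hawk'): (-D, -D)}

def pure_nash(U):
    """All pure profiles where neither agent gains by deviating alone."""
    eq = []
    for a1 in ACTS:
        for a2 in ACTS:
            best1 = all(U[(a1, a2)][0] >= U[(x, a2)][0] for x in ACTS)
            best2 = all(U[(a1, a2)][1] >= U[(a1, x)][1] for x in ACTS)
            if best1 and best2:
                eq.append((a1, a2))
    return eq

print(pure_nash(U))  # [('dove', 'hawk'), ('hawk', 'dove')]
```

The two pure equilibria are asymmetric: each agent prefers the one where it plays hawk. (There is also a symmetric mixed equilibrium, not computed here.)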

SLIDE 26

Coordination

Just because you know the Nash equilibria doesn’t mean you know what to do:

                     Agent 2
                 shopping   football
Agent 1 shopping   2,1        0,0
        football   0,0        1,2

SLIDE 27

Prisoner’s Dilemma

Two strangers are in a game show. They each have the choice:
• Take $100 for yourself
• Give $1000 to the other player

This can be depicted as the payoff matrix:

                  Player 2
              take       give
Player 1 take  100,100   1100,0
         give  0,1100    1000,1000
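The dilemma is that "take" strictly dominates "give" for both players, even though mutual giving would leave both better off. A quick check (the encoding is mine):

```python
ACTS = ['take', 'give']
# (player1, player2) payoffs in dollars.
P = {('take', 'take'): (100, 100), ('take', 'give'): (1100, 0),
     ('give', 'take'): (0, 1100),  ('give', 'give'): (1000, 1000)}

def dominates(a, b, player):
    """True if action a strictly dominates action b for the player,
    i.e. a does strictly better whatever the other player does."""
    if player == 0:
        return all(P[(a, o)][0] > P[(b, o)][0] for o in ACTS)
    return all(P[(o, a)][1] > P[(o, b)][1] for o in ACTS)

assert dominates('take', 'give', 0) and dominates('take', 'give', 1)
# Yet mutual 'take' pays 100 each, while mutual 'give' would pay 1000.
```

So (take, take) is the unique equilibrium even though both players prefer (give, give) to it.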

SLIDE 31

Tragedy of the Commons

Example: There are 100 agents. There is a common environment that is shared amongst all agents; each agent has 1/100 of it. Each agent can choose to do an action that has a payoff of +10 but has a −100 payoff on the environment, or do nothing, with a zero payoff.

For each agent, doing the action has a payoff of 10 − 100/100 = 9. If every agent does the action, the total payoff is 100 × 10 − 100 × 100 = 1000 − 10000 = −9000.
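The arithmetic on this slide, spelled out with the numbers straight from the example:

```python
n = 100
private_gain = 10       # payoff to the agent who acts
environment_cost = 100  # damage to the shared environment per action

# One agent acting bears only its 1/n share of the damage, so acting
# looks strictly better than doing nothing:
individual = private_gain - environment_cost / n   # 10 - 1 = 9

# But if everyone acts, all the damage lands on somebody:
total = n * private_gain - n * environment_cost    # 1000 - 10000 = -9000
```

Each agent's best response is to act, yet universal action is far worse for everyone than universal restraint.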

SLIDE 34

Extensive Form of a Game

What are the Nash equilibria of:

Game tree: Andy first chooses keep, share, or give; Barb then answers yes or no at each of her three nodes. Payoffs (Andy, Barb):

  keep:  yes 2,0   no 0,0
  share: yes 1,1   no 0,0
  give:  yes 0,2   no 0,0

What if the 2,0 payoff was 1.9,0.1? Should Barb be rational / predictable?
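For perfect-information trees like this one, backward induction finds the subgame-perfect outcome. In the sketch below, Barb's ties are broken in favour of "yes": a modelling choice of mine, and exactly the tie the 1.9,0.1 variant removes.

```python
ANDY, BARB = ['keep', 'share', 'give'], ['yes', 'no']
# payoffs[(andy_action, barb_reply)] = (andy_utility, barb_utility)
payoffs = {('keep', 'yes'): (2, 0),  ('keep', 'no'): (0, 0),
           ('share', 'yes'): (1, 1), ('share', 'no'): (0, 0),
           ('give', 'yes'): (0, 2),  ('give', 'no'): (0, 0)}

def backward_induction(payoffs):
    """Barb, moving last, maximizes her own utility at each node
    (ties broken toward 'yes'); Andy chooses anticipating her replies."""
    reply = {a: max(BARB, key=lambda r: payoffs[(a, r)][1]) for a in ANDY}
    andy = max(ANDY, key=lambda a: payoffs[(a, reply[a])][0])
    return andy, reply[andy]

print(backward_induction(payoffs))  # ('keep', 'yes')
```

If Barb instead threatened "no" after keep, Andy's calculation would change, which is what the rationality/predictability question is probing.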

SLIDE 35

Computing Nash Equilibria

To compute a Nash equilibrium for a game in strategic form:
• Eliminate dominated strategies.
• Determine which actions will have non-zero probabilities. This is the support set.
• Determine the probability for the actions in the support set.

SLIDE 36

Eliminating Dominated Strategies

                   Agent 2
               d2     e2     f2
Agent 1 a1     3,5    5,1    1,2
        b1     1,1    2,9    6,4
        c1     2,6    4,7    0,8
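A sketch of iterated elimination on this table. Note it only checks domination by pure strategies; domination by mixed strategies can eliminate more, as mentioned below.

```python
ROWS, COLS = ['a1', 'b1', 'c1'], ['d2', 'e2', 'f2']
U = {('a1', 'd2'): (3, 5), ('a1', 'e2'): (5, 1), ('a1', 'f2'): (1, 2),
     ('b1', 'd2'): (1, 1), ('b1', 'e2'): (2, 9), ('b1', 'f2'): (6, 4),
     ('c1', 'd2'): (2, 6), ('c1', 'e2'): (4, 7), ('c1', 'f2'): (0, 8)}

def eliminate(rows, cols):
    """Iteratively remove strictly dominated pure strategies."""
    changed = True
    while changed:
        changed = False
        for r in rows[:]:   # r is dominated if some r2 beats it everywhere
            if any(all(U[(r2, c)][0] > U[(r, c)][0] for c in cols)
                   for r2 in rows if r2 != r):
                rows.remove(r); changed = True
        for c in cols[:]:
            if any(all(U[(r, c2)][1] > U[(r, c)][1] for r in rows)
                   for c2 in cols if c2 != c):
                cols.remove(c); changed = True
    return rows, cols

print(eliminate(ROWS[:], COLS[:]))  # (['a1', 'b1'], ['d2', 'e2', 'f2'])
```

Pure-strategy elimination removes only c1 (dominated by a1). Allowing mixed domination, f2 is dominated by an even mix of d2 and e2; removing it makes b1 dominated by a1, then e2 by d2, leaving the single profile (a1, d2).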

SLIDE 41

Computing probabilities in randomized strategies

Given a support set:
• Why would an agent randomize between actions a1 . . . ak? Actions a1 . . . ak must have the same value for that agent, given the strategies of the other agents.
• This forms a set of simultaneous equations, where the variables are the probabilities of the actions.
• If there is a solution with all the probabilities in the range (0, 1), this is a Nash equilibrium.
• Search over support sets to find a Nash equilibrium.
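Worked through for the penalty-kick game from earlier, with both actions in each player's support: each player's indifference condition is one linear equation, and the closed-form expressions below follow from solving it. Exact fractions; the table encoding is mine.

```python
from fractions import Fraction as F

# P(goal) for (kicker, goalie) actions; kicker maximizes, goalie minimizes.
G = {('l', 'l'): F(6, 10), ('l', 'r'): F(2, 10),
     ('r', 'l'): F(3, 10), ('r', 'r'): F(9, 10)}
den = G[('l', 'l')] - G[('l', 'r')] - G[('r', 'l')] + G[('r', 'r')]

# Kicker indifferent between l and r fixes the goalie's P(left) = pj:
#   0.6*pj + 0.2*(1 - pj) = 0.3*pj + 0.9*(1 - pj)
pj = (G[('r', 'r')] - G[('l', 'r')]) / den
# Goalie indifferent between l and r fixes the kicker's P(left) = pk:
#   0.6*pk + 0.3*(1 - pk) = 0.2*pk + 0.9*(1 - pk)
pk = (G[('r', 'r')] - G[('r', 'l')]) / den

value = sum(p * q * G[(k, j)]
            for k, p in (('l', pk), ('r', 1 - pk))
            for j, q in (('l', pj), ('r', 1 - pj)))
print(pk, pj, value)  # 3/5 7/10 12/25
```

Both probabilities lie strictly inside (0, 1), so this support set yields a Nash equilibrium: the kicker kicks left with probability 0.6, the goalie jumps left with probability 0.7, and a goal is scored with probability 0.48.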

SLIDE 43

Learning to Coordinate

Each agent maintains P[A], a probability distribution over actions, and Q[A], an estimate of the value of doing A given the policies of the other agents. Repeat:

◮ select action a using distribution P
◮ do a and observe the payoff
◮ update Q: Q[a] ← Q[a] + α(payoff − Q[a])
◮ increment the probability of the best action by δ
◮ decrement the probabilities of the other actions
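A minimal sketch of this learner. The class name, the default α and δ, the tie-breaking toward the first action, and the renormalization step are my choices, not prescribed by the slide.

```python
import random

class CoordLearner:
    """One agent's learning-to-coordinate loop: a distribution P over
    actions and a running value estimate Q per action."""

    def __init__(self, actions, alpha=0.1, delta=0.02):
        self.actions = list(actions)
        self.alpha, self.delta = alpha, delta
        self.P = {a: 1.0 / len(self.actions) for a in self.actions}
        self.Q = {a: 0.0 for a in self.actions}

    def select(self, rng=random):
        """Sample an action from the distribution P."""
        r, acc = rng.random(), 0.0
        for a in self.actions:
            acc += self.P[a]
            if r <= acc:
                return a
        return self.actions[-1]

    def update(self, a, payoff):
        # Q[a] <- Q[a] + alpha * (payoff - Q[a])
        self.Q[a] += self.alpha * (payoff - self.Q[a])
        # shift probability mass delta toward the current best action
        best = max(self.actions, key=lambda x: self.Q[x])
        for x in self.actions:
            if x == best:
                self.P[x] = min(1.0, self.P[x] + self.delta)
            else:
                self.P[x] = max(0.0,
                                self.P[x] - self.delta / (len(self.actions) - 1))
        total = sum(self.P.values())  # renormalize after clipping
        for x in self.actions:
            self.P[x] /= total
```

Run against a fixed payoff source, the probability mass migrates to the best action; in self-play on the coordination game above, two such learners tend to lock in to one of the two pure equilibria.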
