CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - - PowerPoint PPT Presentation

cs344m autonomous multiagent systems
SMART_READER_LITE
LIVE PREVIEW

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Patrick MacAlpine Good Afternoon, Colleagues Are there any


slide-1
SLIDE 1

CS344M Autonomous Multiagent Systems

Patrick MacAlpine Department of Computer Science The University of Texas at Austin

slide-2
SLIDE 2

Good Afternoon, Colleagues

Are there any questions?

Patrick MacAlpine

slide-3
SLIDE 3

Good Afternoon, Colleagues

Are there any questions?

  • Pending questions:

− How are agents like automatons? − What is episodic? − What is deterministic? − Set theory in states/actions? − Is a pencil an agent?

Patrick MacAlpine

slide-4
SLIDE 4

Logistics

  • First assignment: how did it go?

Patrick MacAlpine

slide-5
SLIDE 5

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal

Patrick MacAlpine

slide-6
SLIDE 6

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal

− Help each other with C issues — parsing strings

Patrick MacAlpine

slide-7
SLIDE 7

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal

− Help each other with C issues — parsing strings − Evaluating mostly on the logic — does the agent “do the right thing?”

Patrick MacAlpine

slide-8
SLIDE 8

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal

− Help each other with C issues — parsing strings − Evaluating mostly on the logic — does the agent “do the right thing?”

  • 2D or 3D?

Patrick MacAlpine

slide-9
SLIDE 9

Self-Introductions

  • Speak loudly

Patrick MacAlpine

slide-10
SLIDE 10

Self-Introductions

  • Speak loudly
  • Name, year, major

Patrick MacAlpine

slide-11
SLIDE 11

Self-Introductions

  • Speak loudly
  • Name, year, major
  • At least one other thing about yourself

Patrick MacAlpine

slide-12
SLIDE 12

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?

Patrick MacAlpine

slide-13
SLIDE 13

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?
  • Can you do better?

Patrick MacAlpine

slide-14
SLIDE 14

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?
  • Can you do better?
  • Do they need to be social? persistent?
  • Can they cease to be agents in a different environment?
  • Autonomy

Patrick MacAlpine

slide-15
SLIDE 15

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

Patrick MacAlpine

slide-16
SLIDE 16

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.

Patrick MacAlpine

slide-17
SLIDE 17

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum?

Patrick MacAlpine

slide-18
SLIDE 18

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum? − Decide how to act so as to accomplish delegated goals

Patrick MacAlpine

slide-19
SLIDE 19

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum? − Decide how to act so as to accomplish delegated goals

  • Also mentions adjustable autonomy

Patrick MacAlpine

slide-20
SLIDE 20

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Patrick MacAlpine

slide-21
SLIDE 21

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents

Patrick MacAlpine

slide-22
SLIDE 22

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents

Patrick MacAlpine

slide-23
SLIDE 23

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience

Patrick MacAlpine

slide-24
SLIDE 24

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience Autonomous Bidding, Cognitive Systems, Traffic management, Robot Soccer

Patrick MacAlpine

slide-25
SLIDE 25

Environments

Environment = ⇒ sensations, actions

Patrick MacAlpine

slide-26
SLIDE 26

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)

Patrick MacAlpine

slide-27
SLIDE 27

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic

Patrick MacAlpine

slide-28
SLIDE 28

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic

Patrick MacAlpine

slide-29
SLIDE 29

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic

Patrick MacAlpine

slide-30
SLIDE 30

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic
  • discrete vs. continuous

Patrick MacAlpine

slide-31
SLIDE 31

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic
  • discrete vs. continuous
  • single-agent vs. multiagent

Patrick MacAlpine

slide-32
SLIDE 32

The Decision

Patrick MacAlpine

slide-33
SLIDE 33

The Decision

  • reactive vs. deliberative

Patrick MacAlpine

slide-34
SLIDE 34

The Decision

  • reactive vs. deliberative
  • multiagent reasoning?

Patrick MacAlpine

slide-35
SLIDE 35

The Decision

  • reactive vs. deliberative
  • multiagent reasoning?
  • learning?

Patrick MacAlpine

slide-36
SLIDE 36

Formalizing My Example

Knowns:

  • O = {Blue, Red, Green, Black, . . .}
  • Rewards in IR
  • A = {Wave, Clap, Stand}
  • 0, a0, r0, o1, a1, r1, o2, . . .

Unknowns:

  • S = 4x3 grid
  • R : S × A → IR
  • P = S → O
  • T : S × A → S
  • i = P(si)

ri = R(si, ai) si+1 = T (si, ai)

Patrick MacAlpine

slide-37
SLIDE 37

Standard/Reactive/State-based Agents

  • Standard agent:

Patrick MacAlpine

slide-38
SLIDE 38

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

Patrick MacAlpine

slide-39
SLIDE 39

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

Patrick MacAlpine

slide-40
SLIDE 40

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A

Patrick MacAlpine

slide-41
SLIDE 41

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

Patrick MacAlpine

slide-42
SLIDE 42

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent:

Patrick MacAlpine

slide-43
SLIDE 43

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent:

action : I → A, next : I × P → I

Patrick MacAlpine

slide-44
SLIDE 44

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent:

action : I → A, next : I × P → I It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power.

Patrick MacAlpine

slide-45
SLIDE 45

Standard/Reactive/State-based Agents

  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent:

action : I → A, next : I × P → I It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power. Reactive agents for next Thursday’s assignment task?

Patrick MacAlpine

slide-46
SLIDE 46

Discussion

What are some tasks that are partially observable, non-deterministic, dynamic, continuous, and multi-agent?

Patrick MacAlpine

slide-47
SLIDE 47

Discussion

What are some tasks that are partially observable, non-deterministic, dynamic, continuous, and multi-agent? Can we possibly expect an agent to perform well in such tasks?

Patrick MacAlpine