CS344M Autonomous Multiagent Systems Todd Hester Department of - - PowerPoint PPT Presentation

cs344m autonomous multiagent systems
SMART_READER_LITE
LIVE PREVIEW

CS344M Autonomous Multiagent Systems Todd Hester Department of - - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Todd Hester Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Todd Hester Good Afternoon, Colleagues Are there any questions? Todd


slide-1
SLIDE 1

CS344M Autonomous Multiagent Systems

Todd Hester Department of Computer Science The University of Texas at Austin

slide-2
SLIDE 2

Good Afternoon, Colleagues

Are there any questions?

Todd Hester

slide-3
SLIDE 3

Good Afternoon, Colleagues

Are there any questions?

Todd Hester

slide-4
SLIDE 4

Logistics

  • First assignment: how did it go?

Todd Hester

slide-5
SLIDE 5

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal and passing

Todd Hester

slide-6
SLIDE 6

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal and passing

− Help each other with C issues – parsing strings

Todd Hester

slide-7
SLIDE 7

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal and passing

− Help each other with C issues – parsing strings − Evaluating mostly on the logic – does the agent “do the right thing?”

Todd Hester

slide-8
SLIDE 8

Logistics

  • First assignment: how did it go?
  • Next soccer assignment: score a goal and passing

− Help each other with C issues – parsing strings − Evaluating mostly on the logic – does the agent “do the right thing?”

  • 2D or 3D?

Todd Hester

slide-9
SLIDE 9

Self-Introductions

  • Speak loudly

Todd Hester

slide-10
SLIDE 10

Self-Introductions

  • Speak loudly
  • Name, year, major

Todd Hester

slide-11
SLIDE 11

Self-Introductions

  • Speak loudly
  • Name, year, major
  • At least one other thing about yourself

Todd Hester

slide-12
SLIDE 12

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?

Todd Hester

slide-13
SLIDE 13

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?
  • Can you do better?

Todd Hester

slide-14
SLIDE 14

Discussion

An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to affect what it senses in the future.

  • Is this a good definition?
  • The authors claim is is a “formal” definition of agents. Is it?
  • Can you do better?
  • Do they need to be social? persistent?
  • Can they cease to be agents in a different environment?
  • Autonomy

Todd Hester

slide-15
SLIDE 15

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

Todd Hester

slide-16
SLIDE 16

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.

Todd Hester

slide-17
SLIDE 17

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum?

Todd Hester

slide-18
SLIDE 18

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum? − Decide how to act so as to accomplish delegated goals

Todd Hester

slide-19
SLIDE 19

Varieties of Autonomy

  • Do we have complete freedom over our beliefs, goals,

and actions?

  • Software service has no autonomy — does what it’s told.
  • What’s Wooldridge’s take on where autonomous agents

lie on the spectrum? − Decide how to act so as to accomplish delegated goals

  • Also mentions adjustable autonomy

Todd Hester

slide-20
SLIDE 20

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Todd Hester

slide-21
SLIDE 21

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents

Todd Hester

slide-22
SLIDE 22

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents

Todd Hester

slide-23
SLIDE 23

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience

Todd Hester

slide-24
SLIDE 24

My Requirements of Agents

  • They must sense their environment.
  • They must decide what action to take (“think”).
  • They must act in their environment.

Complete Agents Multiagent systems: Interact with other agents Learning agents: Improve performance from experience Autonomous Bidding, Cognitive Systems, Traffic management, Robot Soccer

Todd Hester

slide-25
SLIDE 25

Environments

Environment = ⇒ sensations, actions

Todd Hester

slide-26
SLIDE 26

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)

Todd Hester

slide-27
SLIDE 27

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic

Todd Hester

slide-28
SLIDE 28

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic

Todd Hester

slide-29
SLIDE 29

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic

Todd Hester

slide-30
SLIDE 30

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic
  • discrete vs. continuous

Todd Hester

slide-31
SLIDE 31

Environments

Environment = ⇒ sensations, actions

  • fully observable vs. partially observable (accessible)
  • deterministic vs. non-deterministic
  • episodic vs. non-episodic
  • static vs. dynamic
  • discrete vs. continuous
  • single-agent vs. multiagent

Todd Hester

slide-32
SLIDE 32

The Decision

Todd Hester

slide-33
SLIDE 33

The Decision

  • reactive vs. deliberative

Todd Hester

slide-34
SLIDE 34

The Decision

  • reactive vs. deliberative
  • multiagent reasoning?

Todd Hester

slide-35
SLIDE 35

The Decision

  • reactive vs. deliberative
  • multiagent reasoning?
  • learning?

Todd Hester

slide-36
SLIDE 36

Formalizing My Example

Knowns:

  • O = {Blue, Red, Green, Black, . . .}
  • Rewards in IR
  • A = {Wave, Clap, Stand}
  • 0, a0, r0, o1, a1, r1, o2, . . .

Unknowns:

  • S = 4x3 grid
  • R : S × A → IR
  • P = S → O
  • T : S × A → S
  • i = P(si)

ri = R(si, ai) si+1 = T (si, ai)

Todd Hester

slide-37
SLIDE 37

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I

Todd Hester

slide-38
SLIDE 38

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

Todd Hester

slide-39
SLIDE 39

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

Todd Hester

slide-40
SLIDE 40

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

Todd Hester

slide-41
SLIDE 41

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A

Todd Hester

slide-42
SLIDE 42

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

Todd Hester

slide-43
SLIDE 43

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent:

Todd Hester

slide-44
SLIDE 44

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent: action : I → A, next : I × P → I

Todd Hester

slide-45
SLIDE 45

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent: action : I → A, next : I × P → I

It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power.

Todd Hester

slide-46
SLIDE 46

Standard/Reactive/State-based Agents

  • Observation P, Action A, Internal State I
  • Standard agent:

action : P∗ → A

  • Reactive agent:

action : P → A − Decision based entirely on the present

  • State-based agent: action : I → A, next : I × P → I

It is worth observing that state-based agents as defined here are in fact no more powerful than the standard agents we introduced earlier. In fact, they are identical in their expressive power.

Todd Hester

slide-47
SLIDE 47

Reactive agents for next Thursday’s assignment task?

Todd Hester

slide-48
SLIDE 48

Discussion

What are some tasks that are partially observable, non-deterministic, dynamic, continuous, and multi-agent? Can we possibly expect an agent to perform well in such tasks?

Todd Hester