Introduction to AI & Intelligent Agents



SLIDE 1

Introduction to AI & Intelligent Agents

Russell & Norvig, Chapters 1-2 (CS-271, Fall 2010)


SLIDE 2

What is Artificial Intelligence?

  • Thought processes vs. behavior
  • Human-like vs. rational-like
  • How to simulate human intellect and behavior by a machine:
    – Mathematical problems (puzzles, games, theorems)
    – Common-sense reasoning
    – Expert knowledge: lawyers, medicine, diagnosis
    – Social behavior
    – Web and online intelligence
    – Planning for assembly and logistics operations

  • Things we call “intelligent” if done by a human.


SLIDE 3

What is AI? Views of AI fall into four categories:

  • Thinking humanly
  • Thinking rationally
  • Acting humanly
  • Acting rationally

The textbook advocates "acting rationally".


SLIDE 4

What is Artificial Intelligence? (John McCarthy, Basic Questions)

  • What is artificial intelligence?
  • It is the science and engineering of making intelligent machines, especially intelligent computer programs. It is related to the similar task of using computers to understand human intelligence, but AI does not have to confine itself to methods that are biologically observable.

  • Yes, but what is intelligence?
  • Intelligence is the computational part of the ability to achieve goals in the world. Varying kinds and degrees of intelligence occur in people, many animals and some machines.

  • Isn't there a solid definition of intelligence that doesn't depend on relating it to human intelligence?
  • Not yet. The problem is that we cannot yet characterize in general what kinds of computational procedures we want to call intelligent. We understand some of the mechanisms of intelligence and not others.


  • More in: http://www-formal.stanford.edu/jmc/whatisai/node1.html
SLIDE 5

What is Artificial Intelligence

  • Thought processes

– "The exciting new effort to make computers think ... machines with minds, in the full and literal sense" (Haugeland, 1985)

  • Behavior

– "The study of how to make computers do things at which, at the moment, people are better." (Rich and Knight, 1991)


SLIDE 6

AI as “Raisin Bread”

  • Esther Dyson [predicted] AI would [be] embedded in main-stream, strategically important systems, like raisins in a loaf of raisin bread.
  • Time has proven Dyson's prediction correct.
  • Emphasis shifts away from replacing expensive human experts with stand-alone expert systems toward main-stream computing systems that create strategic advantage.
  • Many of today's AI systems are connected to large databases, they deal with legacy data, they talk to networks, they handle noise and data corruption with style and grace, they are implemented in popular languages, and they run on standard operating systems.

  • Humans usually are important contributors to the total solution.


  • Adapted from Patrick Winston, Former Director, MIT AI Laboratory
SLIDE 7

Agents and environments

Compare: Standard Embedded System Structure

[Block diagram: sensors → ADC → microcontroller / ASIC / FPGA → DAC → actuators]


SLIDE 8

The Turing Test

(Can machines think? A. M. Turing, 1950)

  • Requires:
    – Natural language
    – Knowledge representation
    – Automated reasoning
    – Machine learning
    – (vision, robotics) for full test

SLIDE 9

Acting/Thinking Humanly/Rationally

  • Turing test (1950)
  • Requires:
    – Natural language
    – Knowledge representation
    – Automated reasoning
    – Machine learning
    – (vision, robotics) for full test
  • Methods for Thinking Humanly:
    – Introspection, the General Problem Solver (Newell and Simon, 1961)
    – Cognitive sciences
  • Thinking rationally:
    – Logic
    – Problems: how to represent and reason in a domain
  • Acting rationally:
    – Agents: perceive and act

SLIDE 10

Agents

  • An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators.
  • Human agent: eyes, ears, and other organs for sensors; hands, legs, mouth, and other body parts for actuators.
  • Robotic agent: cameras and infrared range finders for sensors; various motors for actuators.

SLIDE 11

Agents and environments

  • The agent function maps from percept histories to actions: f: P* → A
  • The agent program runs on the physical architecture to produce f.


  • agent = architecture + program
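To make "agent = architecture + program" concrete, here is a minimal Python sketch (my own illustration, not course code): the architecture stores the percept history and hands it to a program that implements the agent function f.

```python
# Minimal sketch (not from the slides): an agent program realizes f: P* -> A
# by mapping the percept history seen so far to an action.
from typing import Any, Callable, List

Percept = Any
Action = str

class Agent:
    """agent = architecture + program: the architecture stores percepts and
    calls the program; the program computes the agent function f."""
    def __init__(self, program: Callable[[List[Percept]], Action]):
        self.percept_history: List[Percept] = []
        self.program = program

    def step(self, percept: Percept) -> Action:
        self.percept_history.append(percept)        # the architecture's job
        return self.program(self.percept_history)   # the program computes f(P*)

# Example program: ignore the history and always do nothing.
noop_agent = Agent(lambda history: "NoOp")
print(noop_agent.step(("A", "Clean")))  # -> NoOp
```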
SLIDE 12

Vacuum-cleaner world

  • Percepts: location and state of the environment, e.g., [A, Dirty], [B, Clean]

  • Actions: Left, Right, Suck, NoOp

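As a concrete (and assumed, not course-provided) encoding of these percepts and actions, the two-square world can be written down in a few lines:

```python
# Illustrative encoding of the two-square vacuum-cleaner world (not course code).
LOCATIONS = ("A", "B")
ACTIONS = ("Left", "Right", "Suck", "NoOp")

world = {"A": "Dirty", "B": "Clean"}   # dirt status per square
agent_location = "A"

def percept():
    """A percept is a (location, status) pair, e.g. ("A", "Dirty")."""
    return (agent_location, world[agent_location])

print(percept())  # -> ('A', 'Dirty')
```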

SLIDE 13

Rational agents

  • Rational Agent: For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, based on the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
  • Performance measure: An objective criterion for success of an agent's behavior.
  • E.g., the performance measure of a vacuum-cleaner agent could be amount of dirt cleaned up, amount of time taken, amount of electricity consumed, amount of noise generated, etc.
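The sketch below (illustrative only; the scoring weights and environment dynamics are my own assumptions) shows how such a performance measure could be computed by running an agent program in the two-square world, rewarding dirt cleaned and charging a small cost per action.

```python
# Illustrative performance measure for the two-square vacuum world:
# +10 per square cleaned, -1 per action taken (time / electricity / noise).
def run_and_score(agent_program, steps=10):
    world = {"A": "Dirty", "B": "Dirty"}
    loc = "A"
    score = 0
    for _ in range(steps):
        action = agent_program((loc, world[loc]))   # feed the current percept
        if action == "Suck" and world[loc] == "Dirty":
            world[loc] = "Clean"
            score += 10                             # dirt cleaned up
        elif action == "Left":
            loc = "A"
        elif action == "Right":
            loc = "B"
        if action != "NoOp":
            score -= 1                              # cost of acting
    return score

def keep_cleaning(percept):
    """Suck if the current square is dirty, otherwise move to the other square."""
    location, status = percept
    if status == "Dirty":
        return "Suck"
    return "Right" if location == "A" else "Left"

print(run_and_score(keep_cleaning))  # -> 10 (20 for dirt cleaned, minus 10 action costs)
```

With this measure, the shuttling that continues after both squares are clean is pure cost, which is exactly the kind of behavior an explicit performance measure is meant to expose.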

SLIDE 14

Rational agents

  • Rationality is distinct from omniscience (all-knowing with infinite knowledge).
  • Agents can perform actions in order to modify future percepts so as to obtain useful information (information gathering, exploration).
  • An agent is autonomous if its behavior is determined by its own percepts & experience (with the ability to learn and adapt) without depending solely on built-in knowledge.

SLIDE 15

Discussion Items

  • A realistic agent has a finite amount of computation and memory available. Assume an agent is killed because it did not have enough computational resources to calculate some rare eventuality that ended up killing it. Can this agent still be rational?
  • The Turing test was contested by Searle using the "Chinese Room" argument. The Chinese Room agent needs an exponentially large memory to work. Can we "save" the Turing test from the Chinese Room argument?


SLIDE 16

Task Environment

  • Before we design an intelligent agent, we must specify its "task environment" (PEAS):
    – Performance measure
    – Environment
    – Actuators
    – Sensors


SLIDE 17

PEAS

  • Example: Agent = taxi driver

    – Performance measure: Safe, fast, legal, comfortable trip, maximize profits
    – Environment: Roads, other traffic, pedestrians, customers
    – Actuators: Steering wheel, accelerator, brake, signal, horn
    – Sensors: Cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard

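Since a PEAS description is just structured data, it can be recorded directly in code. The dataclass below is my own illustration of the taxi-driver example, not part of the lecture:

```python
# Illustrative PEAS specification captured as plain data.
from dataclasses import dataclass
from typing import List

@dataclass
class PEAS:
    performance_measure: List[str]
    environment: List[str]
    actuators: List[str]
    sensors: List[str]

taxi_driver = PEAS(
    performance_measure=["safe", "fast", "legal", "comfortable trip", "maximize profits"],
    environment=["roads", "other traffic", "pedestrians", "customers"],
    actuators=["steering wheel", "accelerator", "brake", "signal", "horn"],
    sensors=["cameras", "sonar", "speedometer", "GPS", "odometer",
             "engine sensors", "keyboard"],
)
print(taxi_driver.sensors[:3])  # -> ['cameras', 'sonar', 'speedometer']
```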

SLIDE 18

PEAS

  • Example: Agent = Medical diagnosis system

    – Performance measure: Healthy patient, minimize costs, lawsuits
    – Environment: Patient, hospital, staff
    – Actuators: Screen display (questions, tests, diagnoses, treatments, referrals)
    – Sensors: Keyboard (entry of symptoms, findings, patient's answers)

SLIDE 19

PEAS

  • Example: Agent = Part-picking robot
  • Performance measure: Percentage of parts in correct bins

  • Environment: Conveyor belt with parts, bins
  • Actuators: Jointed arm and hand
  • Sensors: Camera, joint angle sensors


SLIDE 20

Environment types

  • Fully observable (vs. partially observable): An agent's sensors give it access to the complete state of the environment at each point in time.
  • Deterministic (vs. stochastic): The next state of the environment is completely determined by the current state and the action executed by the agent. (If the environment is deterministic except for the actions of other agents, then the environment is strategic.)
  • Episodic (vs. sequential): An agent's action is divided into atomic episodes. Decisions do not depend on previous decisions/actions.


SLIDE 21

Environment types

  • Static (vs. dynamic): The environment is unchanged while an agent is deliberating. (The environment is semidynamic if the environment itself does not change with the passage of time, but the agent's performance score does.)
  • Discrete (vs. continuous): A limited number of distinct, clearly defined percepts and actions. How do we represent or abstract or model the world?
  • Single agent (vs. multi-agent): An agent operating by itself in an environment. Does the other agent interfere with my performance measure?

SLIDE 22-24

Task environments

| task environment       | observable | determ./stochastic | episodic/sequential | static/dynamic | discrete/continuous | agents |
|-------------------------|------------|--------------------|---------------------|----------------|---------------------|--------|
| crossword puzzle        | fully      | determ.            | sequential          | static         | discrete            | single |
| chess with clock        | fully      | strategic          | sequential          | semi           | discrete            | multi  |
| poker                   | partial    | stochastic         | sequential          | static         | discrete            | multi  |
| backgammon              | fully      | stochastic         | sequential          | static         | discrete            | multi  |
| taxi driving            | partial    | stochastic         | sequential          | dynamic        | continuous          | multi  |
| medical diagnosis       | partial    | stochastic         | sequential          | dynamic        | continuous          | single |
| image analysis          | fully      | determ.            | episodic            | semi           | continuous          | single |
| part-picking robot      | partial    | stochastic         | episodic            | dynamic        | continuous          | single |
| refinery controller     | partial    | stochastic         | sequential          | dynamic        | continuous          | single |
| interactive Eng. tutor  | partial    | stochastic         | sequential          | dynamic        | discrete            | multi  |

SLIDE 25

Agent types

  • Five basic types in order of increasing generality:
    – Table-driven agents
    – Simple reflex agents
    – Model-based reflex agents
    – Goal-based agents
    – Utility-based agents

SLIDE 26

Table-Driven Agent

[Schematic: current state of decision process; table lookup for entire percept history]

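A minimal sketch of the table-driven idea (illustrative; the hand-built table covers only a couple of histories): the entire percept history is the key into a lookup table of actions, which is why the approach is feasible only for tiny worlds.

```python
# Illustrative table-driven agent: one table entry per possible percept
# sequence. The table grows exponentially with the length of the history.
def make_table_driven_agent(table):
    percepts = []                      # the agent's whole percept history
    def program(percept):
        percepts.append(percept)
        # The (tuple-ized) history indexes the table; unknown histories -> NoOp.
        return table.get(tuple(percepts), "NoOp")
    return program

# A hand-built table for the first two steps in the two-square vacuum world.
table = {
    (("A", "Dirty"),): "Suck",
    (("A", "Clean"),): "Right",
    (("A", "Dirty"), ("A", "Clean")): "Right",
    (("A", "Clean"), ("B", "Dirty")): "Suck",
}
agent = make_table_driven_agent(table)
print(agent(("A", "Dirty")))   # -> Suck
print(agent(("A", "Clean")))   # -> Right
```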

SLIDE 27

Simple reflex agents

NO MEMORY. Fails if the environment is partially observable.

Example: vacuum-cleaner world
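A sketch of the simple reflex structure for the vacuum-cleaner world, written as condition-action rules over the current percept only; the rule format and helper names are my own illustration.

```python
# Illustrative simple reflex agent: condition-action rules over the CURRENT
# percept only. No internal state, so it cannot cope with partial observability.
def make_simple_reflex_agent(rules):
    def program(percept):
        for condition, action in rules:
            if condition(percept):        # first matching rule fires
                return action
        return "NoOp"
    return program

# Condition-action rules for the two-square vacuum world.
vacuum_rules = [
    (lambda p: p[1] == "Dirty", "Suck"),
    (lambda p: p[0] == "A", "Right"),
    (lambda p: p[0] == "B", "Left"),
]
agent = make_simple_reflex_agent(vacuum_rules)
print(agent(("A", "Dirty")))   # -> Suck
print(agent(("B", "Clean")))   # -> Left
```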

SLIDE 28

Model-based reflex agents

Model the state of the world by:
  – a description of the current world state
  – modeling how the world changes
  – modeling how its actions change the world

  • This can work even with partial information
  • It is unclear what to do without a clear goal

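A hedged sketch of the model-based variant: an internal state is updated from the last action and the new percept via an assumed, hand-written world model, and the condition-action rules then fire on that state rather than on the raw percept.

```python
# Illustrative model-based reflex agent: internal state is updated with a
# model of how the world changes, then condition-action rules fire on the state.
def make_model_based_agent(update_state, rules):
    state, last_action = {}, None
    def program(percept):
        nonlocal state, last_action
        state = update_state(state, last_action, percept)  # belief about the world
        for condition, action in rules:
            if condition(state):
                last_action = action
                return action
        last_action = "NoOp"
        return "NoOp"
    return program

# Toy model for the vacuum world: remember the status of squares we have seen.
def update_state(state, last_action, percept):
    location, status = percept
    new_state = dict(state, location=location)
    new_state[location] = status
    return new_state

rules = [
    (lambda s: s.get(s["location"]) == "Dirty", "Suck"),
    # If we already believe both squares are clean, stop moving.
    (lambda s: s.get("A") == "Clean" and s.get("B") == "Clean", "NoOp"),
    (lambda s: s["location"] == "A", "Right"),
    (lambda s: s["location"] == "B", "Left"),
]
agent = make_model_based_agent(update_state, rules)
print(agent(("A", "Dirty")))   # -> Suck
print(agent(("A", "Clean")))   # -> Right  (B's status still unknown)
```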

SLIDE 29

Goal-based agents

Goals provide a reason to prefer one action over the other. We need to predict the future: we need to plan & search.

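To ground "predict the future: plan & search", the sketch below (the transition model, goal test, and use of breadth-first search are my own minimal assumptions) searches for an action sequence that reaches a goal state in the two-square vacuum world.

```python
# Illustrative goal-based agent: search over a model of action effects for a
# sequence of actions that reaches a goal state, then execute it step by step.
from collections import deque

def plan(start, goal_test, successors):
    """Breadth-first search; returns a list of actions reaching a goal state."""
    frontier = deque([(start, [])])
    seen = {start}
    while frontier:
        state, actions = frontier.popleft()
        if goal_test(state):
            return actions
        for action, next_state in successors(state):
            if next_state not in seen:
                seen.add(next_state)
                frontier.append((next_state, actions + [action]))
    return []

# Toy vacuum model: state = (location, A-status, B-status).
def successors(state):
    loc, a, b = state
    yield "Left", ("A", a, b)
    yield "Right", ("B", a, b)
    yield "Suck", (loc, "Clean" if loc == "A" else a, "Clean" if loc == "B" else b)

goal_test = lambda s: s[1] == "Clean" and s[2] == "Clean"
print(plan(("A", "Dirty", "Dirty"), goal_test, successors))
# -> ['Suck', 'Right', 'Suck']
```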

SLIDE 30

Utility-based agents

Some solutions to goal states are better than others. Which one is best is given by a utility function. Which combination of goals is preferred?

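A minimal sketch of utility-based selection: rather than any goal-satisfying action, pick the action whose predicted outcome scores highest under a utility function. The outcome model and utility numbers are made up for illustration.

```python
# Illustrative utility-based choice: predict the outcome of each action and
# pick the one the utility function scores highest (instead of any goal state).
def choose_action(state, actions, predict, utility):
    return max(actions, key=lambda a: utility(predict(state, a)))

# Toy example: a taxi choosing a route; utility trades off time against tolls.
def predict(state, route):
    minutes, tolls = {"highway": (20, 5), "side_streets": (35, 0)}[route]
    return {"minutes": minutes, "tolls": tolls}

def utility(outcome):
    return -outcome["minutes"] - 2 * outcome["tolls"]   # faster and cheaper is better

print(choose_action({}, ["highway", "side_streets"], predict, utility))
# -> highway  (utility -30 vs. -35)
```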

SLIDE 31

Learning agents

How does an agent improve over time? By monitoring its performance and suggesting better modeling, new action rules, etc.

[Diagram: evaluates current world state and suggests changes to action rules; "old agent" = models the world and decides on actions to be taken; suggests explorations]

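Finally, a rough sketch (entirely my own illustration) of the learning loop: a critic scores each action against a performance standard, the learning element updates the condition-action rules, and a problem generator occasionally forces exploration.

```python
# Illustrative learning-agent loop for the two-square vacuum world: the
# performance element follows its current rules, the critic scores each action,
# the learning element updates the rules, and exploration is forced occasionally.
import random

def critic(percept, action):
    """Performance standard: reward cleaning dirt, penalize wasted effort."""
    _, status = percept
    if action == "Suck":
        return 10 if status == "Dirty" else -5
    return -1

def run(episodes=500):
    actions = ["Suck", "Right", "Left", "NoOp"]
    value = {}    # learned estimate of reward for each (percept, action) pair
    rules = {}    # condition-action rules used by the performance element
    for _ in range(episodes):
        percept = (random.choice("AB"), random.choice(["Dirty", "Clean"]))
        if percept not in rules or random.random() < 0.1:    # problem generator
            action = random.choice(actions)                   # try something new
        else:
            action = rules[percept]                           # performance element
        value[(percept, action)] = critic(percept, action)    # critic's feedback
        # Learning element: keep the best-valued action tried so far for this percept.
        tried = [a for a in actions if (percept, a) in value]
        rules[percept] = max(tried, key=lambda a: value[(percept, a)])
    return rules

print(run())   # e.g. maps every (location, "Dirty") percept to "Suck"
```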