Lecture 2 Agents & Environments (Chap. 2) Based on slides by - PowerPoint PPT Presentation

Lecture 2 Agents & Environments (Chap. 2) Based on slides by UW CSE AI faculty, Dan Klein, Stuart Russell, Andrew Moore

Outline • Agents and environments • Rationality • PEAS specification • Environment types • Agent types • Pac-Man projects 2

Agents • An agent is any entity that can perceive its environment through sensors and act upon that environment through actuators • Human agent: Sensors: Eyes, ears, and other organs Actuators: Hands, legs, mouth, etc. • Robotic agent: Sensors: Cameras, laser range finders, etc. Actuators: Motorized limbs, wheels, etc. 3

Other Types of Agents • Immobots (Immobile Robots) Intelligent buildings Intelligent forests • Softbots Askjeeves.com (now Ask.com) Expert Systems Microsoft Clippy 4

Intelligent Agents • Have sensors and actuators (effectors) • Implement mapping from percept sequence to actions • Maximize a Performance Measure percepts Environment Agent actions 5

Performance Measures • Performance measure = An objective criterion for success of an agent's behavior • E.g., vacuum cleaner agent performance measure: amount of dirt cleaned up, amount of time taken, amount of electricity consumed, amount of noise generated, etc. 6

Rational Agent “ For each possible percept sequence, does whatever action maximizes expected performance on the basis of evidence perceived so far and built-in prior knowledge .'' 7

Autonomy A rational agent is autonomous if it can learn to compensate for partial or incorrect prior knowledge Why is this important? 8

Task Environments • The “task environment” for an agent is comprised of PEAS (Performance measure, Environment, Actuators, Sensors) • E.g., Consider the task of designing an automated taxi driver: Performance measure = ? Environment = ? Actuators = ? Sensors = ? 9

PEAS • PEAS for Automated taxi driver • Performance measure: Safe, fast, legal, comfortable trip, maximize profits • Environment: Roads, other traffic, pedestrians, customers • Actuators: Steering wheel, accelerator, brake, signal, horn • Sensors: Cameras, sonar, speedometer, GPS, odometer, engine sensors, touchpad or keyboard 10

PEAS • PEAS for Medical diagnosis system • Performance measure: Healthy patient, minimize costs, lawsuits • Environment: Patient, hospital, staff • Actuators: Screen display (questions, tests, diagnoses, treatments, referrals) • Sensors: Keyboard (entry of symptoms, findings, patient's answers) 11

Properties of Environments • Observability: full vs. partial Sensors detect all aspects of state of environment relevant to choice of action? • Deterministic vs. stochastic Next state completely determined by current state and action? • Episodic vs. sequential Current action independent of previous actions? • Static vs. dynamic Can environment change over time? • Discrete vs. continuous State of environment, time, percepts, and actions discrete or continuous-valued? • Single vs . multiagent 12

Fully observable vs. Partially observable Can the agent observe the complete state of the environment? vs. 13

Single agent vs. Multiagent Is the agent the only thing acting in the world? vs. 14

Deterministic vs. Stochastic Is there uncertainty in how the world works? vs. 15

Episodic vs. Sequential Does the agent take more than one action? vs. 16

Discrete vs. Continuous Are the states, actions etc. discrete or continuous? vs. 17

Agent Functions and Agent Programs • An agent’s behavior can be described by an agent function mapping percept sequences to actions taken by the agent • An implementation of an agent function running on the agent architecture (e.g., a robot) is called an agent program • Our goal: Develop concise agent programs for implementing rational agents 18

Implementing Rational Agents • Table lookup based on percept sequences Infeasible • Agent programs: Simple reflex agents Agents with memory • Reflex agent with internal state • Goal-based agents • Utility-based agents 19

Simple Reflex Agents AGENT Sensors Percept ENVIRONMENT what action Condition-Action rules should I do now? Effectors 20

Simple Reflex Agents 21

Famous Reflex Agents 22

Reflex Agent with Internal State Sensors state Estimate of How world evolves world state ENVIRONMENT What my actions do what action Condition-Action rules should I do now? AGENT Effectors 23

Goal-Based Agents Sensors state Estimate of world state How world evolves ENVIRONMENT what it’ll be like What my actions do if I do action A what action Goals should I do now? AGENT Effectors 24

Utility-Based Agents Sensors state Estimate of world state How world evolves ENVIRONMENT what it’ll be like if I do action A What my actions do How happy would I be in such a state? Utility function what action should I do now? AGENT Effectors 25

While driving, what’s the best policy? • Always stop at a stop sign • Never stop at a stop sign • Look around for other cars and stop only if you see one approaching • Look around for a cop and stop only if you see one • What kind of agent are you? – reflex, goal-based, utility-based? 26

To Do • Project 0: Python tutorial • Finish chapters 1 and 2; start chapter 3 27

Lecture 2 Agents & Environments (Chap. 2) Based on slides by - PowerPoint PPT Presentation

Lecture 2 Agents & Environments (Chap. 2) Based on slides by UW CSE AI faculty, Dan Klein, Stuart Russell, Andrew Moore Outline Agents and environments Rationality PEAS specification Environment types Agent types

Malaysian Healthy Ageing Society Plenary Lecture Plenary Lecture Plenary Lecture Plenary

CEE 680 Lecture #2 1/22/2020 1 CEE 680 Lecture #2 1/22/2020 2 CEE 680 Lecture #2

Pocket Lecture Pocket Lecture Pocket Lecture Pocket Lecture Listen Audio Notes Progress

Multiphase Modelling in Cancer Helen Byrne Wolfson Centre for Mathematical Biology Mathematical

Previous Lecture Todays Lecture Slides for Lecture 5 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 30 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 28 Completion of divide-by-3 counter

Previous Lecture Todays Lecture Slides for Lecture 12 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 3 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 2 ENEL 353: Digital Circuits Fall 2013

Previous Lecture Todays Lecture Slides for Lecture 35 ENEL 353: Digital Circuits Fall

Lecture Capture Introduction to Lecture Capture Learning Outcomes What will lecture capture

Previous Lecture Todays Lecture Slides for Lecture 32 Completion of a timing analysis

Repetition Automatic Control, Basic Course, Lecture 11 Fredrik Bagge Carlson December 17, 2016

Previous Lecture Todays Lecture Slides for Lecture 26 ENEL 353: Digital Circuits Fall

Previous Lecture Todays Lecture Slides for Lecture 33 ENEL 353: Digital Circuits Fall

Hello Alexa, Im Drupal Arash Farazdaghi Builder Track \

ZONE TO WIN ORGANIZING TO COMPETE IN AN AGE OF DISRUPTION G o t o c o n A c c e l e r a

Disintermedia+on 2.0 Librarians and Systems Rory Litwin FIP February 5, 2010 University of

CS490W Web Search (I ) Luo Si Department of Computer Science Purdue University Slides from

Internet Engineering: Search Ali Kamandi Sharif University of Technology kamandi@ce.sharif.edu

How Engineering Management is like Jenna Zeigen - @ zeigenvector - October 20, 2016 - BrooklynJS

Outline Introduction to information retrieval Logical view of documents L i l i f d

Information Retrieval Lecture 10 Recap Last lecture HITS algorithm using anchor text