CS 188: Artificial Intelligence Lectures 2 and 3: Search Pieter - PDF document

CS 188: Artificial Intelligence Lectures 2 and 3: Search Pieter Abbeel – UC Berkeley Many slides from Dan Klein Reminder § Only a very small fraction of AI is about making computers play games intelligently § Recall: computer vision, natural language, robotics, machine learning, computational biology, etc. § That being said: games tend to provide relatively simple example settings which are great to illustrate concepts and learn about algorithms which underlie many areas of AI 1

Reflex Agent § Choose action based on current percept (and maybe memory) § May have memory or a model of the world ’ s current state § Do not consider the future consequences of their actions § Act on how the world IS § Can a reflex agent be rational? A reflex agent for pacman 4 actions: move North, East, South or West Reflex agent § While(food left) § Sort the possible directions to move according to the amount of food in each direction § Go in the direction with the largest amount of food 2

A reflex agent for pacman (2) Reflex agent § While(food left) § Sort the possible directions to move according to the amount of food in each direction § Go in the direction with the largest amount of food A reflex agent for pacman (3) Reflex agent § While(food left) § Sort the possible directions to move according to the amount of food in each direction § Go in the direction with the largest amount of food § But, if other options are available, exclude the direction we just came from 3

A reflex agent for pacman (4) Reflex agent § While(food left) § If can keep going in the current direction, do so § Otherwise: § Sort directions according to the amount of food § Go in the direction with the largest amount of food § But, if other options are available, exclude the direction we just came from A reflex agent for pacman (5) Reflex agent § While(food left) § If can keep going in the current direction, do so § Otherwise: § Sort directions according to the amount of food § Go in the direction with the largest amount of food § But, if other options are available, exclude the direction we just came from 4

Reflex Agent Goal-based Agents § Choose action based on § Plan ahead current percept (and § Ask “ what if ” maybe memory) § Decisions based on § May have memory or a (hypothesized) model of the world ’ s consequences of current state actions § Do not consider the future § Must have a model of consequences of their how the world evolves actions in response to actions § Act on how the world IS § Act on how the world WOULD BE § Can a reflex agent be rational? Search Problems § A search problem consists of: § A state space “ N ” , § A successor function 1.0 “ E ” , 1.0 § A start state and a goal test § A solution is a sequence of actions (a plan) which transforms the start state to a goal state 5

Example: Romania § State space: § Cities § Successor function: § Go to adj city with cost = dist § Start state: § Arad § Goal test: § Is state == Bucharest? § Solution? What ’ s in a State Space? The world state specifies every last detail of the environment A search state keeps only the details needed (abstraction) § Problem: Pathing § Problem: Eat-All-Dots § States: (x,y) location § States: {(x,y), dot booleans} § Actions: NSEW § Actions: NSEW § Successor: update location § Successor: update location only and possibly a dot boolean § Goal test: is (x,y)=END § Goal test: dots all false 6

State Space Graphs § State space graph: A mathematical G a representation of a c b search problem e § For every search problem, d f there ’ s a corresponding S h state space graph § The successor function is p r q represented by arcs Ridiculously tiny state space § We can rarely build this graph for a tiny search problem graph in memory (so we don ’ t) State Space Sizes? § Search Problem: Eat all of the food § Pacman positions: 10 x 12 = 120 § Food count: 30 § Ghost positions: 12 § Pacman facing: up, down, left, right 7

Search Trees “ N ” , 1.0 “ E ” , 1.0 § A search tree: § This is a “ what if ” tree of plans and outcomes § Start state at the root node § Children correspond to successors § Nodes contain states, correspond to PLANS to those states § For most problems, we can never actually build the whole tree Another Search Tree § Search: § Expand out possible plans § Maintain a fringe of unexpanded plans § Try to expand as few tree nodes as possible 8

General Tree Search § Important ideas: § Fringe Detailed pseudocode § Expansion is in the book! § Exploration strategy § Main question: which fringe nodes to explore? Example: Tree Search G a c b e d f S h p r q 9

State Graphs vs. Search Trees G a Each NODE in in the c search tree is an b entire PATH in the e d f problem graph. S h p r q S e p d q b c e h r We construct both on demand – and a a h r p q f we construct as little as possible. q c p q f G a q c G a Review: Depth First (Tree) Search G a a Strategy: expand c c b b deepest node first e e d d f f Implementation: S h h Fringe is a LIFO p p r r stack q q S e p d q b c e h r a a h r p q f q c p q f G a q c G a 10

Review: Breadth First (Tree) Search G a Strategy: expand c b shallowest node first e d Implementation: f S h Fringe is a FIFO queue p r q S e p d Search q b c e h r Tiers a a h r p q f q c p q f G a q c G a Search Algorithm Properties § Complete? Guaranteed to find a solution if one exists? § Optimal? Guaranteed to find the least cost path? § Time complexity? § Space complexity? Variables: n Number of states in the problem b The average branching factor B (the average number of successors) C* Cost of least cost solution s Depth of the shallowest solution m Max depth of the search tree 11

DFS Algorithm Complete Optimal Time Space Depth First DFS N N O(B LMAX ) O(LMAX) N N Infinite Infinite Search b START a GOAL § Infinite paths make DFS incomplete … § How can we fix this? DFS § With cycle checking, DFS is complete.* 1 node b b nodes … b 2 nodes m tiers b m nodes Algorithm Complete Optimal Time Space DFS w/ Path Y N O( b m ) O( bm ) Checking § When is DFS optimal? * Or graph search – next lecture. 12

BFS Algorithm Complete Optimal Time Space DFS w/ Path Y N O( b m ) O( bm ) Checking BFS Y N* O( b s+1 ) O( b s+1 ) 1 node b b nodes … s tiers b 2 nodes b s nodes b m nodes § When is BFS optimal? Comparisons § When will BFS outperform DFS? § When will DFS outperform BFS? 13

Iterative Deepening Iterative deepening uses DFS as a subroutine: b … 1. Do a DFS which only searches for paths of length 1 or less. 2. If “ 1 ” failed, do a DFS which only searches paths of length 2 or less. 3. If “ 2 ” failed, do a DFS which only searches paths of length 3 or less. … .and so on. Algorithm Complete Optimal Time Space DFS w/ Path Y N O( b m ) O( bm ) Checking BFS Y N* O( b s+1 ) O( b s+1 ) ID Y N* O( b s+1 ) O( bs ) Costs on Actions GOAL a 2 2 c b 3 2 1 8 2 e d 3 f 9 8 2 START h 4 1 1 4 p r 15 q Notice that BFS finds the shortest path in terms of number of transitions. It does not find the least-cost path. We will quickly cover an algorithm which does find the least-cost path. 14

Uniform Cost (Tree) Search 2 G a c b 8 1 Expand cheapest node first: 2 2 e 3 d Fringe is a priority queue f 8 1 9 S h 1 1 p r q 15 0 S 1 e 9 p 3 d 16 5 17 11 q 4 e h r b c 11 Cost 7 6 h 13 r p q f a a contours q c p q f 8 G a q c 11 10 G a Priority Queue Refresher § A priority queue is a data structure in which you can insert and retrieve (key, value) pairs with the following operations: pq.push(key, value) inserts (key, value) into the queue. returns the key with the lowest value, and pq.pop() removes it from the queue. § You can decrease a key ’ s priority by pushing it again § Unlike a regular queue, insertions aren ’ t constant time, usually O(log n ) § We ’ ll need priority queues for cost-sensitive search methods 15

Uniform Cost (Tree) Search Algorithm Complete Optimal Time (in nodes) Space DFS w/ Path Y N O( b m ) O( bm ) Checking BFS Y N O( b s+1 ) O( b s+1 ) UCS Y* Y O( b C*/ ε ) O( b C*/ ε ) b … * UCS can fail if C*/ ε tiers actions can get arbitrarily cheap Uniform Cost Issues § Remember: explores c ≤ 1 … increasing cost contours c ≤ 2 c ≤ 3 § The good: UCS is complete and optimal! § The bad: § Explores options in every “ direction ” § No information about goal location Start Goal 16

Uniform Cost Search Example Search Heuristics § Any estimate of how close a state is to a goal § Designed for a particular search problem § Examples: Manhattan distance, Euclidean distance 10 5 11.2 17

Example: Heuristic Function h(x) Best First / Greedy Search § Expand the node that seems closest … § What can go wrong? 18

CS 188: Artificial Intelligence Lectures 2 and 3: Search Pieter - PDF document

CS 188: Artificial Intelligence Lectures 2 and 3: Search Pieter Abbeel UC Berkeley Many slides from Dan Klein Reminder Only a very small fraction of AI is about making computers play games intelligently Recall: computer vision,

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

Standard 188-2015 Presentation - TE Watson ANSI/ASHRAE Standard 188-2015 Legionellosis: Risk

CS 188: Artificial Intelligence Introduction Instructors: Anca Dragan, Sergey Levine University

Lecture 29: Artificial Intelligence Marvin Zhang 08/10/2016 Some slides are adapted from CS 188

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

8th November 2019 Artificial Intelligence Finance Institute NYU Courant Artificial Intelligence

Stacks and Queues Dr. Mattox Beckman University of Illinois at Urbana-Champaign Department of

Understanding Task Scheduling Algorithms Kenjiro Taura 1 / 51 Contents 1 Introduction 2 Work

Functional Data Structures [C. Okasaki, Simple and efficient purely functional queues and deques ,

Programming Language Concepts: Lecture 4 Madhavan Mukund Chennai Mathematical Institute

Stack / Queue ADT Stack ADT Implementations Array resizing Queue ADT January 27, 2020 Cinda

Week 6 Oliver Kullmann Dynamic sets Data structures Simple implementa- tion Dynamic sets

Implementing a Binary_Tree Class Section 8.3 The BTNode Class Like a linked list, a node

Object-Oriented Programming for Scientific Computing Templates and Static Polymorphism Ole Klein