[PPT] - Search Algorithms 3 AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 PowerPoint Presentation

SLIDE 1

Search Algorithms

3

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 1

SLIDE 2

3 Search Algorithms 3.1 Problem-solving agents 3.2 Basic search algorithms 3.3 Heuristic search

Greedy search • A∗ search

3.4 Local search

Hill-climbing • Simulated annealing∗ • Genetic algorithms∗

3.5 Online search∗ 3.6 Adversarial search

minimax decisions • α–β pruning • Monte Carlo tree search∗

3.7 Metaheuristic∗

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 2

SLIDE 3

Problem-solving agents

Problem-solving agents: finding sequences of actions that lead to desirable states (goal-based) state: some description of the current world states – abstracted for problem solving as state space Goal: a set of world states Action: transition between world states Search: the algorithm takes a problem as input and returns a solution in the form of an action sequence

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 3

SLIDE 4

Problem-solving agents

function Simple-Problem-Solving-Agent( p) returns an action s, an action sequence, initially empty state, some description of the current world state g, a goal, initially null problem, a problem formulation state ← Update-State(state, p) if s is empty then g ← Formulate-Goal(state) problem ← Formulate-Problem(state, g) s ← Search( problem) if s=failure then return a null action action ← First(s, state) s ← Rest(s, state) return action

Note: offline vs. online problem solving

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 4

SLIDE 5

Example: Romania

On holiday in Romania; currently in Arad Flight leaves tomorrow from Bucharest Formulate goal: be in Bucharest Formulate problem: states: various cities actions: drive between cities Find solution: sequence of cities, e.g., Arad, Sibiu, Fagaras, Bucharest

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 5

SLIDE 6

Example: Romania

Giurgiu Urziceni Hirsova Eforie Neamt Oradea Zerind Arad Timisoara Lugoj Mehadia Dobreta Craiova Sibiu Fagaras Pitesti Vaslui Iasi Rimnicu Vilcea Bucharest 71 75 118 111 70 75 120 151 140 99 80 97 101 211 138 146 85 90 98 142 92 87 86

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 6

SLIDE 7

Problem types

Deterministic, fully observable = ⇒ single-state problem Agent knows exactly which state it will be in; solution is a sequence Non-observable = ⇒ conformant problem Agent may have no idea where it is; solution (if any) is a sequence Nondeterministic and/or partially observable = ⇒ contingency prob- lem percepts provide new information about current state solution is a contingent plan or a policy

ften interleave search, execution

Unknown state space = ⇒ exploration problem (“online”)

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 7

SLIDE 8

Example: vacuum world

Single-state, start in #5. Solution??

1 2 3 4 5 6 7 8

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 8

SLIDE 9

Example: vacuum world

Single-state, start in #5. Solution?? [Right, Suck] Conformant, start in {1, 2, 3, 4, 5, 6, 7, 8} e.g., Right goes to {2, 4, 6, 8}. Solution??

1 2 3 4 5 6 7 8

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 9

SLIDE 10

Example: vacuum world

Single-state, start in #5. Solution?? [Right, Suck] Conformant, start in {1, 2, 3, 4, 5, 6, 7, 8} e.g., Right goes to {2, 4, 6, 8}. Solution?? [Right, Suck, Left, Suck] Contingency, start in #5 Murphy’s Law: Suck can dirty a clean carpet Local sensing: dirt, location only Solution??

1 2 3 4 5 6 7 8

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 10

SLIDE 11

Example: vacuum world

Single-state, start in #5. Solution?? [Right, Suck] Conformant, start in {1, 2, 3, 4, 5, 6, 7, 8} e.g., Right goes to {2, 4, 6, 8}. Solution?? [Right, Suck, Left, Suck] Contingency, start in #5 Murphy’s Law: Suck can dirty a clean carpet Local sensing: dirt, location only. Solution?? [Right, if dirt then Suck]

1 2 3 4 5 6 7 8

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 11

SLIDE 12

Problem formulation

A problem is defined formally by five coomponents: initial state that the agent starts – any state s ∈ S (set of states), the initial state S0 ∈ S e.g., In(Arad) (“at Arad”) actions: Given a state s, Action(s) returns the set of actions that can be executed in s e.g., from the state In(Arad), the applicable actions are {Go(Sibiu), Go(Timisoara), Go(Zerind)} transition model: a function Result(s, a) (or Do(a, s)) that re- turns the state that results from doing action a in the state s; – also use the term successor to refer to any state reachable from a given state by a single action e.g., Result(In(Arad), Go(Zerind)) = In(Zerind)

AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 12

SLIDE 13

Problem formulation