CS 730/830: Intro AI 1 handout: slides Are We Done? Beyond A* - - PowerPoint PPT Presentation

cs 730 830 intro ai
SMART_READER_LITE
LIVE PREVIEW

CS 730/830: Intro AI 1 handout: slides Are We Done? Beyond A* - - PowerPoint PPT Presentation

CS 730/830: Intro AI 1 handout: slides Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs Wheeler Ruml (UNH) Lecture 4, CS 730 1 / 24 EOLQs Are We Done? Beyond A* Suboptimal Search Anytime Search


slide-1
SLIDE 1

CS 730/830: Intro AI

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 1 / 24

1 handout: slides

slide-2
SLIDE 2

EOLQs

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 2 / 24

slide-3
SLIDE 3

Are We Done?

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 3 / 24

slide-4
SLIDE 4

Beyond A*

■ Are We Done? Beyond A* ■ GBFS ■ 8-puzzle ■ Evaluating Greedy ■ Beam Search Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 4 / 24

slide-5
SLIDE 5

Greedy Best-first Search (BGFS)

■ Are We Done? Beyond A* ■ GBFS ■ 8-puzzle ■ Evaluating Greedy ■ Beam Search Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 5 / 24

Q ← an ordered list containing just the initial state. Loop If Q is empty, then return failure. Node ← Pop(Q). If Node is a goal, then return Node (or path to it) else Children ← Expand (Node). Merge Children into Q, keeping sorted by heuristic.←

slide-6
SLIDE 6

GBFS on the 8-puzzle

■ Are We Done? Beyond A* ■ GBFS ■ 8-puzzle ■ Evaluating Greedy ■ Beam Search Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 6 / 24

h(n) = number of tiles out of place. (The blank is not a tile.) Start state: 2 8 3 1 6 4 7 ⊔ 5 Goal state: 1 2 3 8 ⊔ 4 7 6 5 Please draw the tree resulting from the first two node expansions.

slide-7
SLIDE 7

Evaluating Greedy

■ Are We Done? Beyond A* ■ GBFS ■ 8-puzzle ■ Evaluating Greedy ■ Beam Search Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 7 / 24

Assume branching factor b and solution at depth d. Completeness: Time: Space: Admissibility:

slide-8
SLIDE 8

Beam Search

■ Are We Done? Beyond A* ■ GBFS ■ 8-puzzle ■ Evaluating Greedy ■ Beam Search Suboptimal Search Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 8 / 24

Truncate queue to hold the most promising k nodes. k is the beam width.

slide-9
SLIDE 9

Suboptimal Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 9 / 24

slide-10
SLIDE 10

Problem Settings

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 10 / 24

  • ptimal:

minimize solution cost suffer all with f(n) = g(n) + h(n) < f ∗ greedy: minimize solving time bounded suboptimal: minimize time subject to relative cost bound (factor of optimal) bounded cost: minimize time subject to absolute cost bound contract: minimize cost subject to absolute time bound anytime: iteratively converge to optimal utility: maximize given function of cost and time

slide-11
SLIDE 11

Weighted A*

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 11 / 24

f ′(n) = g(n) + w · h(n)

nodes with high h(n) look even worse

no infinite rabbit holes

suboptimality bounded: within a factor of w of optimal!

slide-12
SLIDE 12

wA* Behavior

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 12 / 24

  • ptimal: uniform-cost search
slide-13
SLIDE 13

wA* Behavior

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 12 / 24

  • ptimal: A*
slide-14
SLIDE 14

wA* Behavior

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 12 / 24

bounded suboptimal: Weighted A*

slide-15
SLIDE 15

For Speed: Distance-to-go, Not Cost-to-go

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 13 / 24

how to minimize solving time?

slide-16
SLIDE 16

For Speed: Distance-to-go, Not Cost-to-go

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 13 / 24

how to minimize solving time? how to minimize number of expansions?

slide-17
SLIDE 17

For Speed: Distance-to-go, Not Cost-to-go

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 13 / 24

how to minimize solving time? how to minimize number of expansions? take the shortest path to a goal

slide-18
SLIDE 18

For Speed: Distance-to-go, Not Cost-to-go

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 13 / 24

how to minimize solving time? how to minimize number of expansions? take the shortest path to a goal for domains with costs, this is not h(n) new information source: distance-to-go= d(n) n h = 4 d = 2 h = 5 d = 1

slide-19
SLIDE 19

For Speed: Distance-to-go, Not Cost-to-go

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 13 / 24

how to minimize solving time? how to minimize number of expansions? take the shortest path to a goal for domains with costs, this is not h(n) new information source: distance-to-go= d(n) n h = 4 d = 2 h = 5 d = 1 Speedy: best-first search on d

slide-20
SLIDE 20

Explicit Estimation Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 14 / 24

bounded-suboptimal using h, d, and h

  • ptimal: uniform-cost
slide-21
SLIDE 21

Explicit Estimation Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 14 / 24

bounded-suboptimal using h, d, and h

  • ptimal: A*
slide-22
SLIDE 22

Explicit Estimation Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 14 / 24

bounded-suboptimal using h, d, and h bounded suboptimal: Weighted A*

slide-23
SLIDE 23

Explicit Estimation Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 14 / 24

bounded-suboptimal using h, d, and h bounded suboptimal: Optimistic Search (ICAPS, 2008)

slide-24
SLIDE 24

Explicit Estimation Search

■ Are We Done? Beyond A* Suboptimal Search ■ Problem Settings ■ wA* ■ wA* Behavior ■ Distance-to-go ■ EES Anytime Search Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 14 / 24

bounded-suboptimal using h, d, and h bounded suboptimal: Explicit Estimation Search (IJCAI, 2011)

slide-25
SLIDE 25

Anytime Search

■ Are We Done? Beyond A* Suboptimal Search Anytime Search ■ Anytime A* ■ Break Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 15 / 24

slide-26
SLIDE 26

Anytime A*

■ Are We Done? Beyond A* Suboptimal Search Anytime Search ■ Anytime A* ■ Break Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 16 / 24

  • 1. run weighted A*
  • 2. keep going after finding a goal
  • 3. keep best goal found (can test at generation)
  • 4. prune anything with f(n) > incumbent

Anytime Restarting A* (ARA*): lower weight after finding each solution Anytime EES

slide-27
SLIDE 27

Break

■ Are We Done? Beyond A* Suboptimal Search Anytime Search ■ Anytime A* ■ Break Real-time Search EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 17 / 24

asst2

scores and grades

AAAI

slide-28
SLIDE 28

Real-time Search

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search ■ RTA* ■ LSS-LRTA* ■ Search Algorithms ■ Other Algorithms EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 18 / 24

slide-29
SLIDE 29

RTA*

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search ■ RTA* ■ LSS-LRTA* ■ Search Algorithms ■ Other Algorithms EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 19 / 24

keep hash table of h values for visited states

  • 1. for each neighbor of current state s
  • 2. either find h in table or do some lookahead
  • 3. add edge cost to get f
  • 4. update h(s) to second-best f value
  • 5. move to best neighbor
slide-30
SLIDE 30

LSS-LRTA*

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search ■ RTA* ■ LSS-LRTA* ■ Search Algorithms ■ Other Algorithms EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 20 / 24

  • 1. single A* lookahead (LSS)
  • 2. update all h values in LSS
  • 3. move to frontier
slide-31
SLIDE 31

Search Algorithms

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search ■ RTA* ■ LSS-LRTA* ■ Search Algorithms ■ Other Algorithms EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 21 / 24

Uninformed: DFS, UCS Admissible: A* Limited memory: iterative deepening (IDDFS, IDA*) Satisficing: GBFS, Speedy, Beam Bounded suboptimal: wA*, EES Real-time: RTA*, LSS-LRTA*

slide-32
SLIDE 32

Other Shortest-path Algorithms

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search ■ RTA* ■ LSS-LRTA* ■ Search Algorithms ■ Other Algorithms EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 22 / 24

SMA*, IE

RBFS

Bugsy

RTAA* Course projects!

slide-33
SLIDE 33

EOLQs

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs ■ EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 23 / 24

slide-34
SLIDE 34

EOLQs

■ Are We Done? Beyond A* Suboptimal Search Anytime Search Real-time Search EOLQs ■ EOLQs

Wheeler Ruml (UNH) Lecture 4, CS 730 – 24 / 24

Please write down the most pressing question you have about the course material covered so far and put it in the box on your way out. Thanks!