A Planning Graph Heuristic for Forward-Chaining Adversarial Planning


SLIDE 1

Introduction Search Heuristic Results

A Planning Graph Heuristic for Forward-Chaining Adversarial Planning

Pascal Bercher and Robert Mattmüller

Institute for Computer Science, University of Freiburg

ECAI 2008, Patras, Greece
Wednesday, July 23, 2008

SLIDE 2

Introduction

Motivation

◮ Given: Adversarial planning problem (extensive two-player game)
◮ Desired: Strong plan (winning strategy)

Technically

◮ Two players taking turns
◮ STRIPS-style state and action encoding
◮ Full observability
◮ Reduces to evaluation of AND/OR graph over physical states
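
As a rough illustration of the last point, the sketch below labels the states of a small turn-based game graph from which the protagonist can force the goal. The graph, the state names and the function `winning_states` are hypothetical and not from the talk. Protagonist states are treated as OR nodes (one winning successor suffices) and antagonist states as AND nodes (all successors must be winning).

```python
def winning_states(successors, protagonist_moves, goals):
    """Return the states from which the protagonist can force a goal state.

    successors: dict mapping each state to a list of successor states
    protagonist_moves: set of states where the protagonist chooses (OR nodes);
        all other states are antagonist states (AND nodes)
    goals: set of goal states
    """
    won = set(goals)
    changed = True
    while changed:  # least fixed point: add states until nothing changes
        changed = False
        for state, succs in successors.items():
            if state in won or not succs:
                continue
            if state in protagonist_moves:
                ok = any(s in won for s in succs)  # OR node
            else:
                ok = all(s in won for s in succs)  # AND node
            if ok:
                won.add(state)
                changed = True
    return won


# Hypothetical toy game: the protagonist moves in s0 and s3,
# the antagonist moves in s1 and s2.
successors = {
    "s0": ["s1", "s2"],
    "s1": ["goal", "s3"],
    "s2": ["goal", "dead"],
    "s3": ["goal"],
    "goal": [],
    "dead": [],
}
won = winning_states(successors, {"s0", "s3"}, {"goal"})
print(sorted(won))  # ['goal', 's0', 's1', 's3']
```

In this toy graph a strong plan exists from s0 (move to s1), while s2 is lost because the antagonist can move to the dead end.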

SLIDE 3

Example

Problem

◮ Logistics-like problem
◮ Pilot and co-pilot have different capabilities (loading, unloading, flying, re-fuelling, no-ops)
◮ Co-pilot wants to sabotage transport task

AND/OR Graph and Solution

[Figure: AND/OR graph and solution for the example problem]

SLIDE 4

Search

◮ Alternatives:
    ◮ Symbolic regression search (cf. MBP)
    ◮ Heuristically guided explicit-state progression search
◮ Here: Variant of AO* algorithm
    ◮ Search over AND/OR graph
    ◮ Elimination of duplicate nodes
    ◮ Approximative updates of cost estimates
◮ How to initialize cost estimates at leaf nodes?
    Variant of FF heuristic.
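
The role of leaf estimates in the cost updates can be pictured with the following backup routine. It is a simplified sketch, not the authors' AO* variant: protagonist (OR) nodes take the cheapest child, antagonist (AND) nodes the most expensive one, and unexpanded leaves fall back to a heuristic value. The names `backup`, `children`, `or_nodes` and the leaf values are assumptions made for illustration.

```python
import math


def backup(node, children, or_nodes, leaf_estimate, cache=None):
    """Recursively compute a cost estimate for `node` in an acyclic AND/OR graph.

    children: dict node -> list of child nodes (missing or empty for leaves)
    or_nodes: set of nodes where the protagonist chooses (minimise)
    leaf_estimate: dict leaf -> heuristic value (math.inf for dead ends)
    cache: memo table so that duplicate nodes are evaluated only once
    """
    if cache is None:
        cache = {}
    if node in cache:
        return cache[node]
    succs = children.get(node, [])
    if not succs:
        value = leaf_estimate[node]
    else:
        costs = [backup(c, children, or_nodes, leaf_estimate, cache) for c in succs]
        # One step is spent on the move, then the best or worst child follows.
        value = 1 + (min(costs) if node in or_nodes else max(costs))
    cache[node] = value
    return value


# Hypothetical graph: root is an OR node, a and b are AND nodes, the rest are leaves.
children = {"root": ["a", "b"], "a": ["l1", "l2"], "b": ["l2", "l3"]}
leaf_estimate = {"l1": 4, "l2": 2, "l3": math.inf}
root_cost = backup("root", children, {"root"}, leaf_estimate)
print(root_cost)  # 6
```

Here node b is worthless to the protagonist because the antagonist can steer into the dead end l3, so the root inherits its estimate from a.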

SLIDE 5

Heuristic: Example

◮ Variables: v1, ..., v8
◮ Rules in relaxed problem:
    r_i = v_1 → v_{i+1}, i = 1, 2, 3, 4, 5
    r_j = v_j → v_{j+1}, j = 6, 7
◮ Rules controlled by protagonist: {r1, r2, r3, r4, r5, r7}
◮ Rules controlled by antagonist: {r1, r2, r6}
◮ Current state: {v1}
◮ Goal: {v1, ..., v8}
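
To make the example concrete, here is a minimal sketch of how the relaxed problem could be expanded into alternating protagonist and antagonist layers. The rule table is taken from the bullets above. The function `build_layers`, its return format, and the choice to test the goal only after a full protagonist/antagonist round are assumptions for illustration, chosen to match the layer structure shown on the following slides.

```python
# Rules of the relaxed problem: name -> (precondition, added variable).
RULES = {
    "r1": ("v1", "v2"), "r2": ("v1", "v3"), "r3": ("v1", "v4"),
    "r4": ("v1", "v5"), "r5": ("v1", "v6"),
    "r6": ("v6", "v7"), "r7": ("v7", "v8"),
}
PROTAGONIST = ["r1", "r2", "r3", "r4", "r5", "r7"]
ANTAGONIST = ["r1", "r2", "r6"]
GOAL = {f"v{i}" for i in range(1, 9)}


def build_layers(state, goal, max_rounds=10):
    """Alternate protagonist and antagonist layers until the goal is covered.

    Returns a list of (player, facts_before, applicable_rules) triples followed
    by a final ("end", facts, []) entry. Facts are never deleted (relaxation).
    """
    facts = set(state)
    layers = []
    for _ in range(max_rounds):
        for player, controlled in (("p", PROTAGONIST), ("a", ANTAGONIST)):
            applicable = [r for r in controlled if RULES[r][0] in facts]
            layers.append((player, frozenset(facts), applicable))
            facts |= {RULES[r][1] for r in applicable}
        if goal <= facts:
            break
    layers.append(("end", frozenset(facts), []))
    return layers


layers = build_layers({"v1"}, GOAL)
for player, facts, rules in layers:
    print(player, sorted(facts), rules)
```

With the current state {v1}, the protagonist first reaches v2 to v6, the antagonist then adds v7 via r6, and the protagonist's second layer adds v8 via r7.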

SLIDE 6

Heuristic: Relaxed Planning Graph

[Figure: relaxed planning graph with alternating fact and action layers F^p, A^p, F^a, A^a, F^p_1, A^p_1, F^a_1, A^a_1, F^p_2. Each fact layer ranges over v1, ..., v8. The protagonist action layers A^p and A^p_1 contain r1, r2, r3, r4, r5, r7; the antagonist action layers A^a and A^a_1 contain r1, r2, r6.]


SLIDE 16

Heuristic: Relaxed Plan Extraction

[Figure: relaxed plan extraction on the relaxed planning graph with layers F^p, A^p, F^a, A^a, F^p_1, A^p_1, F^a_1, A^a_1, F^p_2 over the variables v1, ..., v8, the protagonist rules r1, r2, r3, r4, r5, r7 and the antagonist rules r1, r2, r6.]
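
The extraction itself is only shown graphically. An FF-style backward pass that reproduces the rule sets listed on the postprocessing slide could look like the sketch below. Picking the first achiever of each variable is an assumption, as are the function name `extract_relaxed_plan` and all variable names; the rule table and the control sets come from the example slide.

```python
RULES = {
    "r1": ("v1", "v2"), "r2": ("v1", "v3"), "r3": ("v1", "v4"),
    "r4": ("v1", "v5"), "r5": ("v1", "v6"),
    "r6": ("v6", "v7"), "r7": ("v7", "v8"),
}
CONTROL = {
    "p": ["r1", "r2", "r3", "r4", "r5", "r7"],  # protagonist
    "a": ["r1", "r2", "r6"],                    # antagonist
}


def extract_relaxed_plan(state, goal):
    """Select, per player, the rules that first achieve the goal variables."""
    # Forward pass: remember which player and rule first adds each variable.
    facts = set(state)
    first_achiever = {}
    while not goal <= facts:
        before = set(facts)
        for player in ("p", "a"):
            applicable = [r for r in CONTROL[player] if RULES[r][0] in facts]
            for rule in applicable:
                added = RULES[rule][1]
                if added not in facts and added not in first_achiever:
                    first_achiever[added] = (player, rule)
            facts |= {RULES[r][1] for r in applicable}
        if facts == before:
            raise ValueError("goal unreachable even in the relaxed problem")

    # Backward pass: support every open goal by its first achiever,
    # then open the precondition of the chosen rule.
    selected = {"p": set(), "a": set()}
    open_goals = [v for v in goal if v not in state]
    while open_goals:
        var = open_goals.pop()
        player, rule = first_achiever[var]
        if rule in selected[player]:
            continue
        selected[player].add(rule)
        precondition = RULES[rule][0]
        if precondition not in state:
            open_goals.append(precondition)
    return selected


goal = {f"v{i}" for i in range(1, 9)}
selected = extract_relaxed_plan({"v1"}, goal)
print(sorted(selected["p"]), sorted(selected["a"]))
# ['r1', 'r2', 'r3', 'r4', 'r5', 'r7'] ['r6']
```

Because the protagonist layer comes first, r1 and r2 are attributed to the protagonist here even though the antagonist also controls them.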


SLIDE 26

Heuristic: Relaxed Plan Postprocessing

◮ Selected rules for protagonist: {r1, r2, r3, r4, r5, r7}
◮ Selected rules for antagonist: {r6}
◮ Redistribution of rules:
    ◮ Revised selected rules for protagonist: {r3, r4, r5, r7}
    ◮ Revised selected rules for antagonist: {r1, r2, r6}
◮ Return heuristic value 2 · |{r3, r4, r5, r7}| = 8.
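
The arithmetic of this step can be replayed in a few lines. The redistribution rule used here (hand every protagonist-selected rule that the antagonist also controls over to the antagonist) is inferred from the example sets and may not cover the general case; the function name `redistribute` is hypothetical. The factor 2 is taken from the slide as given.

```python
def redistribute(selected_p, selected_a, controlled_a):
    """Move protagonist-selected rules that the antagonist also controls."""
    shared = selected_p & controlled_a
    return selected_p - shared, selected_a | shared


selected_p = {"r1", "r2", "r3", "r4", "r5", "r7"}
selected_a = {"r6"}
controlled_a = {"r1", "r2", "r6"}  # rules controlled by the antagonist

revised_p, revised_a = redistribute(selected_p, selected_a, controlled_a)
h = 2 * len(revised_p)
print(sorted(revised_p), sorted(revised_a), h)
# ['r3', 'r4', 'r5', 'r7'] ['r1', 'r2', 'r6'] 8
```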

SLIDE 27

Experiments and Results

◮ Logistics-like problems as in the example, varying problem sizes
◮ Comparison of breadth-first search, AO* search with FF heuristic and adversarial FF heuristic, and MBP.

                  BFS       AO* + h_FF  AO* + h_adv.-FF              MBP
ℓ  p     time   nodes     time   nodes     time   nodes     time     BDD
2  1    0.014      44    0.025      37    0.026      37    0.000    6601
2  2    0.048     152    0.071      88    0.072      78    0.016   84424
3  3    0.354    2106    0.202     625    0.260     628    0.380   23068
3  4    0.870    8211    0.463    1871    0.232     605    1.780  165718
3  5    5.556   43785    1.437    6917    0.321     794    9.041  365272
3  6   87.691  237264   16.323   63498    1.157    4164   44.287  546666
4  6        -  722750   76.718  169349   82.701  194304  130.064  834704
4  7        -  771629  373.553  510738   99.639  225544        -       -

ℓ: #locations, p: #packages, BDD: #BDD nodes, red: worst, blue: best

SLIDE 28

Conclusion

◮ Domain-independent heuristics are a promising approach to conditional/adversarial planning
◮ Explicit-state progression is competitive with symbolic regression
◮ Potential application in General Game Playing
◮ Future work: Assessment of other domain-independent heuristics in the conditional setting
