Random Walk Planning: Theory, Practice, and Application Hootan - - PowerPoint PPT Presentation

random walk planning theory practice and application
SMART_READER_LITE
LIVE PREVIEW

Random Walk Planning: Theory, Practice, and Application Hootan - - PowerPoint PPT Presentation

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions Random Walk Planning: Theory, Practice, and Application Hootan Nakhost University of Alberta, Canada Google Canada since May 2013 May 9, 2012


slide-1
SLIDE 1

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Random Walk Planning: Theory, Practice, and Application

Hootan Nakhost

University of Alberta, Canada Google Canada since May 2013

May 9, 2012

slide-2
SLIDE 2

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Outline

RW Planning

Empirical study of the design space Design Why does it work? Application Inefficient plans RW Theory Resource-constrained Planning Postprocessing

slide-3
SLIDE 3

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-4
SLIDE 4

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Automated Planning Given a model of the world, generate a plan to achieve predefined goals Applications Autonomous agents General solvers

slide-5
SLIDE 5

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Classical Representations (STRIPS) State Each state is a set of propositions

A B

{On(B, A), Ontable(A), Clear(B)}

Action Each action has preconditions, positive and negative effects

A B

{OnTable(A), Holding(B)}

Plan A sequence of actions that starts from the initial state and ends in s ⊇ G

slide-6
SLIDE 6

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Planning Methods Heuristic Search Common standard systematic search algorithms such as Greedy Best First Search (GBFS) and WA* Contribution A new search paradigm for satisficing planning: random walk (RW) search

slide-7
SLIDE 7

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-8
SLIDE 8

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Why Random Walks? Random Walk A sequence of randomly selected actions High level and Intuitive Explanations Escaping faster from plateaus More exploration Not wasting time in dead-ends A theoretical model can explain ... What are the key features affecting the performance How we can improve the algorithms

slide-9
SLIDE 9

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

A Motivating Example: Transportation Domain

slide-10
SLIDE 10

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Random Walks vs. Systematic Search

slide-11
SLIDE 11

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Theoretical Analysis of RW Planning Graph properties affecting RW performance Progress Chance(PC) Regress Chance(RC) Regress Factor(RF) PC = 1 4, RC = 1 2, RF = RC PC = 2

slide-12
SLIDE 12

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Definitions: Fairness and Hitting Time Fairness A single state transition in the graph cannot change the goal distance by more than one unit. Every undirected graph is a fair graph. Hitting Time The expected number of steps in a random walk starting from the initial state and ending in the goal for the first time.

slide-13
SLIDE 13

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Fair Strongly Homogenous Graph (FSHG) p = progress chance q = regress chance D = largest goal distance

slide-14
SLIDE 14

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Theorem: Hitting time in FSHG hx =

  • Θ
  • β0λD + β1dx
  • if q = p

Θ (α1Ddx) if q = p where λ = q p, β0 = q (p − q)2 , β1 = 1 p − q , α0 = 1 2p, α1 = 1 p

slide-15
SLIDE 15

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Bounds for more general graphs qi = maximum regress chance at the goal distancei pi = minimum progress chance at the goal distancei

slide-16
SLIDE 16

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Analysis of the Transport Example RCmax = PCmin = 1 2 × |trucks| hx = Ddx p

slide-17
SLIDE 17

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Fair Homogenous Graph (FHG) pi = progress chance at goal distance i qi = regress chance at goal distance i D = largest goal distance

slide-18
SLIDE 18

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Hitting time in FHG hx =

dx

  • d=1

 βD

D−1

  • i=d

λi +

D−1

  • j=d

 βj

j−1

  • i=d

λi     where for all 1 ≤ d ≤ D, λd = qd pd , βd = 1 pd

slide-19
SLIDE 19

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Theory for Random Walks with Restart Restarting Random Walks At each step with probability r restart from the initial state Hitting Time hx ∈ O

  • βλdx−1

where λ = q p + r p(1 − r) + 1

  • , β = q + r

pr

slide-20
SLIDE 20

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Findings Determined the key features of the search space affecting RW

Regress factor RF Largest goal distance D Initial goal distance d

Provides valuable insights to design RW planners

Biasing action selection Restarting frequency r

slide-21
SLIDE 21

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-22
SLIDE 22

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RW Search The General Framework Use forward chaining Local Search In each step, run random walks to find the next state Use restarts to recover from unpromising search regions

slide-23
SLIDE 23

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RWS Framework: an Illustration

9

slide-24
SLIDE 24

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RWS Framework: an Illustration

∞ 65 14 9 15 9 9 10 7 10 10 14 13 14 14 10

slide-25
SLIDE 25

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RWS Framework: an Illustration

∞ 65 14 9 15 9 9 10 7 10 10 14 13 14 14 10

slide-26
SLIDE 26

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RWS Framework: an Illustration

∞ 65 14 9 15 9 9 7 7 7 43 7 7 2 7 7 9 10 7 10 10 14 13 14 14 10

slide-27
SLIDE 27

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

RWS Framework: an Illustration

∞ 65 14 9 15 9 9 7 7 7 43 7 7 2 7 7 9 10 7 10 10 14 13 14 14 10

slide-28
SLIDE 28

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

A Basic RW planner Walk Length Use a local restarting rate rl: at each step terminate the walk with probability rl Restarting Use a restarting threshold tg: restart the search when the last tg walks have not reached lower heuristic

slide-29
SLIDE 29

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Experimental Study of the Design Space Local Exploration Length of Walks Evaluation Rate Action Selection Bias Global Exploration Jumping Strategies Restarting Strategies Heuristic function Type of the heuristic function The accuracy of the heuristic function

slide-30
SLIDE 30

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Two Practical Outcomes Learning systems that adapt parameters to the input problem Effective Biasing techniques

slide-31
SLIDE 31

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

The Effect of Restarting Threshold: Elevators 03

50 100 150 200 250 300 350 400 10000 20000 30000 40000 50000

  • Min. Heuristic Value
  • No. of Walks

Fast Restarting Slow Restarting

slide-32
SLIDE 32

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

The Effect of Restarting Threshold: Floortile 01

10 20 30 40 50 60 70 10000 20000 30000 40000 50000

  • Min. Heuristic Value
  • No. of Walks

Fast Restarting Slow Restarting

slide-33
SLIDE 33

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Adaptive Global Restarting (AGR) Let Vw be the average heuristic improvement per walk AGR continually estimates Vw and sets tg = h0

Vw

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ elevators ¡ floor6le ¡ nomystery ¡ parcprinter ¡ parking ¡ scanalyzer ¡ sokoban ¡ 6dybot ¡ visitall ¡ woodworking ¡ total ¡ Coverage ¡

rl=0.01 ¡

tg=100 ¡ tg=1000 ¡ tg=10000 ¡ AGR ¡

slide-34
SLIDE 34

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Comparison with GBFS

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ b a r m a n ¡ e l e v a t

  • r

s ¡ fl

  • r

9 l e ¡ n

  • m

y s t e r y ¡

  • p

e n s t a c k s ¡ p a r c p r i n t e r ¡ p a r k i n g ¡ p e g s

  • l

¡ s c a n a l y z e r ¡ s

  • k
  • b

a n ¡ 9 d y b

  • t

¡ t r a n s p

  • r

t ¡ v i s i t a l l ¡ w

  • d

w

  • r

k i n g ¡ t

  • t

a l ¡ Coverage ¡ GBFS ¡ RWS ¡

slide-35
SLIDE 35

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Comparison with EHC

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ barman ¡ elevators ¡ floor9le ¡ nomystery ¡

  • penstacks ¡

parcprinter ¡ parking ¡ pegsol ¡ scanalyzer ¡ sokoban ¡ 9dybot ¡ transport ¡ visitall ¡ woodworking ¡ total ¡ Coverage ¡ EHC ¡ RWS ¡

slide-36
SLIDE 36

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Biasing Action Selections Monte Carlo Helpful Actions (MHA) MHA gives a higher priority to preferred operators. P(a, s) = eQ(a)/T n

b∈A(s) eQ(b)/T

slide-37
SLIDE 37

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

MHA vs. Uniform Action Selection

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ barman ¡ elevators ¡ floor9le ¡ nomystery ¡

  • penstacks ¡

parcprinter ¡ parking ¡ pegsol ¡ scanalyzer ¡ sokoban ¡ 9dybot ¡ transport ¡ visitall ¡ woodworking ¡ total ¡ Coverage ¡ RWS ¡ RWS+PO(MHA) ¡

slide-38
SLIDE 38

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

MHA vs. GBFS+PO

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ barman ¡ elevators ¡ floor9le ¡ nomystery ¡

  • penstacks ¡

parcprinter ¡ parking ¡ pegsol ¡ scanalyzer ¡ sokoban ¡ 9dybot ¡ transport ¡ visitall ¡ woodworking ¡ total ¡ Coverage ¡ GBFS+PO ¡ RWS+PO(MHA) ¡

slide-39
SLIDE 39

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Building a Planning System Combine several techniques that complement each other Examples Multiple heuristics in LAMA and Fast Downward Multiple search strategies in Fast Forward and FD Stone Soup

slide-40
SLIDE 40

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Learning the Best Configuration

Config1 Config2 Config3 Learning Algorithm Planner Problem

slide-41
SLIDE 41

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Comparing Arvand-2013 with Top Satisficing Planners

Table: IPC problems without Derived Predicates

  • No. of Problems Arvand-2013 LAMA-2011 FDFSS2 Probe Roamer

1661 1552 1540 1533 1422 1507 Table: All IPC problems

  • No. of Problems Arvand-2013 LAMA-2011 FDFSS2 Probe Roamer

1857 1666 1659 1668 – 1635

slide-42
SLIDE 42

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

The Gap Between RW and Systematic Planning

Domains Arvand-2013 LAMA-2011

Airport (50)

44 31

Notankage (50)

50 44

Sokoban (20)

1 19

Storage (30)

30 19

Tankage (50)

44 41

Woodworking (30)

14 20

Philosophers (48)

44 34

PSR Large (50)

19 31

PSR Middle (50)

43 50

slide-43
SLIDE 43

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-44
SLIDE 44

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Reasoning about Resources Examples of limited resources Fuel, energy, money, time Model: not replenishable resources Initial supply Some actions consume resources

slide-45
SLIDE 45

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Limitation of the Current Methods Relaxation heuristics do not model resource consumption at all Greedy search algorithms add more problems

slide-46
SLIDE 46

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Improvements to Arvand for RCP Smart Restarting (SR) On-path Search Continuation (OPSC)

slide-47
SLIDE 47

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Basic Restarting in an Example

0 ¡ 5 ¡ 10 ¡ 15 ¡ 20 ¡ 25 ¡ 30 ¡ 35 ¡ 40 ¡ 45 ¡ 50 ¡ 0 ¡ 50 ¡ 100 ¡ 150 ¡ 200 ¡ 250 ¡ 300 ¡ 350 ¡ 400 ¡ 450 ¡ 500 ¡ 550 ¡ 600 ¡ 650 ¡ Minimum ¡h ¡ Number ¡of ¡Restarts ¡

slide-48
SLIDE 48

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Smart Restarting Algorithm Maintain a pool of most promising episodes performed When an episode gets stuck restart from a state visited in an episode in the pool

!" #" $!" $#" %!" %#" &!" &#" '!" '#" #!" !" #!" $!!" $#!" %!!" %#!" &!!" &#!" '!!" '#!" #!!" ##!" (!!" (#!" !"#"$%$&'& (%$)*+&,-&.*/01+0/&

Most Promising Episodes

slide-49
SLIDE 49

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Smart Restarting in an Example

0 ¡ 5 ¡ 10 ¡ 15 ¡ 20 ¡ 25 ¡ 30 ¡ 35 ¡ 40 ¡ 45 ¡ 50 ¡ 0 ¡ 50 ¡ 100 ¡ 150 ¡ 200 ¡ 250 ¡ 300 ¡ 350 ¡ 400 ¡ 450 ¡ 500 ¡ 550 ¡ 600 ¡ 650 ¡ Minimum ¡h ¡ Number ¡of ¡Restarts ¡

slide-50
SLIDE 50

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

How to test RCP planners? Performance as a function of constrainedness Resource constrainedness C (Hoffmann et. al. IJCAI-2007) C = initial supply minimum need The closer C is to 1, the more constrained is the problem. My Contributions Extended the definition of C to multiple resources Developed two new benchmarks for RCP

slide-51
SLIDE 51

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Experiments 3 RCP Domains NoMystery, Rovers, TPP 8 Satisficing Planners Arvand, FD-AT1, FD-AT2, LAMA, FF , LPG, M, Mp, LPRPGP 5 Optimal Planners Num-2-sat, LM-cut, Merge and Shrink, Selmax, FD-AT-OPT

slide-52
SLIDE 52

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: Rovers, small

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ LAMA ¡ FD-­‑AT1 ¡ FD-­‑AT2 ¡ Mp ¡ LPG ¡ M ¡

slide-53
SLIDE 53

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: Rovers, small

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ A2 ¡ A2(SR) ¡ Arvand ¡ LAMA ¡ FD-­‑AT1 ¡ FD-­‑AT2 ¡ Mp ¡ LPG ¡ M ¡

slide-54
SLIDE 54

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: Rovers, large

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ LAMA ¡ FD-­‑AT1 ¡ LPG ¡

slide-55
SLIDE 55

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: Rovers, large

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ A2 ¡ A2(SR) ¡ Arvand ¡ LAMA ¡ FD-­‑AT1 ¡ LPG ¡

slide-56
SLIDE 56

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: NoMystery, large

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ LAMA ¡ FD-­‑AT1 ¡ FD-­‑AT2 ¡ Mp ¡ LPG ¡ M ¡ FF ¡ LPRPGP ¡

slide-57
SLIDE 57

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Results: NoMystery, large

0% ¡ 10% ¡ 20% ¡ 30% ¡ 40% ¡ 50% ¡ 60% ¡ 70% ¡ 80% ¡ 90% ¡ 100% ¡ 1.0 ¡ 1.1 ¡ 1.2 ¡ 1.3 ¡ 1.4 ¡ 1.5 ¡ 1.6 ¡ 1.7 ¡ 1.8 ¡ 1.9 ¡ 2.0 ¡ Coverage ¡ C ¡ A2 ¡ A2(OPSC) ¡ A2(SR) ¡ Arvand ¡ LAMA ¡ FD-­‑AT1 ¡ FD-­‑AT2 ¡ Mp ¡ LPG ¡ M ¡ FF ¡ LPRPGP ¡

slide-58
SLIDE 58

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-59
SLIDE 59

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Plan Improvement RW planning can generate bad-quality solutions Idea Develop fast post-processing techniques to improve the solutions Outcome: Aras A postprocessor that works well for a wide range of planners Even for those like LAMA that are well-designed to generate good-quality solutions

slide-60
SLIDE 60

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Plan Neighborhood Graph Search (PNGS)

Initial Plan Improved Plan Goal State

slide-61
SLIDE 61

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Anytime PNGS Iteratively increase the expansion limit Each iteration starts with last plan generated in previous iterations

slide-62
SLIDE 62

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Experiments Compare state-of-the-art planners with and without plan improvement on IPC domains Scoring function: the cost of the best plan produced by any planner divided by the cost of the generated plan Issue: how to divide time between planner and postprocessor

slide-63
SLIDE 63

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Cutoff Time Run the planner until a cutoff time is reached

If no solution is found, keep running until the first solution is found

Use the postprocessor to improve the best generated plan

slide-64
SLIDE 64

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

IPC-2008 PNGS

208.36 199.68 216.27 197.79 183.53 249.40 170.02 160.05 174.32 160.53 135.02 235.06 0.00 50.00 100.00 150.00 200.00 250.00 300.00 FF_sa FF_ha FF C3 ARVAND LAMA Total Score Base PNGS

slide-65
SLIDE 65

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Integration of Arvand-2013 and Aras Repeat until the time limit (30 min.) is reached:

Run Arvand-2013 until a solution s is found Run Aras to improve s until a memory/time limit (2 GB) is reached

The cost of the best previous plan is used for prunning Report the best plan found as the result

slide-66
SLIDE 66

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Arvand-2013 vs. Top Planner (Solution Quality)

slide-67
SLIDE 67

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Random Walk Planners Arvand-2009: Establishing the foundation Arvand-RC: Using RW Search for RCP Arvand-2011: Learning the Best Configuration and Using Aras Arvand-LS: RandomWalks with Memory ArvandHerd: Parallel portfolio

slide-68
SLIDE 68

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

1

Automated Planning

2

RW Theory

3

RW Search

4

Application

5

Plan Improvement

6

Systems

7

Conclusions

slide-69
SLIDE 69

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Contributions RW search as an effective framework for satisficing planning A theoretical framework for studying RW search

Determined key features affecting RW Explained where and why RW exploration is effective

A detailed experimental study of design space

Built effective learning systems that adapt parameters Built efficient biasing techniques Gained valuable insights regarding the effects of different parameters

slide-70
SLIDE 70

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Contributions Application of RW search to RCP

Extended the definition of C to multiple resources Developed of new benchmarks Significantly improved the state of the art

Aras: a very effective postprocessing system Several strong planning systems

Arvand-2009: Establishing the foundation Arvand-2011: Configuration learner and Aras Arvand-2013: Empirical study of the design space Arvand-RC: Using RW search for RCP Arvand-LS: RW with memory ArvandHerd: Parallel portfolio

slide-71
SLIDE 71

Automated Planning RW Theory RW Search Application Plan Improvement Systems Conclusions

Thank you for your attention!