CS541 Review Jim Blythe 2 Planning for the Grid USC INFORMATION - - PDF document

cs541 review
SMART_READER_LITE
LIVE PREVIEW

CS541 Review Jim Blythe 2 Planning for the Grid USC INFORMATION - - PDF document

CS541 Review Jim Blythe 2 Planning for the Grid USC INFORMATION SCIENCES INSTITUTE LIGOs Pulsar Search (Laser Interferometer Gravitational-wave Observatory) I nterferom archive eter Short Extract Fourier transpose channel Transform


slide-1
SLIDE 1

CS541 Review

Jim Blythe

slide-2
SLIDE 2

2

USC INFORMATION SCIENCES INSTITUTE

Planning for the Grid

slide-3
SLIDE 3

3

USC INFORMATION SCIENCES INSTITUTE

LIGO’s Pulsar Search (Laser Interferometer Gravitational-wave Observatory)

archive

I nterferom eter Long tim e fram es Store raw channels Short tim e fram es

Hz Time

Single Fram e Extract channel transpose Tim e- frequency I m age Find Candidate event DB Short Fourier Transform 3 0 m inutes Extract frequency range Construct image

slide-4
SLIDE 4

4

USC INFORMATION SCIENCES INSTITUTE

Operators include data dependencies, host and resource constraints.

(operator pulsar-search (preconds ((<host> (or Condor-pool Mpi)) (<start-time> Number) (<channel> Channel) (<fcenter> Number) (<right-ascension> Number) (<sample-rate> Number) (<file> File-Handle) ;; These two are parameters for the frequency-extract. (<f0> (and Number (get-low-freq-from-center-and-band <fcenter> <fband>))) (<fN> (and Number (get-high-freq-from-center-and-band <fcenter> <fband>))) (<run-time> (and Number (estimate-pulsar-search-run-time <start-time> <end-time> <sample-rate> <f0> <fN> <host> <run-time>))) …) (and (available pulsar-search <host>) (forall ((<sub-sft-file-group> (and File-Group-Handle (gen-sub-sft-range-for-pulsar-search <f0> <fN> <start-time> <end-time> <sub-sft-file-group>)))) (and (sub-sft-group <start-time> <end-time> <channel> <instrument> <format> <f0> <fN> <sample-rate> <sub-sft-file-group>) (at <sub-sft-file-group> <host>))))) (effects () ( (add (created <file>)) (add (at <file> <host>)) (add (pulsar <start-time> <end-time> <channel> <instrument> <format> <fcenter> <fband> <fderv1> <fderv2> <fderv3> <fderv4> <fderv5> <right-ascension> <declination> <sample-rate> <file>)) ) ))

slide-5
SLIDE 5

5

USC INFORMATION SCIENCES INSTITUTE

Grid Grid Grid

workflow executor (DAGman)

Execution Workflow Planning

Globus Replica Location Service Globus Monitoring and Discovery Service

Information and Models

Metadata Catalog Service Resource Models

detector

Raw data

Concrete Workflow tasks

Models and current state information Monitoring information High-level specs of desired results and intermediate data products

Dynamic information

Request Manager Current State Generator Submission and Monitoring System

AI-based Planner

slide-6
SLIDE 6

6

USC INFORMATION SCIENCES INSTITUTE

Temporal logics for planning

slide-7
SLIDE 7

7

USC INFORMATION SCIENCES INSTITUTE

Fahiem Bacchus

slide-8
SLIDE 8

8

USC INFORMATION SCIENCES INSTITUTE

Fahiem Bacchus

slide-9
SLIDE 9

9

USC INFORMATION SCIENCES INSTITUTE

Heuristic search planning

slide-10
SLIDE 10

10

USC INFORMATION SCIENCES INSTITUTE

Derive cost estimate from a relaxed planning problem

Ignore the deletes on actions BUT – still NP-hard, so approximate: For individual propositions p:

d(s, p) = 0 if p is true in s = 1 + min(d(s, pre(a))) otherwise [min over actions a that add p]

slide-11
SLIDE 11

11

USC INFORMATION SCIENCES INSTITUTE

HSP2 overview

Best-first search, using h+ Based on WA* - weighted A*:

f(n) = g(n) + W * h(n). If W = 1, it’s A* (with admissible h). If W > 1, it’s a little greedy – generally finds solutions faster, but not optimal.

In HSP2, W = 5

slide-12
SLIDE 12

12

USC INFORMATION SCIENCES INSTITUTE

HSPr problem space

States are sets of atoms (correspond to sets of states

in original space)

initial state is the goal G Goal states are those that are true in s0 (initial state

in planning problem)

Still use h+. h+(s) = sum g(s0, p)

slide-13
SLIDE 13

13

USC INFORMATION SCIENCES INSTITUTE

Mutexes in HSPr, take 2

Better definition:

A set M of pairs R = {p, q} is a mutex set if (1) R is not true in s0 (2) for every op o that adds p, either o deletes q

  • r o does not add q, and for some precond r of o,

{r, q} is in M. Recursive definition allows for some interaction of the

  • perators
slide-14
SLIDE 14

14

USC INFORMATION SCIENCES INSTITUTE

Temporal reasoning and scheduling

slide-15
SLIDE 15

15

USC INFORMATION SCIENCES INSTITUTE

Temporal planning with mutual exclusion relation

Propositions and actions are monotonically

increasing, no-goods monotonically decreasing:

slide-16
SLIDE 16

16

USC INFORMATION SCIENCES INSTITUTE

ASPEN

Combine planning and scheduling steps as

alternative ‘conflict repair’ operations

Activities have start time, end time, duration Maintain ‘most-commitment’ approach – easier to

reason about temporal dependencies with full information

C.f. TLPlan

slide-17
SLIDE 17

17

USC INFORMATION SCIENCES INSTITUTE

Contributors for a non-depletable resource violation

slide-18
SLIDE 18

18

USC INFORMATION SCIENCES INSTITUTE

Contributors for a depletable resource violation

slide-19
SLIDE 19

19

USC INFORMATION SCIENCES INSTITUTE

Learning search control knowledge and case-based planning

slide-20
SLIDE 20

20

USC INFORMATION SCIENCES INSTITUTE

Using EBL to improve plan quality

Given: planning domain, evaluation function

planner’s plan, a better plan

Learn: control knowledge to produce the better plan Explanation used: explain why the alternative plan is

better

Target concept: control rules that make choices

based on the planner state and meta-state

slide-21
SLIDE 21

21

USC INFORMATION SCIENCES INSTITUTE

Architecture of Quality system

slide-22
SLIDE 22

22

USC INFORMATION SCIENCES INSTITUTE

Explaining better plans recursively: target concept: shared subgoal

slide-23
SLIDE 23

23

USC INFORMATION SCIENCES INSTITUTE

Hamlet: blame assignment

slide-24
SLIDE 24

24

USC INFORMATION SCIENCES INSTITUTE

Probabilistic planning

slide-25
SLIDE 25

25

USC INFORMATION SCIENCES INSTITUTE

Sources of uncertainty

Incomplete knowledge of the world (uncertain initial

state)

Non-deterministic effects of actions Effects of external agents or state dynamics.

slide-26
SLIDE 26

26

USC INFORMATION SCIENCES INSTITUTE

Dealing with uncertainty: re-planning and conditional planning

Re-planning:

Make a plan assuming nothing bad will happen Build a new plan if a problem is found (either re-plan to the goal state or try to repair the plan) In some cases, this is too late.

Deal with contingencies (plans for bad outcomes) at planning

time, before they occur.

Can’t plan for every contingency, so need to prioritize Implies sensing

Build a plan that reduces the number of contingencies requires

(conformant planning)

May not be possible

slide-27
SLIDE 27

27

USC INFORMATION SCIENCES INSTITUTE

A Buridan plan based on SNLP

slide-28
SLIDE 28

28

USC INFORMATION SCIENCES INSTITUTE

Computing the probability of success 2: Bayes nets

Time-stamped literal node Action outcome node

What is the worst-case time complexity

  • f this

algorithm?

slide-29
SLIDE 29

29

USC INFORMATION SCIENCES INSTITUTE

MAXPLAN

Inspired by SATPLAN. Compile planning problem to an instance

  • f E-MAJSAT

E-MAJSAT: given a boolean formula with variables that are

either choice variables or chance variables, find an assignment to the choice variables that maximizes the probability that the formula is true.

Choice variables: we can control them

e.g. which action to use

Chance variables: we cannot control them

e.g. the weather, the outcome of each action, ..

Then use standard algorithm to compute and maximize

probability of success

slide-30
SLIDE 30

30

USC INFORMATION SCIENCES INSTITUTE

Probabilistic planning: exogenous events

slide-31
SLIDE 31

31

USC INFORMATION SCIENCES INSTITUTE

Representing external sources of change

Model actions that external agents can take in the same way as actions that the planner can take. (event oil-spills (probability 0.1) (preconds (and (oil-in-tanker <sea-sector>) (poor-weather <sea-sector>))) (effects (del (oil-in-tanker <sea-sector>)) (add (oil-in-sea <sea-sector>))))

slide-32
SLIDE 32

32

USC INFORMATION SCIENCES INSTITUTE

Computing the probability of success using a Bayes net

slide-33
SLIDE 33

33

USC INFORMATION SCIENCES INSTITUTE

Example: the weather events and the corresponding markov chain

The markov chain shows possible states independent of time. As long as transition probabilities are independent of time, the

probability of the state at some future time t can be computed in logarithmic time complexity in t.

The computation time is polynomial in the number of states in

the markov chain.

slide-34
SLIDE 34

34

USC INFORMATION SCIENCES INSTITUTE

The event graph

Captures the dependencies between events needed to build

small but correct markov chains.

Any event whose literals should be included will be an ancestor

  • f the events governing objective literals.
slide-35
SLIDE 35

35

USC INFORMATION SCIENCES INSTITUTE

Probabilistic planning: structured policy iteration

slide-36
SLIDE 36

Craig Boutilier

36

USC INFORMATION SCIENCES INSTITUTE

Structured representation

States decomposable into state variables Structured representations the norm in AI

STRIPS, Sit-Calc., Bayesian networks, etc. Describe how actions affect/depend on features Natural, concise, can be exploited computationally

Same ideas can be used for MDPs

actions, rewards, policies, value functions, etc. dynamic Bayes nets [DeanKanazawa89,BouDeaGol95] decision trees and diagrams [BouDeaGol95,Hoeyetal99]

slide-37
SLIDE 37

Craig Boutilier

37

USC INFORMATION SCIENCES INSTITUTE

Action Representation – DBN/ADD

slide-38
SLIDE 38

Craig Boutilier

38

USC INFORMATION SCIENCES INSTITUTE

Structured Policy and Value Function