11/27/2006 Massachusetts Institute of Technology Outline - - PDF document

11 27 2006
SMART_READER_LITE
LIVE PREVIEW

11/27/2006 Massachusetts Institute of Technology Outline - - PDF document

11/27/2006 Massachusetts Institute of Technology Outline Motivation Example Generalized Conflict Learning for Problem Formulation Hybrid Discrete/Linear Optimization The GCD-BB algorithm Empirical Evaluation Conclusion


slide-1
SLIDE 1

11/27/2006 1

Massachusetts Institute of Technology

Generalized Conflict Learning for Hybrid Discrete/Linear Optimization

Hui Li and Brian Williams

MIT June 6, 2005 ICAPS Workshop on Plan Execution

2

Outline

  • Motivation Example
  • Problem Formulation
  • The GCD-BB algorithm
  • Empirical Evaluation
  • Conclusion & Future Work

3

Motivation

The continuous model-based execution problem:

  • A dynamic system (a plant)
  • A desired evolution of the plant state over time (a temporally flexible

state plan)

  • A consistent control sequence

Fire fighting example [Léauté-AAAI-05]

  • Two UAVs
  • A fire to extinguish
  • Avoid obstacles
  • Drop water
  • Assess the damage

4

The state plan of the fire fighting example:

Motivation

Receding horizon continuous planner:

5

Encodings

In [Léauté-AAAI-05] the plant and the state plan within each limited horizon are encoded in disjunctive linear programs (DLPs) [Balas-ADM-79].

  • State plan encoding example
  • Plant model encoding example

A fast algorithm is needed to solve the DLPs for each horizon. Generalized Conflict-Directed Branch and Bound

6

Disjunctive Linear Programs

Definition:

i j

Example:

clause

slide-2
SLIDE 2

11/27/2006 2

7

Other Formulations

Binary Integer Program (BIP):

DLP:

  • LCNF [Wolfman-IJCAI-99]

“Trigger” linear inequalities with propositional variables

  • Mixed Logical Linear Programs (MLLPs)

[Hooker-JDAM-99]

  • Generalization from LCNF
  • optimization
  • variables over finite domain
  • logic forms other than CNF

8

Generalized Conflict-Directed Branch and Bound

  • Building upon Branch and Bound (B&B)
  • Two key features:

– Generalized Conflict Learning

  • infeasibility
  • sub-optimality

– Forward Conflict-Directed Search

  • constituent kernel
  • kernel
  • DLP candidate

9

Branch and Bound

  • Discrete + Continuous
  • “Branch”
  • “Bound”

For BIPs

y≤20 x≤10 B1 B2 y≤0 x≥30 x≥80 C1 C2 C3 y≤30 x≤100 A1 A2

min -x-3y s.t. x≤200 y≤200 x≤100 min -x-3y s.t. x≤200 y≤200 y≤30 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 min -x-3y s.t. x≤200 y≤200 x≤100 y≤20

For DLPs

root minimize -x-3y s.t. x≤200 y≤200 y≤30 V x≤100 y≤20 V x≤10 x≥80 V x≥30 V y≤0 min -x-3y s.t. x≤200 y≤200

For DLPs

min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 y≤0 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 x≥30 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 x≥80

Incumbent

  • 160

an upper bound x≤10

min -x-3y s.t. x≤200 y≤200 y≤30 x≤10

p' is a relaxed LP of an optimization problem p, if the feasible region of p’ contains the feasible region of p, and they have the same objective function. a lower bound

10

Generalized Conflict-Directed Branch and Bound

  • Building upon Branch and Bound (B&B)
  • Two key features:

– Generalized Conflict Learning

  • infeasibility
  • sub-optimality.

– Forward Conflict-Directed Search

  • constituent kernel
  • kernel
  • DLP candidate

11

Generalized Conflict Learning

  • Infeasibility conflict
  • Sub-optimality conflict

An infeasibility conflict, since the constraints are not satisfiable for any value of x. A sub-optimality conflict, since the best solution that satisfies it has value -100 > f(x*).

Incumbent: x*(200,0)

A minimal infeasibility conflict A minimal sub-optimality conflict

12

Minimal Conflict Extraction

  • Extract minimal conflicts rather than any conflicts
  • Novel methods based on duality theory
  • No overhead LPs incurred

LP Solver

Reduce the constraint matrix to linearly independent rows Run the dual simplex method Identify the constraints of the minimal conflict with the non-zero elements of the extreme ray

An relaxed LP

If the dual problem is unbounded? (an extreme ray is discovered?) yes no Output optimal solution

LP Solver Reduce the constraint matrix to linearly independent rows Run the dual simplex method

An relaxed LP

If the dual problem is unbounded? (an extreme ray is discovered?)

yes no

Output optimal solution

Extract sub-optimality conflict? yes Take the dual solution vector If more than n elements of the dual vector is non-zero? Identify the minimal conflict with any n of the non-zero elements Identify the minimal conflict with all the non-zero elements no no yes

slide-3
SLIDE 3

11/27/2006 3

13

Generalized Conflict-Directed Branch and Bound

  • Building upon Branch and Bound (B&B)
  • Two key features:

– Generalized Conflict Learning

  • infeasibility
  • sub-optimality.

– Forward Conflict-Directed Search

  • constituent kernel
  • kernel
  • DLP candidate

14

Forward Conflict-Directed Search

  • Backward conflict-directed methods use conflicts to select

backtrack points and as a cache to prune nodes without testing consistency.

– dependency-directed backtracking [Stallman77] – conflict-directed backjumping [Prosser93] – dynamic backtracking [Ginsberg93] – LPSAT [Wolfman99].

  • Forward conflict-directed search guides the forward step of

search away from regions of the state space that are ruled out by known conflicts [Williams - CD-A* - JDAM05].

  • Our experimental results on a range of cooperative vehicle plan

execution problems show that forward conflict-directed search significantly outperforms backtrack search with conflicts.

15

Forward Conflict-Directed Search

Generate Constituent Kernels Generate Kernels Generate DLP Candidates

A constituent kernel is a minimal description of the states that resolve a conflict. In the context

  • f DLPs, a constituent kernel of a conflict is a

linear inequality that is the negation of a linear constraint contained in the conflict. Conflict {x≥80,x≤10} → {x≤80},{x≥10} For each unresolved conflict, a set of constituent kernels are generated. Given the sets of constituent kernels from multiple unresolved conflicts, kernels are generated, each of which resolves all the conflicts, by combining the constituent kernels using minimal set covering. A DLP candidate is generated for each kernel. The ones that are propositionally unsatisfiable are pruned and the DLP is simplified, using a fast unit propagation test before solving any relaxed LP.

16

Example

y≤20 x≤10 B1 B2 y≤0 x≥30 x≥80 C1 C2 C3 min -x-3y s.t. x≤200 y≤200 x≤100 min -x-3y s.t. x≤200 y≤200 y≤30

For DLPs

min -x-3y s.t. x≤200 y≤200 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 y≤0 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 x≥30 min -x-3y s.t. x≤200 y≤200 x≤100 x≤10 x≥80 Incumbent

  • 160

root

y≤30 x≤100 A1 A2 {x≤10, x≥30} {x≤10, x≥80} Conflicts: {b2, c2} {b2, c1}

{ b2, c2 } { b2, c1 } Conflicts Constituent Kernels { {¬b2}, {¬c2} } { {¬b2}, {¬c1} } Kernels { ¬b2 } { ¬c1, ¬c2 }

minimize -x-3y s.t. (u1)x≤200 (u2)y≤200 (a1)y≤30 V (a2)x≤100 (b1)y≤20 V (b2)x≤10 (c1)x≥80 V (c2)x≥30 V (c3)y≤0 min -x-3y s.t. x≤200 y≤200 y≤30 y≤20 min -x-3y s.t. x≤200 y≤200 y≤30 y≤0 u1 u2 a1 ¬b2 b1 V b2 c1 V c2 V c3 u1 u2 a1 ¬c1 ¬c2 b1 V b2 c1 V c2 V c3

DLP Candidates

u1 u2 a1 b1 c1 V c2 V c3 u1 u2 a1 c3 b1 V b2

unit propagation

17

Best-First Search (BFS) v.s. Depth-First Search (DFS)

  • BFS is more efficient than DFS in time.
  • BFS can take dramatically more memory space than DFS.
  • With conflict learning and forward conflict-directed search,

BFS takes similar memory space to DFS.

  • The concept of sub-optimality is rooted in maintaining an
  • incumbent. Hence, it can be applied to DFS but not to BFS.

18

Empirical Evaluation

Test problems: Model-based temporal plan execution for cooperative vehicles [Léauté-AAAI-05]. Comparisons: GCD-BB v.s. BIP-BB algorithmic variants of GCD-BB Measures: Computation time - number of LPs & average LP size Memory use - maximum queue length

BI P-BB v.s. GCD-BB

1000 2000 3000 4000 5000 6000 7000 8000 9000 1 2 3 4 Clause / Variable Series1 Series2 Series3

BIP-BB DLP+BFS+Infeas DLP+DFS+Infeas+Sub 80/36 700/144 1492/300 2456/480

slide-4
SLIDE 4

11/27/2006 4

19

Empirical Evaluation

W ithout conflicts v.s. w ith conflicts

500 1000 1500 2000 2500 3000 1 2 3 4 Clause / Variable

80/36 700/144 1492/300 2456/480 DLP+BFS+Without Conflicts DLP+BFS+With Conflicts Backtrack w ith conflicts v.s. Forw ard conflict- directed search

200 400 600 800 1000 1200 1400 1600 1 2 3 4 Clause / Variable

80/36 700/144 1492/300 2456/480 DLP+BFS+Backtrack DLP+BFS+Forward Search 20

Empirical Evaluation

BFS v.s. DFS

100 200 300 400 500 600 700 800 1 2 3 4 Clause / Variable BFS+ Infeas DFS+ Infeas DFS+ Infeas+ Sub DFS+ Sub

80/36 700/144 1492/300 2456/480

BFS v.s. DFS

50 100 150 200 250 300 350 400 450 1 2 3 4 Clause / Variable BFS+ Without Conflicts BFS+ Infeas DFS+ Without Conflicts DFS+ Infeas+ Sub

80/36 700/144 1492/300 2456/480 21

Conclusion

Generalized Conflict-Directed Branch and Bound (GCD-BB) – branch and bound for DLPs – generalized conflict learning – forward conflict-directed search An order of magnitude speed-up over BIP-BB.

22

Future Work

  • Run GCD-BB on a range of well-known benchmark

problems, and compare its actual runtime against that of BIP- BB.

  • Study empirically the reason why sub-optimality conflicts do

not speed up search as much as infeasibility conflicts.

  • Apply GCD-BB to a more general form of HDLOPs than

DLPs.

  • Extend conflict learning to non-linear programming.