knowledge based programs as plans
play

Knowledge-based programs as plans Jrme Lang (LAMSADE, Paris) & - PowerPoint PPT Presentation

Knowledge-based programs as plans Jrme Lang (LAMSADE, Paris) & Bruno Zanuttini (GREYC, Caen) ECAI-2012 + TARK 2013 Jrme Lang, Bruno Zanuttini Knowledge-based plans 1/35 A card game program Goal : pick some cards, maximum 5


  1. Knowledge-based programs as plans Jérôme Lang (LAMSADE, Paris) & Bruno Zanuttini (GREYC, Caen) ECAI-2012 + TARK 2013 Jérôme Lang, Bruno Zanuttini Knowledge-based plans 1/35

  2. A card game program Goal : ◮ pick some cards, maximum 5 ◮ try to obtain three cards of the same rank Do pick a card c look at the rank of c Until three cards of the same rank or know it is impossible Jérôme Lang, Bruno Zanuttini Knowledge-based plans 2/35

  3. A diagnose-and-repair program ◮ three components 1,2,3 ; ◮ propositional symbol ok i : component i is in working order ; ◮ action repair ( i ) : makes ok i true ; ◮ action test ( i ) : returns the truth value of ok i ; ◮ initial knowledge state : K (( ok 1 ↔ ( ok 2 ∧ ok 3 )) ∧ ( ¬ ok 1 ∨ ¬ ok 3 )) ; ◮ Goal : to have the three components working without replacing more components than necessary. While ¬ K ( ok 1 ∧ ok 2 ∧ ok 3 ) do i := smallest integer such that ¬ Kok i ; If ¬ K ¬ ok i then test ( i ) endif ; If K ¬ ok i then replace ( i ) endif Endwhile Jérôme Lang, Bruno Zanuttini Knowledge-based plans 3/35

  4. Outline Knowledge-based programs : ◮ introduced by Fagin, Halpern, Moses and Vardi [1995] ◮ studied for behaviour specification in distributed environments Our work : ◮ using knowledge-based programs as (single-agent) plans reaching some goals described by epistemic formulas ◮ LOFT-12 / ECAI-12 : expressivity and complexity of plan verification ◮ TARK-13 : comparing the succinctness of KBPs to that of standard plans + complexity of plan existence Jérôme Lang, Bruno Zanuttini Knowledge-based plans 4/35

  5. Plan Knowledge-based programs Knowledge-based planning problems Succinctness KBP verification KBP existence Conclusion Jérôme Lang, Bruno Zanuttini Knowledge-based plans 5/35

  6. Knowledge-based programs Knowledge-based planning problems Succinctness KBP verification KBP existence Conclusion Jérôme Lang, Bruno Zanuttini Knowledge-based plans 6/35

  7. Syntax Input ◮ set of propositional variables X = { x 1 , . . . , x n } ◮ Queen( c 1 ), ok 1 . . . ◮ state = truth assignment (unobservable) ◮ set of actions Knowledge-based program π : ◮ action, or ◮ sequence π 1 ; π 2 ; . . . ; π n , or ◮ branching If Φ then π 1 else π 2 , where Φ is a purely subjective S5 formula (Boolean combination of epistemic atoms K ϕ ) ; or ◮ loop While Φ do π 1 , where Φ is a purely subjective S5 formula. Jérôme Lang, Bruno Zanuttini Knowledge-based plans 7/35

  8. Actions Ontic action : ◮ changes the state of the world ◮ possibly nondeterministic + no feedback ◮ propositional symbol x �→ { x , x ′ } ; ◮ x before the action is performed ◮ x ′ after the action is performed ◮ switch( x i ) : Σ = ( x ′ j � = i ( x ′ i ↔ ¬ x i ) ∧ � j ↔ x j ) ◮ x i ← 0 : Σ = ( ¬ x ′ i ) ◮ reinit( x i ) : Σ = � j � = i ( x ′ j ↔ x j ) Epistemic action : ◮ does not change the state of the world ◮ sends back one of several possible observations ◮ test( x i ∨ x j ) : observe x i ∨ x j or observe ¬ ( x i ∨ x j ) ◮ ask-how-much-time-left : observe ( t = 15 mn ) or observe ( t = 10 mn ) or observe ( t = 5 mn ) or observe ( t = 0) Jérôme Lang, Bruno Zanuttini Knowledge-based plans 8/35

  9. Executing a KBP At every step : ◮ current state of variables s t ◮ s 0 = x 1 x 2 ¯ x 3 ◮ current knowledge state M t ◮ M t = { x 1 x 2 x 3 , x 1 ¯ x 2 x 3 , x 1 x 2 ¯ x 3 } ◮ succinct representation O ( x 1 ∧ ( x 2 ∨ x 3 )) : all I know is x 1 ∧ ( x 2 ∨ x 3 ) . Execution : ◮ branching condition / loop : evaluated in M t ◮ ontic action : nondeterministic modification of s t ◮ epistemic action : ◮ no modification of s t ◮ reception of an observation ω Jérôme Lang, Bruno Zanuttini Knowledge-based plans 9/35

  10. Progression Progression by an ontic action : ◮ M t = { x 1 x 2 x 3 , ¯ x 1 ¯ x 2 ¯ x 3 } O (( x 1 ∧ x 2 ∧ x 3 ) ∨ ( ¬ x 1 ∧ ¬ x 2 ∧ ¬ x 3 )) ◮ progression of M t by switch( x 1 ) : M t + 1 = { ¯ x 1 x 2 x 3 , x 1 ¯ x 2 ¯ x 3 } O (( ¬ x 1 ∧ x 2 ∧ x 3 ) ∨ ( x 1 ∧ ¬ x 2 ∧ ¬ x 3 )) ◮ progression of M t + 1 by reinit( x 1 ) : M t + 2 = { x 1 x 2 x 3 , ¯ x 1 x 2 x 3 , x 1 ¯ x 2 ¯ x 3 , ¯ x 1 ¯ x 2 ¯ x 3 } O ( x 2 ↔ x 3 ) Progression by an observation (received after some epistemic action) : ◮ action test( x 1 ∧ x 2 ), observation ¬ ( x 1 ∧ x 2 ) : ◮ progression of M t + 2 by observation ¬ ( x 1 ∧ x 2 ) : M t + 3 = { ¯ x 1 x 2 x 3 , x 1 ¯ x 2 ¯ x 3 , ¯ x 1 ¯ x 2 ¯ x 3 } O (( x 2 ↔ x 3 ) ∧ ¬ ( x 1 ∧ x 2 )) Jérôme Lang, Bruno Zanuttini Knowledge-based plans 10/35

  11. Knowledge-based programs Knowledge-based planning problems Succinctness KBP verification KBP existence Conclusion Jérôme Lang, Bruno Zanuttini Knowledge-based plans 11/35

  12. Classical planning ◮ Set of initial states and goal states (described succinctly) ◮ Set of actions whose effects are described succinctly ◮ Output : standard plan (policy) : ◮ tree or DAG containing observations/actions ◮ branching on current state and observations Jérôme Lang, Bruno Zanuttini Knowledge-based plans 12/35

  13. Knowledge-based planning problems ◮ initial knowledge state initial M 0 : ◮ possibly O ⊤ ◮ must contain the true initial state ◮ goal G (purely subjective epistemic formula) ◮ π valid plan if ◮ terminates ◮ for every possible sequence of states s 0 ∈ M 0 . . . s final ∈ M final we have M final | = G Jérôme Lang, Bruno Zanuttini Knowledge-based plans 13/35

  14. Example ◮ initial knowledge state : O (( ok 1 ↔ ( ok 2 ∧ ok 3 )) ∧ ( ¬ ok 1 ∨ ¬ ok 3 )) ◮ goal knowledge state : K ( ok 1 ∧ ok 2 ∧ ok 3 ) ◮ actions : test( i ), repair( i ) for i = 1 , 2 , 3 Knowledge-based plan : While ¬ K ( ok 1 ∧ ok 2 ∧ ok 3 ) do find the smallest i such that ¬ Kok 1 ; If ¬ K ¬ ok i then test ( i ) ; If K ¬ ok i then replace ( i ) Endwhile Jérôme Lang, Bruno Zanuttini Knowledge-based plans 14/35

  15. Knowledge-based plans vs. standard policies ◮ A standard policy is a KBP in which the last action executed before any branching condition if Φ or while Φ is an epistemic action a such that Φ is one of the possible observations for a . ◮ For every KBP π there exists a standard policy π ′ “equivalent” to π ( π and π ′ have the same execution traces). Expressivity : ◮ there exists a valid knowledge-based for a planning problem P iff there exists a valid standard policy for P Jérôme Lang, Bruno Zanuttini Knowledge-based plans 15/35

  16. Knowledge-based plans vs. policies KBP standard policy replace ( 1 ) ; test ( 2 ) ; If ok ( 2 ) replace ( 3 ) then else replace ( 2 ) ; While ¬ K ( ok 1 ∧ ok 2 ∧ ok 3 ) do test ( 3 ) ; find smallest i such that ¬ Kok 1 ; If ok ( 2 ) If ¬ K ¬ ok i then test ( i ) ; then replace ( 3 ) If K ¬ ok i then replace ( i ) else replace ( 2 ) ; Endwhile test ( 3 ) ; If ¬ ok ( 3 ) then replace ( 3 ) endif endif endif Jérôme Lang, Bruno Zanuttini Knowledge-based plans 16/35

  17. Knowledge-based plans vs. policies : reactivity On-line execution : ◮ standard policy : ◮ move to the subtree corresponding to the observation and execute the next action ◮ constant time ◮ knowledge-based plan : ◮ branching / loop condition : decide M t | = Φ ◮ NP-hard and coNP-hard, in ∆ 2 P Jérôme Lang, Bruno Zanuttini Knowledge-based plans 17/35

  18. Knowledge-based plans vs. policies : succinctness Proposition : unless NP ⊆ P/poly (extremely unlikely), while-free KBPs with atomic branching conditions are exponentially more succinct than while-free standard policies. Proof sketch : ◮ for each n ∈ N we build a polysize KBP π n that “reads” a CNF formula ϕ and either makes sure that it is unsatisfiable or else builds a model of it. ◮ if there is a family of standard policies π ′ n for every n , of size polynomial in | π n | , with π n equivalent to π ′ n , then there is a (possibly nonuniform) polytime algorithm for 3sat , yielding NP ⊆ P/poly. Jérôme Lang, Bruno Zanuttini Knowledge-based plans 18/35

  19. Knowledge-based plans vs. policies : succinctness Proposition : KBPs (with loops) are more succinct than standard policies (with loops). Proof sketch : ◮ there is a polynomial pol and a collection of KBPs ( π n ) n such that | π n | ≤ pol ( n ) and such that π n “counts” up to 2 2 n − 1 (by going once through all knowledge states). ◮ we build a family of planning problems ( P n ) n such that the only valid plans for P n are all equivalent to π n ◮ assume that for all n there is a standard policy π ′ n for P n and | π ′ n | ≤ pol ( n ) | ; then π ′ n can manipulate only pol ( n ) variables, and can have only 2 pol ( n ) . | π ′ n | configurations (states + control points) ; then it cannot count up to 2 2 n − 1, contradiction. Jérôme Lang, Bruno Zanuttini Knowledge-based plans 19/35

  20. Knowledge-based plans vs. policies : succinctness Proposition : KBPs are more succinct than while-free KBPs. Proof sketch : later Jérôme Lang, Bruno Zanuttini Knowledge-based plans 20/35

  21. Knowledge-based programs Knowledge-based planning problems Succinctness KBP verification KBP existence Conclusion Jérôme Lang, Bruno Zanuttini Knowledge-based plans 21/35

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend