Conditional Planning Section 11.3 Sec. 11.3 p.1/18 Outline Fully - PowerPoint PPT Presentation

Conditional Planning Section 11.3 Sec. 11.3 – p.1/18

Outline Fully observable environments Partially observable environments Conditional POP Sec. 11.3 – p.2/18

Uncertainty The agent might not know what the initial state is The agent might not know the outcome of its actions The plans will have branches rather than being straight line plans, includes conditional steps → → if < test > then plan A else plan B Full observability : The agent knows what state it currently is, does not have to execute an observation action Simply get plans ready for all possible contingencies Sec. 11.3 – p.3/18

Modeling uncertainty Actions sometimes fail → disjunctive effects Example: moving left sometimes fails Action ( Left , P RECOND : AtR , E FFECT : AtL ∨ AtR ) Conditional effects : effects are conditioned on secondary preconditions Action ( Suck , P RECOND : ;, E FFECT : ( when AtL: CleanL ) ∧ ( when AtR: CleanR )) Actions may have both disjunctive and conditional effects: Moving sometimes dumps dirt on the destination square only when that square is clean Action ( Left , P RECOND : AtR ;, E FFECT : AtL ∨ ( AtL ∧ when CleanL: ¬ CleanL )) Sec. 11.3 – p.4/18

The vacuum world example Double Murphy world the vacuum cleaner sometimes deposits dirt when it moves to a clean destination square sometimes deposits dirt if S UCK is applied to a clean square The agent is playing a game against nature Sec. 11.3 – p.5/18

Perform and-or search Left Suck GOAL Suck LOOP Right Suck Left GOAL LOOP Sec. 11.3 – p.6/18

The plan In the “double-Murphy” vacuum world, the plan is: [ Left , if AtL ∧ CleanL ∧ CleanR then [ ] else Suck ] Sec. 11.3 – p.7/18

And-or Search Algorithm function A ND -O R -G RAPH -S EARCH ( problem ) returns a conditional plan , or failure O R -S EARCH ( I NITIAL -S TATE [ problem ], problem , []) function O R -S EARCH ( state, problem, path ) returns a conditional plan , or failure if G OAL -T EST [ problem ]( state ) then return the empty plan if state is on path then return failure for each action, state-set in S UCCESSORS [ problem ]( state ) do plan ← A ND -S EARCH ( state, problem , [ state | path ]) if plan � = failure then return [ action | plan ] return failure Sec. 11.3 – p.8/18

And-or Search Algorithm function A ND -S EARCH ( state-set, problem, path ) returns a conditional plan , or failure for each s i in state-set do plan i ← O R -S EARCH ( S i , problem, path ) if plan = failure then return failure return **[ if s 1 **[ if then plan 1 **[ if else if s 2 **[ if else if then plan 2 **[ if else if else . . . if s n − 1 **[ if else if else . . . if then plan n − 1 **[ if else if else . . . if else plan n ] Sec. 11.3 – p.9/18

Triple Murhpy vacuum world The vacuum cleaner sometimes deposits dirt when it moves to a clean destination square It sometimes deposits dirt if suck is applied to a clean square + move sometimes fails Sec. 11.3 – p.10/18

First level of the search Left Suck GOAL Sec. 11.3 – p.11/18

Triple Murphy vacuum world No acyclic solutions A cyclic solution is to try going left until it works. Use a label . [ L 1 : Left , if atR then L 1 else if CleanL then [] else Suck ] Sec. 11.3 – p.12/18

Partially observable environments The agent knows only a certain amount of the actual state (e.g., local sensing only, does not know about the other squares) Automatic sensing : at every time step the agent gets all the available percepts Active sensing : percepts are obtained only by executing specific sensory actions Belief state : The set of possible states that the agent can be in “Alternate double Murphy world”: dirt can sometimes be left behind when the agent leaves a clean square Sec. 11.3 – p.13/18

Part of the search Left CleanL ~CleanL Suck Right CleanR ~CleanR Suck Sec. 11.3 – p.14/18

Conditional POP (CNLP algorithm) INIT atL cleanL cleanR LEFT atL ~cleanL cleanL atL cleanR Dangling Edge GOAL A Sec. 11.3 – p.15/18

Conditional POP (CNLP algorithm) INIT atL cleanL cleanR LEFT atL ~cleanL cleanL atL cleanR GOAL A cleanL atL Duplicate the goal cleanR and label it GOAL B Sec. 11.3 – p.16/18

Conditional POP (CNLP algorithm) INIT atL cleanL cleanR LEFT atL ~cleanL cleanL atL cleanR GOAL SUCK A cleanL atL cleanR GOAL B Sec. 11.3 – p.17/18

Comments Classical planning is NP Conditional planning is harder than NP Had to go back to state space search Many problems are intractable Sec. 11.3 – p.18/18

Conditional Planning Section 11.3 Sec. 11.3 p.1/18 Outline Fully - PowerPoint PPT Presentation

Conditional Planning Section 11.3 Sec. 11.3 p.1/18 Outline Fully observable environments Partially observable environments Conditional POP Sec. 11.3 p.2/18 Uncertainty The agent might not know what the initial state is The agent

Review: Conditional Probability Conditional Probability The conditional probability of event

11/15/16 Conditional distributions Let X and Y be discrete r.v.s. Conditional probability mass

Conditional Statements Python Conditional Statements Sometimes a statement (or a block of

Markov random fields 2. conditional specifications 3. conditional auto-regression Rasmus

Formal Modeling in Cognitive Science Independence Lecture 23: Conditional Probability; Bayes

Conditional Sentences as Conditional Speech Acts Workshop Questioning Speech Acts Universitt

Conditional Probability & Independence Conditional Probabilities Question : How should we

P( ) 1 conditional probability where P(F) > 0 Conditional probability of E given F:

Multiscale Conditional 1) Generalization of conditional random fields (CRF) to multiscale

15. The Conditional 15.1 The conditional: Formation and uses 15.2 Mise en pratique 15.1 The

Conditional Probability & Independence Conditional Probabilities Question : How should we

Conditional Random Fields [Hanna M. Wallach, Conditional Random Fields: An Introduction,

Blended Conditional Gradients: The unconditioning of conditional gradients Joint work with Gabor

Protocol for Booleans ifTrue:ifFalse: trueBlock falseBlock Full conditional Part conditional

Topic 6 Conditional Probability and Independence Conditional Probability 1 / 9 Definition The

ENGLISH IN ACTION IV Stage 3: Personal Insights Content of the Presentation Conditional

Reducing the Cost of Conditional Transfers of Control by Using Comparison Specifications May 30,

Lexical Analyzer Scanner ALSU Textbook Chapter 3.13.4, 3.6, 3.7, 3.5, 3.8 Tsan-sheng Hsu

Bone s a nd Ske le ta l T issue s The Skeleton What are the components of the skeletal

Apps data data data learning Locality Filtering PageRank, Recommen sensitive data SVM

Truth Conditional Meaning of Sentences Ling324; Fall 2004; Chung-hye Han Reading: Meaning and

Tensor Methods for Feature Learning Anima Anandkumar U.C. Irvine Feature Learning For Efficient

Live-Range Reordering Sven Verdoolaege 1 Albert Cohen 2 1 Polly Labs and KU Leuven 2 INRIA and

cedram Math literature Math E-literature DML Implementation Conclusions Outline The

Conditional Planning Section 11.3 Sec. 11.3 p.1/18 Outline Fully - PowerPoint PPT Presentation

Conditional Planning Section 11.3 Sec. 11.3 p.1/18 Outline Fully observable environments Partially observable environments Conditional POP Sec. 11.3 p.2/18 Uncertainty The agent might not know what the initial state is The agent

Review: Conditional Probability Conditional Probability The conditional probability of event

11/15/16 Conditional distributions Let X and Y be discrete r.v.s. Conditional probability mass

Conditional Statements Python Conditional Statements Sometimes a statement (or a block of

Markov random fields 2. conditional specifications 3. conditional auto-regression Rasmus

Formal Modeling in Cognitive Science Independence Lecture 23: Conditional Probability; Bayes

Conditional Sentences as Conditional Speech Acts Workshop Questioning Speech Acts Universitt

Conditional Probability &amp; Independence Conditional Probabilities Question : How should we

P( ) 1 conditional probability where P(F) &gt; 0 Conditional probability of E given F:

Multiscale Conditional 1) Generalization of conditional random fields (CRF) to multiscale

15. The Conditional 15.1 The conditional: Formation and uses 15.2 Mise en pratique 15.1 The

Conditional Probability &amp; Independence Conditional Probabilities Question : How should we

Conditional Random Fields [Hanna M. Wallach, Conditional Random Fields: An Introduction,

Blended Conditional Gradients: The unconditioning of conditional gradients Joint work with Gabor

Protocol for Booleans ifTrue:ifFalse: trueBlock falseBlock Full conditional Part conditional

Topic 6 Conditional Probability and Independence Conditional Probability 1 / 9 Definition The

ENGLISH IN ACTION IV Stage 3: Personal Insights Content of the Presentation Conditional

Reducing the Cost of Conditional Transfers of Control by Using Comparison Specifications May 30,

Lexical Analyzer Scanner ALSU Textbook Chapter 3.13.4, 3.6, 3.7, 3.5, 3.8 Tsan-sheng Hsu

Bone s a nd Ske le ta l T issue s The Skeleton What are the components of the skeletal

Apps data data data learning Locality Filtering PageRank, Recommen sensitive data SVM

Truth Conditional Meaning of Sentences Ling324; Fall 2004; Chung-hye Han Reading: Meaning and

Tensor Methods for Feature Learning Anima Anandkumar U.C. Irvine Feature Learning For Efficient

Live-Range Reordering Sven Verdoolaege 1 Albert Cohen 2 1 Polly Labs and KU Leuven 2 INRIA and

cedram Math literature Math E-literature DML Implementation Conclusions Outline The

Conditional Probability & Independence Conditional Probabilities Question : How should we

P( ) 1 conditional probability where P(F) > 0 Conditional probability of E given F:

Conditional Probability & Independence Conditional Probabilities Question : How should we