Observation Decoding with Sensor Models: Recognition Tasks via - - PowerPoint PPT Presentation

observation decoding with sensor models recognition tasks
SMART_READER_LITE
LIVE PREVIEW

Observation Decoding with Sensor Models: Recognition Tasks via - - PowerPoint PPT Presentation

Observation Decoding with Sensor Models: Recognition Tasks via Classical Planning Diego Aineto, Sergio Jimenez, Eva Onaindia October 16, 2020 Universitat Polit` ecnica de Val` encia 1 What is decoding? What is decoding? Decoding : finding


slide-1
SLIDE 1

Observation Decoding with Sensor Models: Recognition Tasks via Classical Planning

Diego Aineto, Sergio Jimenez, Eva Onaindia October 16, 2020

Universitat Polit` ecnica de Val` encia 1

slide-2
SLIDE 2

What is decoding?

slide-3
SLIDE 3

What is decoding?

Decoding: finding the most likely explanation to some evidence.

2

slide-4
SLIDE 4

What is decoding?

Decoding: finding the most likely explanation to some evidence. Basic reasoning tool in:

  • ”Plan recognition as Planning” (Ramirez and Geffner, 2009).
  • ”Diagnosis as Planning Revisited” (Sohrabi et al., 2010).
  • ”Counterplanning using Goal Recognition and Landmarks” (Pozanco et al. 2018)
  • ”Learning action models with minimal observability” (Aineto et al., 2019)

2

slide-5
SLIDE 5

What is decoding?

Decoding: finding the most likely explanation to some evidence. Basic reasoning tool in:

  • ”Plan recognition as Planning” (Ramirez and Geffner, 2009).
  • ”Diagnosis as Planning Revisited” (Sohrabi et al., 2010).
  • ”Counterplanning using Goal Recognition and Landmarks” (Pozanco et al. 2018)
  • ”Learning action models with minimal observability” (Aineto et al., 2019)

Contributions:

  • Formalization of the decoding problem within a probabilistic framework.
  • Extension of decoding to support sensor models.

2

slide-6
SLIDE 6

Motivating example

0.25

  • 3
  • 2
  • 1

Acting agent

0.25 0.25 0.25 0.25

O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) 3

slide-7
SLIDE 7

Motivating example

0.25

  • 3
  • 2
  • 1

Acting agent

0.25 0.25 0.25 0.25

O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ1 = ((at x0 y2), (at x1 y2), (at x2 y2), (at x3 y2), (at x4 y2)) 4

slide-8
SLIDE 8

Motivating example

0.25

  • 3
  • 2
  • 1

Observer Acting agent

0.25 0.25 0.25 0.25 0.9 0.1 1

O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ1 = ((at x0 y2), (at x1 y2), (at x2 y2), (at x3 y2), (at x4 y2)) 5

slide-9
SLIDE 9

Motivating example

0.25

  • 3
  • 2
  • 1

Observer Acting agent

0.25 0.25 0.25 0.25 0.9 0.1 1

O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ2 = ((at x0 y2), (at x1 y2), (at x1 y1), (at x2 y1), (at x3 y1), (at x4 y1), (at x4 y2)) 6

slide-10
SLIDE 10

Problem Definition

slide-11
SLIDE 11

Probabilistic Framework

Mp τ O Ms

Planning Model Trajectory Observations Sensor Model Synthesis Sensing

7

slide-12
SLIDE 12

Sensor Model and Observations

A sensor model Ms = X, Y , Φ

  • X are the state variables.
  • Y are the observable variables.
  • Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
  • exhaustive (

c∈Ci Sc = S), and

  • exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)

8

slide-13
SLIDE 13

Sensor Model and Observations

A sensor model Ms = X, Y , Φ

  • X are the state variables.
  • Y are the observable variables.
  • Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
  • exhaustive (

c∈Ci Sc = S), and

  • exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)

Blindspots example Clear tile (x ≥ 2): floc(atx,y, loc = (x, y)) = 0.9, floc(atx,y, loc = ǫ) = 0.1 Blindspot tile (x ≤ 1): floc(atx,y, loc = ǫ) = 1

8

slide-14
SLIDE 14

Sensor Model and Observations

A sensor model Ms = X, Y , Φ

  • X are the state variables.
  • Y are the observable variables.
  • Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
  • exhaustive (

c∈Ci Sc = S), and

  • exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)

An observation o = Y1 = w1, . . . , Y|Y | = w|Y | is a full assignment of Y .

8

slide-15
SLIDE 15

The Observation Decoding Problem

An observation decoding problem is a triplet D = Mp, Ms, O where:

  • Mp = X, A is a planning model,
  • Ms = X, Y , Φ is a sensor model, and
  • O = o0, o1, . . . , om is an input observation sequence.

The solution to D = Mp, Ms, O is the most likely trajectory τ ∗ defined as τ ∗ = arg max

τ∈T

P(O, τ|Mp, Ms),

9

slide-16
SLIDE 16

Synthesis and Sensing Probabilities

τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) = arg maxτ∈T P(τ|Mp)P(O|τ, Ms)

Mp τ O Ms

Planning Model Trajectory Observations Sensor Model Synthesis Sensing

Synthesis probability The probability of generating τ with Mp: P(τ|Mp) = P(s0)

|τ|

  • i=1

P(si|si−1, Mp), (1) Sensing probability The probability of perceiving O from τ: P(O|τ, Ms) =

|τ|

  • i=1

P(oi|si, Ms), (2)

10

slide-17
SLIDE 17

Observation decoding via Classical Planning

slide-18
SLIDE 18

Compilation

From probability maximization to cost minimization: τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) → τ ∗ = arg minτ∈T − log P(O, τ|Mp, Ms) Compile D = Mp, Ms, O to a planning problem P′ = F ′, A′, I ′, G ′ such that A′ = At ∪ Ae where:

  • transition actions At are the cost-normalized versions of A
  • sensing actions Ae to process an observation

11

slide-19
SLIDE 19

Compilation

From probability maximization to cost minimization: τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) → τ ∗ = arg minτ∈T − log P(O, τ|Mp, Ms) Compile D = Mp, Ms, O to a planning problem P′ = F ′, A′, I ′, G ′ If π is a solution plan for P′ then:

  • cost(πt) = − log P(τ π|Mp).
  • cost(πe) = − log P(O|τ π, Mp).
  • cost(π) = − log P(O, τ π|Mp, Ms).

11

slide-20
SLIDE 20

Sensing Actions

Ae contains a sensek action for each observation ok ∈ O

  • Implement an acceptor automaton for trajectories that satisfy the observation.
  • Accumulate − log P(O|τ, Ms)

sensed0 sensed1

...

sensedK-1 sensedK

start

sense1 senseK

guard(sensek) := P(ok|si, Ms) > 0 reset(sensek) := x+ = x − log P(ok|si, Ms)

12

slide-21
SLIDE 21

Sensing Actions

O = (loc = (0, 2), loc = (1, 2), loc = (4, 2)

pre(sense2) sensed1 eff(sense2) sensed2∧ when (at x1 y2) increase total cost − log (0.9) when (not (at x1 y2)) (deadend)

13

slide-22
SLIDE 22

Experimental Evaluation

slide-23
SLIDE 23

Setup

Evaluate the effectiveness of using a sensor model for decoding.

  • ODN: optimal plan that satisfies the observation.
  • ODS: the approach presented here.

Metric: plan diversity1 δα(πi, πj) = |Si − Sj| |Si| + |Sj| + |Sj − Si| |Si| + |Sj|

1”Domain independent approaches for finding diverse plans” (Srivastava et al., 2007).

14

slide-24
SLIDE 24

Results

Domain H L ODS ODN 100 0.03 0.18 Blindspots 80 20 0.08 0.20 60 40 0.11 0.17 100 0.58 Intrusion 80 20 0.07 0.18 60 40 0.13 0.14 100 0.34 Blocks 2h 80 20 0.05 0.27 60 40 0.07 0.26 100 0.58 Office 80 20 0.23 0.38 60 40 0.16 0.23

H: Observability of the high observability region L: Observability of the low observability region ODS : δα(π, πS) ODN : δα(π, πN) 15

slide-25
SLIDE 25

Conclusions

slide-26
SLIDE 26

Conclusions

  • Formalization of the decoding problem within a probabilistic framework.
  • Extension of decoding to support sensor models.
  • Unifying probabilistic framework (future work).

16