SLIDE 1
Observation Decoding with Sensor Models: Recognition Tasks via - - PowerPoint PPT Presentation
Observation Decoding with Sensor Models: Recognition Tasks via - - PowerPoint PPT Presentation
Observation Decoding with Sensor Models: Recognition Tasks via Classical Planning Diego Aineto, Sergio Jimenez, Eva Onaindia October 16, 2020 Universitat Polit` ecnica de Val` encia 1 What is decoding? What is decoding? Decoding : finding
SLIDE 2
SLIDE 3
What is decoding?
Decoding: finding the most likely explanation to some evidence.
2
SLIDE 4
What is decoding?
Decoding: finding the most likely explanation to some evidence. Basic reasoning tool in:
- ”Plan recognition as Planning” (Ramirez and Geffner, 2009).
- ”Diagnosis as Planning Revisited” (Sohrabi et al., 2010).
- ”Counterplanning using Goal Recognition and Landmarks” (Pozanco et al. 2018)
- ”Learning action models with minimal observability” (Aineto et al., 2019)
2
SLIDE 5
What is decoding?
Decoding: finding the most likely explanation to some evidence. Basic reasoning tool in:
- ”Plan recognition as Planning” (Ramirez and Geffner, 2009).
- ”Diagnosis as Planning Revisited” (Sohrabi et al., 2010).
- ”Counterplanning using Goal Recognition and Landmarks” (Pozanco et al. 2018)
- ”Learning action models with minimal observability” (Aineto et al., 2019)
Contributions:
- Formalization of the decoding problem within a probabilistic framework.
- Extension of decoding to support sensor models.
2
SLIDE 6
Motivating example
0.25
- 3
- 2
- 1
Acting agent
0.25 0.25 0.25 0.25
O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) 3
SLIDE 7
Motivating example
0.25
- 3
- 2
- 1
Acting agent
0.25 0.25 0.25 0.25
O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ1 = ((at x0 y2), (at x1 y2), (at x2 y2), (at x3 y2), (at x4 y2)) 4
SLIDE 8
Motivating example
0.25
- 3
- 2
- 1
Observer Acting agent
0.25 0.25 0.25 0.25 0.9 0.1 1
O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ1 = ((at x0 y2), (at x1 y2), (at x2 y2), (at x3 y2), (at x4 y2)) 5
SLIDE 9
Motivating example
0.25
- 3
- 2
- 1
Observer Acting agent
0.25 0.25 0.25 0.25 0.9 0.1 1
O = (loc = (0, 2), loc = (1, 2), loc = (4, 2) τ2 = ((at x0 y2), (at x1 y2), (at x1 y1), (at x2 y1), (at x3 y1), (at x4 y1), (at x4 y2)) 6
SLIDE 10
Problem Definition
SLIDE 11
Probabilistic Framework
Mp τ O Ms
Planning Model Trajectory Observations Sensor Model Synthesis Sensing
7
SLIDE 12
Sensor Model and Observations
A sensor model Ms = X, Y , Φ
- X are the state variables.
- Y are the observable variables.
- Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
- exhaustive (
c∈Ci Sc = S), and
- exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)
8
SLIDE 13
Sensor Model and Observations
A sensor model Ms = X, Y , Φ
- X are the state variables.
- Y are the observable variables.
- Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
- exhaustive (
c∈Ci Sc = S), and
- exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)
Blindspots example Clear tile (x ≥ 2): floc(atx,y, loc = (x, y)) = 0.9, floc(atx,y, loc = ǫ) = 0.1 Blindspot tile (x ≤ 1): floc(atx,y, loc = ǫ) = 1
8
SLIDE 14
Sensor Model and Observations
A sensor model Ms = X, Y , Φ
- X are the state variables.
- Y are the observable variables.
- Φ is the set of sensing functions fi : Ci × Yi → [0, 1]
- exhaustive (
c∈Ci Sc = S), and
- exclusive (Sc ∩ Sc′ = ∅, ∀c, c′ ∈ Ci)
An observation o = Y1 = w1, . . . , Y|Y | = w|Y | is a full assignment of Y .
8
SLIDE 15
The Observation Decoding Problem
An observation decoding problem is a triplet D = Mp, Ms, O where:
- Mp = X, A is a planning model,
- Ms = X, Y , Φ is a sensor model, and
- O = o0, o1, . . . , om is an input observation sequence.
The solution to D = Mp, Ms, O is the most likely trajectory τ ∗ defined as τ ∗ = arg max
τ∈T
P(O, τ|Mp, Ms),
9
SLIDE 16
Synthesis and Sensing Probabilities
τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) = arg maxτ∈T P(τ|Mp)P(O|τ, Ms)
Mp τ O Ms
Planning Model Trajectory Observations Sensor Model Synthesis Sensing
Synthesis probability The probability of generating τ with Mp: P(τ|Mp) = P(s0)
|τ|
- i=1
P(si|si−1, Mp), (1) Sensing probability The probability of perceiving O from τ: P(O|τ, Ms) =
|τ|
- i=1
P(oi|si, Ms), (2)
10
SLIDE 17
Observation decoding via Classical Planning
SLIDE 18
Compilation
From probability maximization to cost minimization: τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) → τ ∗ = arg minτ∈T − log P(O, τ|Mp, Ms) Compile D = Mp, Ms, O to a planning problem P′ = F ′, A′, I ′, G ′ such that A′ = At ∪ Ae where:
- transition actions At are the cost-normalized versions of A
- sensing actions Ae to process an observation
11
SLIDE 19
Compilation
From probability maximization to cost minimization: τ ∗ = arg maxτ∈T P(O, τ|Mp, Ms) → τ ∗ = arg minτ∈T − log P(O, τ|Mp, Ms) Compile D = Mp, Ms, O to a planning problem P′ = F ′, A′, I ′, G ′ If π is a solution plan for P′ then:
- cost(πt) = − log P(τ π|Mp).
- cost(πe) = − log P(O|τ π, Mp).
- cost(π) = − log P(O, τ π|Mp, Ms).
11
SLIDE 20
Sensing Actions
Ae contains a sensek action for each observation ok ∈ O
- Implement an acceptor automaton for trajectories that satisfy the observation.
- Accumulate − log P(O|τ, Ms)
sensed0 sensed1
...
sensedK-1 sensedK
start
sense1 senseK
guard(sensek) := P(ok|si, Ms) > 0 reset(sensek) := x+ = x − log P(ok|si, Ms)
12
SLIDE 21
Sensing Actions
O = (loc = (0, 2), loc = (1, 2), loc = (4, 2)
pre(sense2) sensed1 eff(sense2) sensed2∧ when (at x1 y2) increase total cost − log (0.9) when (not (at x1 y2)) (deadend)
13
SLIDE 22
Experimental Evaluation
SLIDE 23
Setup
Evaluate the effectiveness of using a sensor model for decoding.
- ODN: optimal plan that satisfies the observation.
- ODS: the approach presented here.
Metric: plan diversity1 δα(πi, πj) = |Si − Sj| |Si| + |Sj| + |Sj − Si| |Si| + |Sj|
1”Domain independent approaches for finding diverse plans” (Srivastava et al., 2007).
14
SLIDE 24
Results
Domain H L ODS ODN 100 0.03 0.18 Blindspots 80 20 0.08 0.20 60 40 0.11 0.17 100 0.58 Intrusion 80 20 0.07 0.18 60 40 0.13 0.14 100 0.34 Blocks 2h 80 20 0.05 0.27 60 40 0.07 0.26 100 0.58 Office 80 20 0.23 0.38 60 40 0.16 0.23
H: Observability of the high observability region L: Observability of the low observability region ODS : δα(π, πS) ODN : δα(π, πN) 15
SLIDE 25
Conclusions
SLIDE 26
Conclusions
- Formalization of the decoding problem within a probabilistic framework.
- Extension of decoding to support sensor models.
- Unifying probabilistic framework (future work).