Logics for Markov Decision Processes Pedro S anchez Terraf Joint - - PowerPoint PPT Presentation

logics for markov decision processes
SMART_READER_LITE
LIVE PREVIEW

Logics for Markov Decision Processes Pedro S anchez Terraf Joint - - PowerPoint PPT Presentation

Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Logics for Markov Decision Processes Pedro S anchez Terraf Joint work with P .R. DArgenio and N. Wolovick SLALM, UniAndes, 04 / 06 /


slide-1
SLIDE 1

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Logics for Markov Decision Processes

Pedro S´ anchez Terraf Joint work with P .R. D’Argenio and N. Wolovick SLALM, UniAndes, 04 / 06 / 2012

slide-2
SLIDE 2

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Contents

1

Introduction Labelled Transition Systems (LTS) Modal Logics

2

Labelled Markov Processes (LMP) and its Non Deterministic version Analytic Spaces and Unique Structure Proving Completeness

3

Results Logics for non-deterministic processes Some counterexamples

4

Future Work

slide-3
SLIDE 3

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

A toy model

Labelled Transition Systems (LTS)

S,L,T such that Ta : S → Pow(S) for each a ∈ L.

slide-4
SLIDE 4

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

A toy model

Labelled Transition Systems (LTS)

S,L,T such that Ta : S → Pow(S) for each a ∈ L.

Zig-zag morphism A surjective f : S → S′ such that for all a ∈ L and every s ∈ S, Pow(f)◦Ta = T′

a ◦f .

lts02.jpg

slide-5
SLIDE 5

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

A toy model

Labelled Transition Systems (LTS)

S,L,T such that Ta : S → Pow(S) for each a ∈ L.

Zig-zag morphism A surjective f : S → S′ such that for all a ∈ L and every s ∈ S, Pow(f)◦Ta = T′

a ◦f .

lts02.jpg

We say that s simulates t because

s can perform every “sequence of

actions” that t can.

slide-6
SLIDE 6

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Simulation and Bisimulation on LTS

Simulation It is a relation R such that if s1 Rt1 and t1

a

→ t2 then there is s2 such

that s1

a

→ s2 and s2 Rt2. In that case we say that s1 simulates s2.

slide-7
SLIDE 7

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Simulation and Bisimulation on LTS

Simulation It is a relation R such that if s1 Rt1 and t1

a

→ t2 then there is s2 such

that s1

a

→ s2 and s2 Rt2. In that case we say that s1 simulates s2.

Bisimulation It is a symmetric simulation. We’ll say that s1 is bisimilar to t1 if there exists a bisimulation R such that s1 Rt1.

lts12.jpg

Note: Bisimulation is finer than “double simulation”. That’s to say, if

s1 is bisimilar to t1, then s1

simulates t1 and t1 simulates s1, but not conversely.

slide-8
SLIDE 8

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Coalgebraic presentation of processes and bisimulation

One categorical counterpart of a relation is a span of morphisms Bisimilarity (span)

S

f

  • g
  • S1

S2

slide-9
SLIDE 9

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Coalgebraic presentation of processes and bisimulation

One categorical counterpart of a relation is a span of morphisms Bisimilarity (span)

S

f

  • g
  • S1

S2

Behavioral equivalence (cospan)

S1

  • S2
  • T

There is a correspondence between cospans and logics

slide-10
SLIDE 10

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Coalgebraic presentation of processes and bisimulation

One categorical counterpart of a relation is a span of morphisms Bisimilarity (span)

S

f

  • g
  • S1

S2

Behavioral equivalence (cospan)

S1

  • S2
  • T

There is a correspondence between cospans and logics Semipullbacks A category has semipullbacks if every cospan can be completed to a commutative diagram with a span.

slide-11
SLIDE 11

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Labelled Transition Systems (LTS)

Coalgebraic presentation of processes and bisimulation

One categorical counterpart of a relation is a span of morphisms Bisimilarity (span)

S

f

  • g
  • S1

S2

Behavioral equivalence (cospan)

S1

  • S2
  • T

There is a correspondence between cospans and logics Semipullbacks A category has semipullbacks if every cospan can be completed to a commutative diagram with a span. It is the Amalgamation Property in the opposite category.

slide-12
SLIDE 12

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Modal Logics

Logics for Bisimulation

Hennessy-Milner Logic (HML)

ϕ ≡ ⊤ | ¬ϕ |

  • i

ϕi | aψ

slide-13
SLIDE 13

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Modal Logics

Simulation and Bisimulation on LTS

Simulation It is a relation R such that if s1 Rt1 and t1

a

→ t2 then there is s2 such

that s1

a

→ s2 and s2 Rt2. In that case we say that s1 simulates s2.

Bisimulation It is a symmetric simulation. We’ll say that s1 is bisimilar to t1 if there exists a bisimulation R such that s1 Rt1.

lts12.jpg

“t1 can make an a-transition after which a c-transition is not possible”.

t1 | = a¬c⊤ s1 | = a¬c⊤

slide-14
SLIDE 14

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Modal Logics

Logics for Bisimulation

Hennessy-Milner Logic (HML)

ϕ ≡ ⊤ | ¬ϕ |

  • i

ϕi | aψ

Logical Characterization of Bisimulation Two states in a LTS are bisimilar iff they satisfy the same HML formulas.

slide-15
SLIDE 15

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Labelled Markov Processes (LMP) and Non Determinism

LMP (Desharnais et al.)

S,S,L,t such that ta(s) ∈ P(S) for each s ∈ S and a ∈ L, where S,S is a measurable space; P(S) is the space of (sub)probability measures over S,S; ta : S → P(S) is measurable.

slide-16
SLIDE 16

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Labelled Markov Processes (LMP) and Non Determinism

LMP (Desharnais et al.)

S,S,L,t such that ta(s) ∈ P(S) for each s ∈ S and a ∈ L, where S,S is a measurable space; P(S) is the space of (sub)probability measures over S,S; ta : S → P(S) is measurable.

NLMP (D’Argenio and Wolovick)

S,S,L,T such that Ta(s) ⊆ P(S) para each s ∈ S y a ∈ L, where: S,S, P(S) as before;

For each s, Ta(s) is measurable. I.e., Ta : S → P(S).

Ta : S → P(S) is a measurable map.

slide-17
SLIDE 17

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Analytic Spaces and Unique Structure

A pinch of Descriptive Set Theory: Analytic Spaces

Definition An analytic topological space is the continuous image of a Borel set (v.g., of reals).

slide-18
SLIDE 18

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Analytic Spaces and Unique Structure

A pinch of Descriptive Set Theory: Analytic Spaces

Definition An analytic topological space is the continuous image of a Borel set (v.g., of reals). An measurable space is analytic if it is isomorphic to A,B(A) for some analytic topological space A.

slide-19
SLIDE 19

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Analytic Spaces and Unique Structure

A pinch of Descriptive Set Theory: Analytic Spaces

Definition An analytic topological space is the continuous image of a Borel set (v.g., of reals). An measurable space is analytic if it is isomorphic to A,B(A) for some analytic topological space A. Examples The convex hull of a Borel set in Rn; The relation of isomorphism between countable structures.

slide-20
SLIDE 20

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Analytic Spaces and Unique Structure

A pinch of Descriptive Set Theory: Analytic Spaces

Definition An analytic topological space is the continuous image of a Borel set (v.g., of reals). An measurable space is analytic if it is isomorphic to A,B(A) for some analytic topological space A. Examples The convex hull of a Borel set in Rn; The relation of isomorphism between countable structures. Unique Structure Theorem If a sub-σ-algebra S ⊆ B(A) is countably generated and separates points, then it is B(A).

slide-21
SLIDE 21

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Proving Completeness

Logics for bisimulation on LMP

HMLq (Larsen and Skou, Danos et al.)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | aqϕ, q ∈ Q

slide-22
SLIDE 22

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Proving Completeness

Logics for bisimulation on LMP

HMLq (Larsen and Skou, Danos et al.)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | aqϕ, q ∈ Q

Logical Characterization of Bisimulation for LMP (Danos et al.) Two states in a LMP S,S,L,t with S,S analytic are bisimilar iff they satisfy the same HMLq formulas.

slide-23
SLIDE 23

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Proving Completeness

Logics for bisimulation on LMP

HMLq (Larsen and Skou, Danos et al.)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | aqϕ, q ∈ Q

Logical Characterization of Bisimulation for LMP (Danos et al.) Two states in a LMP S,S,L,t with S,S analytic are bisimilar iff they satisfy the same HMLq formulas. Proof Strategy (D’Argenio, Celayes, PST) This results holds for every process with an analytic state space and a logic L that satisfies: 1) L it contains ⊤ and ∧; 2) for every ϕ ∈ L, ϕ is measurable; 3) L is countable; and 4) L separates transitions “locally”.

slide-24
SLIDE 24

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Logics for non-deterministic processes

Logics for bisimulation on LMP

Lf (D’Argenio et. al)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | a{ϕi,pi}n

i=1,

pi ∈ Q, n ∈ N

slide-25
SLIDE 25

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Logics for non-deterministic processes

Logics for bisimulation on LMP

Lf (D’Argenio et. al)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | a{ϕi,pi}n

i=1,

pi ∈ Q, n ∈ N

The proof strategy immediately gives Logical Characterization of Bisimulation for image finite NLMP Two states in a image finite NLMP S,S,L,t with S,S analytic are bisimilar iff they satisfy the same Lf formulas.

slide-26
SLIDE 26

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Logics for non-deterministic processes

Logics for bisimulation on LMP

Lf (D’Argenio et. al)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | a{ϕi,pi}n

i=1,

pi ∈ Q, n ∈ N

The proof strategy immediately gives Logical Characterization of Bisimulation for image finite NLMP Two states in a image finite NLMP S,S,L,t with S,S analytic are bisimilar iff they satisfy the same Lf formulas.

∆ (D’Argenio et. al) ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | aψ ψ ≡

i∈I ψi | ¬ψ | [ϕ]≥q

slide-27
SLIDE 27

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Some counterexamples

Some counterexamples

Analiticity is necessary The category of LMP over arbitrary measurable spaces does not have semipullbacks and HMLq does not characterize bisimilarity (PST, Inf&

  • Comp. 209 2011)
slide-28
SLIDE 28

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work Some counterexamples

Some counterexamples

Analiticity is necessary The category of LMP over arbitrary measurable spaces does not have semipullbacks and HMLq does not characterize bisimilarity (PST, Inf&

  • Comp. 209 2011)

At least image-countable is necessary For NLMP over analytic spaces (D’Argenio, PST, Wolovick, Math.Struct.Comp.Sci, 22 2009).

slide-29
SLIDE 29

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Logics for bisimulation on LMP

HMLq (Larsen and Skou, Danos et al.)

ϕ ≡ ⊤ | ϕ1 ∧ϕ2 | aqϕ, q ∈ Q

Logical Characterization of Bisimulation for LMP (Danos et al.) Two states in a LMP S,S,L,t with S,S analytic are bisimilar iff they satisfy the same HMLq formulas. Proof Strategy (D’Argenio, Celayes, PST) This results holds for every process with an analytic state space and a logic L that satisfies: 1) L it contains ⊤ and ∧; 2) for every ϕ ∈ L, ϕ is measurable; 3) L is countable; and 4) L separates transitions “locally”.

slide-30
SLIDE 30

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Future Work

To decide whether there is a nice logical characterization of bisimulation for countable NLMP . Is there a countable logic for countable LTS?

slide-31
SLIDE 31

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Future Work

To decide whether there is a nice logical characterization of bisimulation for countable NLMP . Is there a countable logic for countable LTS? If possible, to extend the logical characterization to Radon spaces

S,S (i.e., S ⊆ universally measurable sets).

slide-32
SLIDE 32

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

Thank You!

slide-33
SLIDE 33

uncubuntu Introduction Labelled Markov Processes (LMP) and its Non Deterministic version Results Future Work

References

[2006] V. DANOS, J. DESHARNAIS, F. LAVIOLETTE, AND

  • P. PANANGADEN

Bisimulation and cocongruence for probabilistic systems.

  • Inf. & Comp., vol. 204, pp. 503–523.

[1999] J. DESHARNAIS Labelled Markov Processes. Ph.D. dissertation, McGill University. [1991] K. LARSEN AND A. SKOU Bisimulation through Probabilistic Testing,

  • Inf. & Comp., vol. 94, pp. 1–28.