Markov chains and Markov decision processes in Isabelle/HOL
Johannes Hölzl January 2016
TU München, Germany
Markov chains and Markov decision processes in Isabelle/HOL - - PowerPoint PPT Presentation
Johannes Hlzl January 2016 TU Mnchen, Germany Markov chains and Markov decision processes in Isabelle/HOL Introduction Coalgebraic view on transition systems Fixed points to define queries on trace space Formalize probabilistic
TU München, Germany
3
3
3
3
3
3
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
5
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
5
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
n
n
6
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
n
n
6
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
n
n
6
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
6
1 2 1 2 1 2 1 2 1 3 1 3 1 3 1 3 1 3 1 3 1 2 1 4 1 4
2 3 1 3 2 3 1 3 2 3
ω
6
7
7
7
x µ x = 1
f y x
y
y x
8
x µ x = 1
f y=x
y
8
s
t
s t
t
9
s
t
s t
t
9
ω
t
ω
9
ω
t
ω
9
n s
s
10
n s
s
10
n s
s
10
n::N
s::σ
10
n::N
s::σ
10
n lfp
n gfp
ψUφ ω
n N lfp
n lfp
n n lfp
11
lfp
n gfp
ψUφ ω
n N lfp
n lfp
n n lfp
11
lfp
gfp
ψUφ ω
n N lfp
n lfp
n n lfp
11
lfp
gfp
ψUφ ω
lfp
n lfp
n n lfp
11
lfp
gfp
ψUφ ω
lfp
lfp
n n lfp
11
lfp
gfp
ψUφ ω
lfp
lfp
n⌊φ ωn⌋ lfp
11
lfp
gfp
ψUφ ω
lfp
lfp
n⌊φ ωn⌋ lfp
11
lfp
gfp
ψUφ ω
lfp
lfp
n⌊φ ωn⌋ lfp
11
lfp
gfp
ψUφ ω
lfp
ψUφ (tl ω)
lfp
n⌊φ ωn⌋ lfp
11
lfp
gfp
ψUφ ω
lfp
ψUφ (tl ω)
lfp
n⌊φ ωn⌋ lfp
11
lfp
gfp
ψUφ ω
lfp
ψUφ (tl ω)
lfp
n⌊φ ωn⌋ lfp
11
∞
i Ci i f Ci
12
i Ci i f Ci
12
i Ci i f Ci
12
i Ci i f Ci
12
i Ci i f Ci
12
i Ci) = ⊔ i f Ci
12
i Ci i f Ci
12
i Ci i f Ci
12
def
lfp
s
t
13
def
lfp
s
t
13
def
lfp
ω
t
13
def
lfp
ω
t
13
s
14
ω
14
ω
14
ω
n→∞ Prs(ωn = t) = N t
1 2
15
σ U)
16
18
18
18
α
1 2 1 4 1 4
β 1
19
20
s
min s f D Ks t min s f t
21
s
s
t
s
21
p= 1
2
n=0
p= 5
8
n= 1
4
p=1 n=1
p=0 n=0
p=0 n=0
3 4 1 4
1 2 1 2
s
s
22
s
23
s
23
s
23
s
23
s
23
24
⩾0
⩾0
25
1, s′).
1 c2, s′)
1 ̸= Skip
26
lfp
min c s r f
27
lfp
(c,s)(r f) = wp c f s 27
(c,s) (r f) = lfp
(c,s)
(Seq c1 c2,s)(r f) = Emin (c1,s)
(c2,s′)(r f)
(d,t)
28
30
31
31
31
31
31
31
31
31
31
31
F
F F
32
F
F F
32
32
32
1
33
κ⩽κ1 κ⩽κ2 κ⩽κ1 κ⩽κ2 α set⩽κ α set⩽κ 34
35
37
38
38
38
38
38
38
38
38
38
38