[PPT] - Dependability Evaluation through Markovian model Markovian model PowerPoint Presentation

SLIDE 1

Dependability Evaluation through Markovian model

SLIDE 2

Markovian model

The combinatorial methods are unable to:

take care easily of the coverage factor
model the maintenance

The Markov model is an alternative to the combinatorial methods. T wo main concepts:

state
state transition

SLIDE 3

State and state transitions

State: the state of a system represents all that must be known to describe the system at any given instant of time

For the reliability/availability models each state represents a distinct combination of faulty and fault-free components

State transitions govern the changes if state that occur within a system

For the reliability/availability models each transition takes place when one or more components change state due to an event of a fault or a repair action

SLIDE 4

State and state transitions (cnt.)

State transitions are characterized by probabilities, such as probability of fault,

fault coverage and the probability of repair

The probability of being in any given state, s, at some time,t+∆t depends both:

– the probability that the system was in a state from which it could transit to state state s given that the transition occurs during ∆t – the probability that the system was in state s at instant t and there was no event in the interval time ∆t

The initial state should be any state, normally it is that representing all fault-

free components IMPORTANT: IN A MARKOV CHAIN THE PROBABILITY TRANSITION DEPENDS ONLY ON THE ACTUAL STATE (Memoryless Property)

SLIDE 5

TMR reliability evaluation

I O

C1 C2 C3 r/n

There are 4 components (1 voter + computation

module), therefore each state is represented by 4 bit:

if the component is fault-free then the bit value is 1
otherwise the bit value is 0.
For example (1,1,1,1) represents the faut-free state
For example (0,0,0,0) represents all components faulty

SLIDE 6

TMR reliability evaluation: states diagram

1,1,1,1 1,1,0,1 0,1,1,1 0,0,1,1 0,1,1,0 0,0,1,0 1,0,1,1 1,0,1,0 0,1,0,0 0,0,0,0 1,1,1,0 1,0,0,0 1,1,0,0 1,0,0,1 0,0,0,1 0,1,0,1

SLIDE 7

Markov chain reliability evaluation methodology

State transition probability evaluation:
If the fault occurence of a component is

exponentially distributed (e-λt) with fault rate equal to (λ), then the probability that the fault-free component at istant t in the interval ∆t become faulty is equal to:

1 – e-λ∆t

SLIDE 8

Probability property

Prob{there is a fault between t e t+∆t} = = Prob{there is a fault before t+∆t/the component was fault-free at t} = = Prob{there is a faul before t+∆t and the component was fault-free at t} Prob{the component was fault-free at t} = Prob{there is a fault before t+∆t} − Prob{there is a fault before t} = Prob{the component was fault-free at t} = (1 – e-λ(t+∆t)) − (1 – e-λt) = 1 – e-λ(t+∆t) − 1 + e-λt e-λt e-λt

SLIDE 9

Probability property

= e-λt – e-λ(t+∆t) = e-λt = e-λt _ e-λ(t+∆t) = 1 − e-λ∆t

e-λt

If we expand the exponential part we have the following series: 1 – e-λ∆t = 1 − 1 + (−λ∆t) + (−λ∆t)2 + … 2! = λ∆t − (−λ∆t)2 − … 2! For value of λ∆t << 1, we have the following good approximation: 1 – e-λ∆t ≈ λ∆t

c

SLIDE 10

TMR reliability evaluation: reduced states diagram

3λe 2,1 G 2λe+λv 3,1 λv

State (3,1) → (1,1,1,1) State (2,1) → (0,1,1,1) + (1,0,1,1) + (1,1,0,1) State (G) → all the other states

Transition probability (in the interval between t and t+∆t):

from state (3,1) to state (2,1) -> 3λe∆t ;
from state (3,1) to state (G) -> λv∆t ;
from state(2,1) to state (G) -> 2λe∆t+λv∆t .

SLIDE 11

TMR reliability evaluation

Given the Markov process properties, i.e.

the probability of being in any given state, s, at some time, t+∆t depends both:

– the probability that the system was in a state from which it could transit to state state s given that the transition

ccurs during ∆t

– the probability that the system was in state s at instant t and there was no event in the interval time ∆t

we have that:

P(3,1) (t+∆t) = (1− 3λe ∆t − λv ∆t) P(3,1) (t) P(2,1) (t+∆t) = 3λe ∆t P(3,1) (t) + (1− 2λe ∆t − λv ∆t) P(2,1) (t) P(G) (t+∆t) = λv ∆t P(3,1) (t) + (2λe ∆t + λv ∆t) P(2,1) (t) + P(G) (t)

SLIDE 12

TMR reliability evaluation

With algebric operations:

∆t → 0

P(3,1) (t+∆t) − P(3,1) (t) = − (3λe + λv) P(3,1) (t) = d P(3,1) (t) ∆t dt ∆t → 0 P(2,1) (t+∆t) − P(2,1) (t) = 3λe P(3,1) (t) − (2λe + λv) P(2,1) (t) = d P(2,1) (t) ∆t dt ∆t → 0 P(G) (t+∆t) − P(G) (t) = λv P(3,1) (t) + (2λe + λv) P(2,1) (t) = d P(G) (t) ∆t dt

SLIDE 13

TMR reliability evaluation

i.e: P'3,1 (t) = − (3λe + λv )P3,1 (t) P'2,1 (t) = 3λe P3,1 (t) − (2λe + λv )P2,1 (t) P'G (t) = λv P3,1 (t) + (2λe + λv )P2,1 (t)

That in matrix notation can be expressed as: π(t) = π(t) Q(t) dt (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

−

SLIDE 14

TMR reliability evaluation

the reliability is the probability of being in any fault- free state, i.e, in this case of being in state (3,1) or (2,1). R(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial condition P3,1(0) = 1

SLIDE 15

TMR reliability evaluation

where:

−(3λe+λv) 3λe λv Q = 0 −(2λe+λv) (2λe+λv) 0 0 0 P = Q + I → Q = P − I 1−(3λe+λv) 3λe λv P = 0 1−(2λe+λv) (2λe+λv) 0 0 1

SLIDE 16

Properties of Laplace’s transformation

SLIDE 17

Markov Processes for maintenable systems

Two kinds of events:

fault of a component (module or voter)
repair of the system (of a module or the voter or both)

Hypothesis: the maintenance process is exponentially distributed with repair rate equal to µ

3λe

2,1 G

2λe+λv

3,1

λv µ µ

SLIDE 18

Availability evaluation of TMR system

3λe

2,1 G

2λe+λv

3,1

λv µ µ

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 P’3,1(t) = − (3λe + λv ) P3,1(t) + µ P2,1(t) + µ PG(t) P’2,1(t) = 3λe P3,1(t) − (2λe + λv + µ) P2,1(t) P’G(t) = λv P3,1(t) + (2λe + λv) P2,1(t) − µ PG(t)

dπ(t) = π(t) Q(t)

dt

i.e. (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

SLIDE 19

Availability evaluation of TMR system

−(3λe+λv) 3λe λv Q = µ −(2λe+λv+µ ) (2λe+λv) µ − µ Q = P − I → P = Q + I 1 −(3λe+λv) 3λe λv P = µ 1 − (2λe+λv+µ) (2λe+λv) µ 1 − µ

SLIDE 20

Istantaneous Availability evaluation

f TMR system

The Istantaneous Availability is the probability of being in any fault-free state (in this case: state (3,1) or (2,1)). A(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial conditionP3,1(0) = 1

SLIDE 21

Limiting or steady state Availability evaluation

f TMR system

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 with t 00 we have that P’(t) = 0 P’3,1(t) = 0 = − (3λe + λv ) P3,1(t) + µ P2,1(t) + µ PG(t) P’2,1(t) = 0 = 3λe P3,1(t) − (2λe + λv + µ) P2,1(t) P’G(t) = 0 = λv P3,1(t) + (2λe + λv) P2,1(t) − µ PG(t)

SLIDE 22

Limiting or steady state Availability evaluation

f TMR system

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 with t 00 we have that P’(t) = 0 and P(t) = P P’3,1(t) = 0 = − (3λe + λv ) P3,1 + µ P2,1 + µ PG P’2,1(t) = 0 = 3λe P3,1 − (2λe + λv + µ) P2,1 P’G(t) = 0 = λv P3,1 + (2λe + λv) P2,1(t) − µ PG

SLIDE 23

Limiting or steady state Availability evaluation

f TMR system

P3,1 + P2,1 + PG= 1 P3,1 = P2,1 = PG =

SLIDE 24

Safety evaluation

Four types of events:

fault of a component (module or voter) correcttly

diagnoticated

fault of a component not detected
correct repair of the system (of a module or the voter
r both)
uncorrect repair of the system

λ → fault rate µ → repair rate Cg → fault detection coverage factor Cr → correct repair coverage factor

SLIDE 25

Single component Safety evaluation

λ Cf

S U

µ (1-Cr)

λ (1-Cf)

µ Cr

→ fault free state S→ safe fault state U→ unsafe fault state

Hypothesis:

if a fault is not well diagnosticated then it

will never be detected

If a reconfiguration is not wel done then it

will be never detected

Therefore U is an absorbing state

SLIDE 26

Single component Safety evaluation

Safety = probability to stay in state 0 or GS

PO(t) + PGS(t) = 1 − PGI(t) PO(0) = 1 P’O(t) = − (λ(1− Cg) + λCg)) PO(t) + µCr PGS(t) P’GS(t) = λCg PO(t) − (µ(1− Cr)+ µCr) PGS(t) P’GI(t) = λ(1− Cg) PO(t) + (µ(1− Cr)PGS(t)

SLIDE 27

Single component Safety evaluation

−λ λCg λ(1-Cg) Q = µCr µ µ (1-Cr)

dπ(t) = π(t) Q(t)

dt

i.e. (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

SLIDE 28

Performability

Index taking into account even the performance of the system given its state (related to the number of fault-free components)

f(z(t)) t

fmin

S1 S2

We will discuss it when we will know how evaluate the performance of a system

SLIDE 29

I O C21 C23 C22 C111 C12 C112 C11 C1 C2

R 11 = R 111 . R 112 R 1 = 1 - (1 - R 11) . (1 - R 12) R 2 = 1 - (1 - R 21) . (1 - R 22) . (1 - R 23) R = R 1 . R 2

Dependability Evaluation through Markovian model

Markovian model

The combinatorial methods are unable to:

The Markov model is an alternative to the combinatorial methods. T wo main concepts:

State and state transitions

State: the state of a system represents all that must be known to describe the system at any given instant of time

For the reliability/availability models each state represents a distinct combination of faulty and fault-free components

State transitions govern the changes if state that occur within a system

For the reliability/availability models each transition takes place when one or more components change state due to an event of a fault or a repair action

State and state transitions (cnt.)

TMR reliability evaluation

module), therefore each state is represented by 4 bit:

TMR reliability evaluation: states diagram

Markov chain reliability evaluation methodology

exponentially distributed (e-λt) with fault rate equal to (λ), then the probability that the fault-free component at istant t in the interval ∆t become faulty is equal to:

Probability property

Probability property

= e-λt – e-λ(t+∆t) = e-λt = e-λt _ e-λ(t+∆t) = 1 − e-λ∆t

e-λt

If we expand the exponential part we have the following series: 1 – e-λ∆t = 1 − 1 + (−λ∆t) + (−λ∆t)2 + … 2! = λ∆t − (−λ∆t)2 − … 2! For value of λ∆t << 1, we have the following good approximation: 1 – e-λ∆t ≈ λ∆t

TMR reliability evaluation: reduced states diagram

Transition probability (in the interval between t and t+∆t):

TMR reliability evaluation

the probability of being in any given state, s, at some time, t+∆t depends both:

we have that:

P(3,1) (t+∆t) = (1− 3λe ∆t − λv ∆t) P(3,1) (t) P(2,1) (t+∆t) = 3λe ∆t P(3,1) (t) + (1− 2λe ∆t − λv ∆t) P(2,1) (t) P(G) (t+∆t) = λv ∆t P(3,1) (t) + (2λe ∆t + λv ∆t) P(2,1) (t) + P(G) (t)

TMR reliability evaluation

TMR reliability evaluation

i.e: P'3,1 (t) = − (3λe + λv )P3,1 (t) P'2,1 (t) = 3λe P3,1 (t) − (2λe + λv )P2,1 (t) P'G (t) = λv P3,1 (t) + (2λe + λv )P2,1 (t)

TMR reliability evaluation

the reliability is the probability of being in any fault- free state, i.e, in this case of being in state (3,1) or (2,1). R(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial condition P3,1(0) = 1

TMR reliability evaluation

Properties of Laplace’s transformation

Markov Processes for maintenable systems

Availability evaluation of TMR system

Availability evaluation of TMR system

Istantaneous Availability evaluation

The Istantaneous Availability is the probability of being in any fault-free state (in this case: state (3,1) or (2,1)). A(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial conditionP3,1(0) = 1

Limiting or steady state Availability evaluation

Limiting or steady state Availability evaluation

Limiting or steady state Availability evaluation

Safety evaluation

Single component Safety evaluation

S U

Hypothesis:

Therefore U is an absorbing state

Single component Safety evaluation

Safety = probability to stay in state 0 or GS

Single component Safety evaluation

Performability

Reliability/Availability/Safety evaluation of complex system