Dependability Evaluation through Markovian model Markovian model - - PowerPoint PPT Presentation

dependability evaluation through markovian model
SMART_READER_LITE
LIVE PREVIEW

Dependability Evaluation through Markovian model Markovian model - - PowerPoint PPT Presentation

Dependability Evaluation through Markovian model Markovian model The combinatorial methods are unable to: - take care easily of the coverage factor - model the maintenance The Markov model is an alternative to the combinatorial methods. T


slide-1
SLIDE 1

Dependability Evaluation through Markovian model

slide-2
SLIDE 2

Markovian model

The combinatorial methods are unable to:

  • take care easily of the coverage factor
  • model the maintenance

The Markov model is an alternative to the combinatorial methods. T wo main concepts:

  • state
  • state transition
slide-3
SLIDE 3

State and state transitions

State: the state of a system represents all that must be known to describe the system at any given instant of time

For the reliability/availability models each state represents a distinct combination of faulty and fault-free components

State transitions govern the changes if state that occur within a system

For the reliability/availability models each transition takes place when one or more components change state due to an event of a fault or a repair action

slide-4
SLIDE 4

State and state transitions (cnt.)

  • State transitions are characterized by probabilities, such as probability of fault,

fault coverage and the probability of repair

  • The probability of being in any given state, s, at some time,t+∆t depends both:

– the probability that the system was in a state from which it could transit to state state s given that the transition occurs during ∆t – the probability that the system was in state s at instant t and there was no event in the interval time ∆t

  • The initial state should be any state, normally it is that representing all fault-

free components IMPORTANT: IN A MARKOV CHAIN THE PROBABILITY TRANSITION DEPENDS ONLY ON THE ACTUAL STATE (Memoryless Property)

slide-5
SLIDE 5

TMR reliability evaluation

I O

C1 C2 C3 r/n

  • There are 4 components (1 voter + computation

module), therefore each state is represented by 4 bit:

  • if the component is fault-free then the bit value is 1
  • otherwise the bit value is 0.
  • For example (1,1,1,1) represents the faut-free state
  • For example (0,0,0,0) represents all components faulty
slide-6
SLIDE 6

TMR reliability evaluation: states diagram

1,1,1,1 1,1,0,1 0,1,1,1 0,0,1,1 0,1,1,0 0,0,1,0 1,0,1,1 1,0,1,0 0,1,0,0 0,0,0,0 1,1,1,0 1,0,0,0 1,1,0,0 1,0,0,1 0,0,0,1 0,1,0,1

slide-7
SLIDE 7

Markov chain reliability evaluation methodology

  • State transition probability evaluation:
  • If the fault occurence of a component is

exponentially distributed (e-λt) with fault rate equal to (λ), then the probability that the fault-free component at istant t in the interval ∆t become faulty is equal to:

  • 1 – e-λ∆t
slide-8
SLIDE 8

Probability property

Prob{there is a fault between t e t+∆t} = = Prob{there is a fault before t+∆t/the component was fault-free at t} = = Prob{there is a faul before t+∆t and the component was fault-free at t} Prob{the component was fault-free at t} = Prob{there is a fault before t+∆t} − Prob{there is a fault before t} = Prob{the component was fault-free at t} = (1 – e-λ(t+∆t)) − (1 – e-λt) = 1 – e-λ(t+∆t) − 1 + e-λt e-λt e-λt

slide-9
SLIDE 9

Probability property

= e-λt – e-λ(t+∆t) = e-λt = e-λt _ e-λ(t+∆t) = 1 − e-λ∆t

e-λt

e-λt

If we expand the exponential part we have the following series: 1 – e-λ∆t = 1 − 1 + (−λ∆t) + (−λ∆t)2 + … 2! = λ∆t − (−λ∆t)2 − … 2! For value of λ∆t << 1, we have the following good approximation: 1 – e-λ∆t ≈ λ∆t

c

slide-10
SLIDE 10

TMR reliability evaluation: reduced states diagram

3λe 2,1 G 2λe+λv 3,1 λv

State (3,1) → (1,1,1,1) State (2,1) → (0,1,1,1) + (1,0,1,1) + (1,1,0,1) State (G) → all the other states

Transition probability (in the interval between t and t+∆t):

  • from state (3,1) to state (2,1) -> 3λe∆t ;
  • from state (3,1) to state (G) -> λv∆t ;
  • from state(2,1) to state (G) -> 2λe∆t+λv∆t .
slide-11
SLIDE 11

TMR reliability evaluation

Given the Markov process properties, i.e.

the probability of being in any given state, s, at some time, t+∆t depends both:

– the probability that the system was in a state from which it could transit to state state s given that the transition

  • ccurs during ∆t

– the probability that the system was in state s at instant t and there was no event in the interval time ∆t

we have that:

P(3,1) (t+∆t) = (1− 3λe ∆t − λv ∆t) P(3,1) (t) P(2,1) (t+∆t) = 3λe ∆t P(3,1) (t) + (1− 2λe ∆t − λv ∆t) P(2,1) (t) P(G) (t+∆t) = λv ∆t P(3,1) (t) + (2λe ∆t + λv ∆t) P(2,1) (t) + P(G) (t)

slide-12
SLIDE 12

TMR reliability evaluation

With algebric operations:

∆t → 0

P(3,1) (t+∆t) − P(3,1) (t) = − (3λe + λv) P(3,1) (t) = d P(3,1) (t) ∆t dt ∆t → 0 P(2,1) (t+∆t) − P(2,1) (t) = 3λe P(3,1) (t) − (2λe + λv) P(2,1) (t) = d P(2,1) (t) ∆t dt ∆t → 0 P(G) (t+∆t) − P(G) (t) = λv P(3,1) (t) + (2λe + λv) P(2,1) (t) = d P(G) (t) ∆t dt

slide-13
SLIDE 13

TMR reliability evaluation

i.e: P'3,1 (t) = − (3λe + λv )P3,1 (t) P'2,1 (t) = 3λe P3,1 (t) − (2λe + λv )P2,1 (t) P'G (t) = λv P3,1 (t) + (2λe + λv )P2,1 (t)

That in matrix notation can be expressed as: π(t) = π(t) Q(t) dt (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

slide-14
SLIDE 14

TMR reliability evaluation

the reliability is the probability of being in any fault- free state, i.e, in this case of being in state (3,1) or (2,1). R(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial condition P3,1(0) = 1

slide-15
SLIDE 15

TMR reliability evaluation

where:

−(3λe+λv) 3λe λv Q = 0 −(2λe+λv) (2λe+λv) 0 0 0 P = Q + I → Q = P − I 1−(3λe+λv) 3λe λv P = 0 1−(2λe+λv) (2λe+λv) 0 0 1

slide-16
SLIDE 16

Properties of Laplace’s transformation

slide-17
SLIDE 17

Markov Processes for maintenable systems

Two kinds of events:

  • fault of a component (module or voter)
  • repair of the system (of a module or the voter or both)

Hypothesis: the maintenance process is exponentially distributed with repair rate equal to µ

3λe

2,1 G

2λe+λv

3,1

λv µ µ

slide-18
SLIDE 18

Availability evaluation of TMR system

3λe

2,1 G

2λe+λv

3,1

λv µ µ

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 P’3,1(t) = − (3λe + λv ) P3,1(t) + µ P2,1(t) + µ PG(t) P’2,1(t) = 3λe P3,1(t) − (2λe + λv + µ) P2,1(t) P’G(t) = λv P3,1(t) + (2λe + λv) P2,1(t) − µ PG(t)

dπ(t) = π(t) Q(t)

dt

i.e. (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

slide-19
SLIDE 19

Availability evaluation of TMR system

−(3λe+λv) 3λe λv Q = µ −(2λe+λv+µ ) (2λe+λv) µ − µ Q = P − I → P = Q + I 1 −(3λe+λv) 3λe λv P = µ 1 − (2λe+λv+µ) (2λe+λv) µ 1 − µ

slide-20
SLIDE 20

Istantaneous Availability evaluation

  • f TMR system

The Istantaneous Availability is the probability of being in any fault-free state (in this case: state (3,1) or (2,1)). A(t) = P3,1 (t) + P2,1 (t) = 1 − PG (t) with the initial conditionP3,1(0) = 1

slide-21
SLIDE 21

Limiting or steady state Availability evaluation

  • f TMR system

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 with t 00 we have that P’(t) = 0 P’3,1(t) = 0 = − (3λe + λv ) P3,1(t) + µ P2,1(t) + µ PG(t) P’2,1(t) = 0 = 3λe P3,1(t) − (2λe + λv + µ) P2,1(t) P’G(t) = 0 = λv P3,1(t) + (2λe + λv) P2,1(t) − µ PG(t)

slide-22
SLIDE 22

Limiting or steady state Availability evaluation

  • f TMR system

P3,1(t) + P2,1(t) + PG(t) = 1 P3,1(0) = 1 with t 00 we have that P’(t) = 0 and P(t) = P P’3,1(t) = 0 = − (3λe + λv ) P3,1 + µ P2,1 + µ PG P’2,1(t) = 0 = 3λe P3,1 − (2λe + λv + µ) P2,1 P’G(t) = 0 = λv P3,1 + (2λe + λv) P2,1(t) − µ PG

slide-23
SLIDE 23

Limiting or steady state Availability evaluation

  • f TMR system

P3,1 + P2,1 + PG= 1 P3,1 = P2,1 = PG =

slide-24
SLIDE 24

Safety evaluation

Four types of events:

  • fault of a component (module or voter) correcttly

diagnoticated

  • fault of a component not detected
  • correct repair of the system (of a module or the voter
  • r both)
  • uncorrect repair of the system

λ → fault rate µ → repair rate Cg → fault detection coverage factor Cr → correct repair coverage factor

slide-25
SLIDE 25

Single component Safety evaluation

λ Cf

S U

µ (1-Cr)

λ (1-Cf)

µ Cr

→ fault free state S→ safe fault state U→ unsafe fault state

Hypothesis:

  • if a fault is not well diagnosticated then it

will never be detected

  • If a reconfiguration is not wel done then it

will be never detected

Therefore U is an absorbing state

slide-26
SLIDE 26

Single component Safety evaluation

Safety = probability to stay in state 0 or GS

PO(t) + PGS(t) = 1 − PGI(t) PO(0) = 1 P’O(t) = − (λ(1− Cg) + λCg)) PO(t) + µCr PGS(t) P’GS(t) = λCg PO(t) − (µ(1− Cr)+ µCr) PGS(t) P’GI(t) = λ(1− Cg) PO(t) + (µ(1− Cr)PGS(t)

slide-27
SLIDE 27

Single component Safety evaluation

−λ λCg λ(1-Cg) Q = µCr µ µ (1-Cr)

dπ(t) = π(t) Q(t)

dt

i.e. (P'3,1 P'2,1 P'G) = (P3,1 P2,1 PG) * Q

slide-28
SLIDE 28

Performability

Index taking into account even the performance of the system given its state (related to the number of fault-free components)

f(z(t)) t

fmin

S1 S2

We will discuss it when we will know how evaluate the performance of a system

slide-29
SLIDE 29

I O C21 C23 C22 C111 C12 C112 C11 C1 C2

R 11 = R 111 . R 112 R 1 = 1 - (1 - R 11) . (1 - R 12) R 2 = 1 - (1 - R 21) . (1 - R 22) . (1 - R 23) R = R 1 . R 2

Reliability/Availability/Safety evaluation of complex system