DFTCalc: A tool for Advanced Reliablity, Availability, Maintenance - - PowerPoint PPT Presentation

dftcalc a tool for advanced reliablity availability
SMART_READER_LITE
LIVE PREVIEW

DFTCalc: A tool for Advanced Reliablity, Availability, Maintenance - - PowerPoint PPT Presentation

DFTCalc: A tool for Advanced Reliablity, Availability, Maintenance and Safety analysis. Enno Ruijters, University of Twente Supervisor: Marielle Stoelinga 28/01/16 1 Reliability of critical systems System failures can be catastrophic


slide-1
SLIDE 1

28/01/16 1

DFTCalc: A tool for Advanced Reliablity, Availability, Maintenance and Safety analysis.

Enno Ruijters, University of Twente Supervisor: Marielle Stoelinga

slide-2
SLIDE 2

Reliability of critical systems

2

  • System failures can be catastrophic
  • Airplanes, nuclear power stations, etc.
  • How to ensure reliability:
  • At design stage: component selection,

redundancy, diversity, isolation

  • During operation: Inspection, maintenance,

repairs, replacement

slide-3
SLIDE 3
  • Effect of maintenace
  • Maintenance:
  • Improves reliability
  • Adds maintenance costs
  • Reduces costs of failure and downtime
  • Goal: Find cost-optimal maintenance

policy

0.5 1 1.5 2 1 2 3 4 5 6 7 8 Cost Inspections per year Inspection cost Downtime cost T

  • tal cost
slide-4
SLIDE 4

DFTCalc: 3 key ingredients

Fault Trees Maintenance Model checking

DFTCalc analysis goals:

  • What is the effect of maintenance on system performance:
  • reliability, availability, mean time to failures? …
  • Can we do better (lower costs / better performance)?

DFTCalc analysis goals:

  • What is the effect of maintenance on system performance:
  • reliability, availability, mean time to failures? …
  • Can we do better (lower costs / better performance)?

4

slide-5
SLIDE 5

Ingredient 1: fault trees

Preferred tool for RAMS

  • Model
  • How do component failures

propagate to system failures?

  • Analysis
  • P[failure within mission time]

(Reliability)

  • E[up-time] (Availability)
  • MTTF, MTBF
  • ….

Preferred tool for RAMS

  • Model
  • How do component failures

propagate to system failures?

  • Analysis
  • P[failure within mission time]

(Reliability)

  • E[up-time] (Availability)
  • MTTF, MTBF
  • ….

Talk:

  • Add maintenance
  • Large effect on MTTF
  • Hardly considered

Talk:

  • Add maintenance
  • Large effect on MTTF
  • Hardly considered

5

slide-6
SLIDE 6

Ingredient 1: fault trees

Graphical formalism

  • Decompose system failures

into combinations of component failures

  • Gates: failure propagation
  • Leaves component failures
  • Traditionally contain failure

rates/probabilities

  • We add degradation

behavior

  • Related: attack trees

in security

6

phone phone engine engine road trip road trip car car tire 1 tire 1 tire 2 tire 2 tire 3 tire 3 tire 4 tire 4 spare spare tires tires

slide-7
SLIDE 7

fault trees: who uses them? fault trees: who uses them?

7

slide-8
SLIDE 8

Ingredient 2: maintenance

Types corrective maintenance preventive maintenance Strategies condition-based age-based usage-based Our approach model these in FT leaves

8

slide-9
SLIDE 9

BE model

  • Describes one failure mode / cause (eg from FMECA)
  • Degradation behavior (phases)
  • Detection threshold
  • Maintenance effects

 condition-based maintenance

Modelling: failure behaviour in BEs

9

slide-10
SLIDE 10

Modelling: Inspection module

10

Inspection module

  • Above: dedicated for 1 components
  • More complex for multiple components
slide-11
SLIDE 11

Ingredient 3: (stochastic) model checking

2 flavors

  • verification: complete search
  • statistical: simulation

 complimentary 2 flavors

  • verification: complete search
  • statistical: simulation

 complimentary Model checking

  • state-of-art stochastic analysis
  • flexible, rigorous
  • used in HW verification
  • 2007: Turing Award

Model checking

  • state-of-art stochastic analysis
  • flexible, rigorous
  • used in HW verification
  • 2007: Turing Award

Many tools MRMC, Prism, UPPAAL, nuSMV, IMCA, ... Many tools MRMC, Prism, UPPAAL, nuSMV, IMCA, ...

11

slide-12
SLIDE 12

Recap: 3 key ingredients

Fault Trees Maintenance Model checking

DFTCalc analysis goals:

  • What is the effect of maintenance on system performance:
  • reliability, availability, mean time to failures? …
  • Can we do better (lower costs / better performance)?

DFTCalc analysis goals:

  • What is the effect of maintenance on system performance:
  • reliability, availability, mean time to failures? …
  • Can we do better (lower costs / better performance)?

12

slide-13
SLIDE 13

Outline

13

  • Introduction
  • Approach
  • Case studies
  • Conclusions
slide-14
SLIDE 14

14

Our approach: how does it work?

DFTCalc

FT + maintenance

  • Gates: AND, SPARE
  • BEs: failure behavior
  • IM/RU: inspections,

repairs Analysis

  • system reliability over time
  • mean time to failure
  • availability

DFTCalc

  • Extensible framework

Questions:

  • Does system meets reliability / availability requirements? Can we do better?
  • What is the effect of different maintenance policies? (= different BEs / parameters)

Questions:

  • Does system meets reliability / availability requirements? Can we do better?
  • What is the effect of different maintenance policies? (= different BEs / parameters)
slide-15
SLIDE 15

Our approach: how does it work?

DFTCalc

15

Fault Tree maintenance model

Translation Analysis Efficiency:

  • Compositional aggregation
  • Context-dependent state space

generation Efficiency:

  • Compositional aggregation
  • Context-dependent state space

generation

slide-16
SLIDE 16

Our approach: alternative

Uppaal-SMC

16

Fault Tree maintenance model

Benefits:

  • Often much faster
  • Supports arbitrary failure time distributions

Disadvantages:

  • Results are less precise
  • Can be much slower if high accuracy is desired

Benefits:

  • Often much faster
  • Supports arbitrary failure time distributions

Disadvantages:

  • Results are less precise
  • Can be much slower if high accuracy is desired

Manual translation

slide-17
SLIDE 17

DFTCalc: Extensions

17

  • New Inspection module
  • New repair module
  • New maintainable Basic

Events

  • Context dependent

generation

  • Inspection and repair

communication

slide-18
SLIDE 18

DFTCalc: web-interface

18

http://fmt.ewi.utwente.nl/puptol/dftcalc/

slide-19
SLIDE 19

Outline

19

  • Introduction
  • Approach
  • Case studies
  • Conclusions
slide-20
SLIDE 20

Case 1: Electrically Insulated Joint

20

  • Electrically separates tracks
  • 45.000 EIJs in the Netherlands
  • Important cause of train disruptions
slide-21
SLIDE 21

EI-joint: modeling

21

New features:

  • RDEP gate
  • Advanced BEs
slide-22
SLIDE 22

EI-joint: maintenance

22

Maintenance policy:

  • Four trackside inspections per year.
  • Repair action can either repair specific failure
  • (e.g. removing a foreign object)
  • Or needs to replace the entire joint.
  • Costs for inspections and maintenance actions are

known.

  • Costs for failures depends on how many passengers

are affected.

slide-23
SLIDE 23

Results EI-joint: Current maintenance policy

23

Result:

  • Failure behaviour is very linear after first few years.
slide-24
SLIDE 24

Results EI-joint: Current maintenance policy

24

Breakdown of failure causes:

  • Majority of failures are due to electrical insulation
  • Almost all electrical failures are due to external shorts
slide-25
SLIDE 25

Results EI-joint: Different maintenance policies

25

Result:

  • Inspections are clearly important.
  • Does increased reliability lead to lower cost?
slide-26
SLIDE 26

Results EI-joint: Different maintenance policies

26

Result:

  • Inspections are important, but the exact frequency

does not strongly affect cost.

slide-27
SLIDE 27

Results EI-joint: Maintenance optimization

27

Result:

  • Cost optimum around 3 – 4 inspections per year.
  • Costs fairly constant between 3 and 6 per year.
slide-28
SLIDE 28

EI-joint: modeling process

28

  • Fault tree based on existing FMECA by Prorail.
  • Structure of FT is clear from context.
  • Total failure rate per failure mode is documented.
  • More details obtained using questionnaire to experts:
  • Variance of failure rate
  • External factors affecting failure

(location, surface condition, etc.)

  • Translation of physical description of maintenance

threshold ('>5mm vertical movement') to time- based description ('repair needed within 1 month')

  • Tweaking and validation using recorded failure data.
slide-29
SLIDE 29
  • Conclusions EI-joint
  • Our model of the EI-joint agrees with

reality under the current maintenance policy.

  • We find the cost-optimal maintenance

policy consists of four inspections per year.

  • More inspections result in noticably

fewer disruptions, but are not cost- effective.

slide-30
SLIDE 30

Case 2: pneumatic compressor

30

Purpose: Provide compressed air for brakes, automatic doors, etc.

  • Complex maintenance policy with

several levels of inspections and repairs.

  • Modeling performed by NedTrain,

analysis by UT.

slide-31
SLIDE 31

Compressor: modeling

31

Similar features to the EI-joint fault tree

slide-32
SLIDE 32

Compressor: maintenance policy

32

  • Quick inspection every two days.
  • Check diagnostic computer logs for errors.
  • Visual inspection for obvious problems (e.g. oil leak).
  • Services every 3 months, more intensive every 9.
  • Replace consumables (e.g. filters)
  • Functional tests.
  • Minor overhaul every 3 years, major overhaul at 6.
  • Compressor disassembled, components inspected.
  • After major overhaul, compressor is as good as new.
  • At any level, if a fault cannot be repaired, the next

level of maintenance is performed, at increased cost (called an unplanned maintenance event).

slide-33
SLIDE 33

Results compressor: Current policy

33

Result:

  • Outcomes are fairly close to reality
slide-34
SLIDE 34

Results compressor: Other policies

34

Results:

  • Service period is important to maintain reliability.
  • Minor overhaul does not have much effect.
slide-35
SLIDE 35

Conclusions case studies

35

  • Fault maintenance trees can model realistic maintenance

strategies.

  • We can analyze systems with maintenance and gain insight

into cost-optimal performance.

  • Our results are in agreement with reality.
slide-36
SLIDE 36

Current work

36

  • Replace phased degradation by continuous degradation

(Completed but untested).

  • Significant optimization of computation of fault trees

with maintenance (completed, not yet public).

  • Support for more advanced gates involving combinations of

degraded BEs.

  • Decent input language for fault trees with maintenance.
  • Automatic optimization of complex maintenance policies.
slide-37
SLIDE 37

Conclusions

37

  • Maintenance has large effect on RAMS
  • should be analyzed in integral way
  • Fault maintenance trees
  • Extend fault trees to include maintenance
  • DFTCalc
  • extensible tool for reliability & availability analysis
  • compare different maintenance policies
slide-38
SLIDE 38