RTNS: Scheduling Analysis under Fault Bursts Florian Many, Frdric - - PowerPoint PPT Presentation

rtns scheduling analysis under fault bursts
SMART_READER_LITE
LIVE PREVIEW

RTNS: Scheduling Analysis under Fault Bursts Florian Many, Frdric - - PowerPoint PPT Presentation

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions RTNS: Scheduling Analysis under Fault Bursts Florian Many, Frdric Boniol, David Doose 5 November 2010 RTNS: Scheduling Analysis under Fault Bursts 1 / 25


slide-1
SLIDE 1

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

RTNS: Scheduling Analysis under Fault Bursts

Florian Many, Frédéric Boniol, David Doose 5 November 2010

1 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-2
SLIDE 2

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Context (1/3)

H a r d w a r e A r c h i t e c t u r e S

  • f

t w a r e Different Layers of Protection

Hardware Layer Architecture Layer Software Layer

2 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-3
SLIDE 3

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Context (2/3)

H a r d w a r e A r c h i t e c t u r e S

  • f

t w a r e Fault Tolerance Mechanisms

Hardware Layer

Shield Location

Architecture Layer

Duplication et triplication of critical equipments

Software Layer

Robust data model Method based on code execution or re-execution

3 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-4
SLIDE 4

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Context (3/3)

Real-Time System Overview

A set of tasks with hard temporal constraints A scheduler to assign task to processors

Some Relevant Questions

Assign priority to tasks Manage shared ressources Manage fault tolerance mechanisms

Schedulability Analysis Prove a priori the respect of all temporal constraints

4 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-5
SLIDE 5

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Plan of this Presentation

Problematic Coupling Scheduling Analysis and Fault Tolerance Guidelines

Definition of a fault model Definition of the scheduler behaviour when an error occurs Schedulability Analysis

5 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-6
SLIDE 6

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions 1

Fault Burst Model Fault Features Fault Burst Model Example

2

Detection, Correction and Strategies Error-Detection and Error-Correction Error Recovery Strategies

3

Scheduling Analysis Background Worst Case Response Time Equation Evaluation of Recovery Term Fi

4

Performance

6 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-7
SLIDE 7

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Fault Features

Origins of Faults

Inner faults

Bad design or implementation Electromagnetic Compatibility : Power supply and computer

Environmental faults

Sensors masked by an outer object Electromagnetic fields (radar waves), space rays

Consequences on Real-Time Systems

Permanent ⇒ Spatial Redundancy Transient ⇒ Temporal Redundancy

Temporal Distributions

Pseudo-periodic fault Fault bursts

8 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-8
SLIDE 8

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Fault Burst Model

∆F ∆F TF TF

Burst Definition

∆F = time interval during which there are potential faults Inner temporal distribution of faults unknown No fault outside a burst TF = minimum time interval between two fault burst starts

Example of Phenomenon

Aircraft through an electromagnetic field generated by radar waves

9 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-9
SLIDE 9

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

An Illustrated Example

Case of Rotative Air Radar [1, 2]

For a fly-by or over ground aircraft :

Elapsed time between two swept : few seconds Exposure time : tenth of seconds

Worst case for a slow aircraft :

15 swepts (2 seconds between swepts) 100 ms of exposure time by swept

RTCA and EUROCAE Guide to Certification Of Aircraft in a High Intensity Radiated Field (HIRF) Environment ED 107 - ARP 5583, 2001. RTCA and EUROCAE Environmental Conditions and Testprocedures for Airborne Equipment ED 14E - DO 160E, 2005. 10 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-10
SLIDE 10

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Error-Detection and Error-Correction

Detection Mechanisms

Use of acceptance tests, checksums, timer watchdogs etc... Instant of detection :

At the end of task Checkpoints (splitted tasks)

Correction Method

Re-execution of code

Full or partial re-execution of the erroneous task Alternative tasks, recovery blocks Exception Handlers

Assumption : Re-execution of the task corrects all errors

12 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-11
SLIDE 11

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Error Recovery Strategies

At Task Level

Tactic = error-detection + error-correction

At System Level

At error detection, different actions :

Manage preempted tasks Anticipate potential undetected errors

Strategies

Definition of scheduler behaviour towards preempted tasks

Remark Error recovery strategies infer fault tolerance

13 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-12
SLIDE 12

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Focused Error Recovery Strategies

End Detection/Full Re-execution/Simple Strategy

End Detection Full Re-execution of the faulty task Correction of the erroneous task Ex : Erronated data on a sensor

τ1 τ2 τ3

End Detection/Full Re-execution/Multiple Strategy

End Detection Full Reexecution Correction ot the erroneous task Preventive correction of preempted tasks Ex : Corrupted shared data

τ1 τ2 τ3

14 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-13
SLIDE 13

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Computational Model

Di

Ti Ti Ci Ci Ri = ri,0 ri,1 ri,2 di,0 di,1 Task Features

WCET : Ci , Deadline : Di,, Period : Ti Deadline less than or equal to period : Di ≤ Ti independent,periodic distinct priority

System Features

uniprocessor fixed priority assignement fault free scheduler

16 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-14
SLIDE 14

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Evaluation of Task Set Feasibility

Validation Techniques

Upper bound to the processor utilisation Worst Case Response Time Model Checking (multiprocessor) Workload

Worst Case Response Time

(Completion time - release date) task in the worst case schedulable task τi : WCRT ≤ Di task set feasible : ∀i, τi schedulable

17 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-15
SLIDE 15

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Worst Case Response Time Equation

τ1 τ2 τ3

I ∆F

3

+ F3 R3 ∆F I ∆F

3

+ F3 Task Instance Faulty Instance Re-execution Error Detection Task Release

Computation of the Worst Case Response Time R∆F

i

R∆F

i

= R∆F

i

= Ri R∆F

i

= Ri+∆F R∆F

i

= R (1)

Ri : Free fault WCRT ∆F : Duration of the fault burst Interference due to the highest priority tasks after the fb end I

∆F I

= X

hp(i)

& R

∆F i

− (Ri + ∆F ) Tj ’ Cj (2) Fi : Additional temporal cost due to the error recovery strategies

18 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-16
SLIDE 16

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Evaluation of Recovery Term Fi

Computation of Fi for the ED/FR/S strategy

τ1 τ2 τ3

R3 ∆F I ∆F

3

+ F3

Fi = 2 × X

hp(i)

Cj + 2 × Ci (3) Computation of the Fi for the ED/FR/M Strategy

τ1 τ2 τ3

R3 ∆F I ∆F

3

+ F3

Fi = max

j∈hp(i)

@Cj +

k=j

X

k=i−1

Ck 1 A + Ci (4)

19 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-17
SLIDE 17

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Example

∆F = 100 P T C D R S R∆F M2 R∆F 1 300 10 300 10 20 130 20 130 2 500 50 500 60 120 290 70 240 3 800 150 800 210 420 800 260 630

Descritpion

3-task set with Di = Ti scheduler : Rate Monotonic

Benefits

Efficiency of strategies : significative reduction of WCRT (25%) Unvailibility of the system : TF = 800, ∆F = 100 ⇒ 12, 5%

First impression

Multiple strategy better than simple

21 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-18
SLIDE 18

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Simulation (1/2)

τ1 τ2

R2 ∆F I ∆F

2

+ F2

τ1 τ2

R2 ∆F I ∆F

2

+ F2

Qualitative explanation of the benefits

"Temporal Economy" ⇒ reduction of necessary error-detections In practice, temporal additional cost (preventive re-executions) But effective approach for the validation of RTS

22 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-19
SLIDE 19

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Simulation (2/2)

Description

10-task sets 1000 task sets for a given range of processor utilisation variation of the fault burst

Comparison of Strategies : U = 0.5 [1]

Simple : ∆F = 3% of the longuest period Multiple : ∆F = 14% of the l.p

0.5 50 100 150 200 250 300 500 1000 Processor Utilisation Fault Burst Duration Schedulable Task Sets ED/FR/S ED/FR/M2

  • M. Pandya and M. Malek

Minimum achievable utilization for fault-tolerant processing of periodic tasks IEEE Transactions on Computers, 47(10) :1102–1112, 1998. 23 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-20
SLIDE 20

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Conclusions

Conclusion

A representative issue : UAVs in Radar waves (ONERA research) Results :

Fault Burst Model Error recovery strategies Schedulability Analysis Realistic approach showed by simulation

Perspectives

Implement strategies in a RTOS Works at system level ⇒ entry points :

safety : equipment failure platform features

24 / 25 RTNS: Scheduling Analysis under Fault Bursts

slide-21
SLIDE 21

Introduction Fault Model Strategies Scheduling Analysis Performance Conclusions

Thanks for your attention

25 / 25 RTNS: Scheduling Analysis under Fault Bursts