Mixed Criticality Systems: Beyond Transient Faults Abhilash Thekkilakattil, Alan Burns, Radu Dobrin and Sasikumar Punnekkat

Motivation and Contribution
• State of the art of mixed criticality scheduling mainly focuses on WCET overruns
• WCET overruns are one example of transient faults
• We propose an approach for the design and scheduling of mixed criticality systems under permanent faults

Introduction
• Mixed criticality scheduling deals with scheduling real-time tasks with varying levels of WCET assurances
• Growing interest in mixed criticality scheduling since Vestal’s RTSS’07 paper
• 230 citations according to Google Scholar
• Over 200 follow-up papers according to “Mixed Criticality Systems - A Review” (6th ed.) by Burns and Davis

Goals of Mixed Criticality Scheduling
• Enable certification by different certifying authorities
  – Demonstrate timeliness under different WCETs
• Enable efficient utilization of the underlying computing infrastructure
  – Enabling safe sharing of the computing infrastructure
  – Ensuring isolation of critical from less critical tasks

State of the Art Mixed Criticality Scheduling
• Criticality monotonic priority ordering
• Adaptive and static scheduling
• Scheduling with virtual deadlines/periods
• Mixed criticality scheduling under faults

The Dependability Perspective
• Threats: Faults, Errors, Failures
• Attributes: Availability, Reliability, Safety, Confidentiality, Integrity, Maintainability
• Means: Fault Prevention, Fault Tolerance, Fault Removal, Fault Forecasting
• Figure annotation: Focus of MC scheduling
Source: Avizienis et al., “Basic Concepts and Taxonomy of Dependable and Secure Computing”, IEEE Transactions on Dependable and Secure Computing, 2004

Faults, Errors and Failures
Fault            Error                   Failure
A bit flip       Wrong computed value    Incorrect actuation
WCET overrun     Task deadline miss      High criticality deadline miss
Many types of faults other than WCET overruns are not covered by Vestal-like models

Classification of Faults
Transient Faults
• Fault whose presence is limited in time
• Examples include bit flips and WCET overruns
• Solution: temporal redundancy, e.g., task re-executions
Permanent Faults
• Fault whose presence is continuous in time
• Examples include memory and processor failures
• Solution: spatial redundancy, e.g., using additional hardware

Transient Fault Tolerance
[Figure: execution budgets labelled Level 1 WCET, Level 2 WCET, Level 3 WCET, Level 4 WCET]
• Temporal redundancy: replicate the tasks in time
  – Re-execute the task
  – Execute an alternate task
• The time for re-execution or alternate task execution can be seen as the “extra time” needed in Vestal’s model
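
The link between re-execution and per-level WCET budgets can be illustrated with a small sketch. It assumes, purely for illustration, that each re-execution costs as much as the original run and that each level tolerates a fixed number of re-executions; the function name, the base WCET of 10 and the re-execution counts are hypothetical and do not come from the slides.

```python
def budgets_with_reexecution(base_wcet, reexecutions_per_level):
    """Return one execution budget per level.

    Assumption (not from the slides): every re-execution costs the same
    as the original run, so a level tolerating k re-executions needs
    base_wcet * (1 + k) time units.
    """
    return [base_wcet * (1 + k) for k in reexecutions_per_level]


# Hypothetical task: base WCET of 10 time units, levels 1..4 tolerate
# 0, 1, 2 and 3 re-executions respectively.
budgets = budgets_with_reexecution(base_wcet=10, reexecutions_per_level=[0, 1, 2, 3])
print(budgets)
```

Under these assumptions the list of budgets plays the role of the increasing per-level WCET estimates in a Vestal-style task model.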

Focus of this Paper
How to design mixed criticality real-time architectures to tolerate permanent faults?
Contribution:
1. Propose a fault coverage based mapping of criticalities
2. Present a taxonomy of fault tolerance mechanisms in the context of mixed criticality systems

Classification of Permanent Faults
• Design Faults
  – Faults due to deficiencies in design and development, e.g., manufacturing defects in computers
  – Hardware and software design faults
• Random Faults
  – Faults for which neither the time of occurrence nor the cause can be determined, e.g., faults due to wear and tear
• Byzantine Faults
  – Faults in which replicas behave arbitrarily differently
  – Worst kind of faults: require a high amount of redundancy

Tolerating Permanent Faults
[Figure: the input is fed to Replica 1, Replica 2 and Replica 3; a voter combines their results into the output]
Requires additional hardware (N-modular paradigm)
  – Replicate the tasks on multiple hardware units
  – Perform voting to detect and mask failures
  – Use diversity to prevent common cause failures
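
The voting step of the N-modular paradigm can be sketched as a strict majority vote over replica outputs. This is a minimal illustration, not the voter used by the authors; the function name is hypothetical and outputs are assumed to be hashable values that can be compared for exact equality.

```python
from collections import Counter


def majority_vote(outputs):
    """Return the value produced by a strict majority of replicas.

    Returns None when no value is shared by more than half of the
    replicas, i.e. when the failure cannot be masked.
    """
    if not outputs:
        return None
    value, count = Counter(outputs).most_common(1)[0]
    return value if count > len(outputs) // 2 else None


# Three replicas, one of which delivers a wrong value: the fault is masked.
result_masked = majority_vote([42, 42, 17])
# Three replicas that all disagree: no majority exists.
result_no_majority = majority_vote([1, 2, 3])
print(result_masked, result_no_majority)
```

With three replicas this masks a single faulty output; it does not by itself handle replicas that fail for a common cause, which is why the slide pairs voting with diversity.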

Goals of Mixed Criticality Scheduling
• Enable certification by different certifying authorities
  – Demonstrate timeliness under different WCETs
  – Timeliness does not imply certification
  – Safety standards mandate redundancy for safety
• Enable efficient utilization of the underlying computing infrastructure
  – Enabling safe sharing of the computing infrastructure
  – Ensuring isolation of critical from less critical tasks

Goals of Mixed Criticality Scheduling (continued)
• Highest level of “protection” for all tasks?

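Mapping Criticalities Based on Fault Coverage
Columns: Transient Faults, Random Faults, Design Faults (Software Faults, Hardware Faults), Byzantine Faults
Rows: High, Medium, Low, Non-critical
[Table: fault classes covered at each criticality level; the cell markings were not preserved apart from two cells labelled “Partially covered”]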

High Criticality Tasks
[Figure: the input is fed to Replica 1, Replica 2, Replica 3, ..., Replica 3b+1; a voter (byzantine fault tolerance) produces the output]
• Dedicated hardware to guarantee isolation
• 3b+1 replicas and a byzantine fault tolerance mechanism to tolerate b byzantine faults
• Hardware and software diversity to protect against design faults
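
The 3b+1 replica count on this slide can be worked through in a few lines. The helper names below are hypothetical; the only rule used is the one stated on the slide, that tolerating b byzantine faults takes 3b+1 replicas.

```python
def replicas_for_byzantine(b):
    """Number of replicas needed to tolerate b byzantine faults (3b + 1)."""
    if b < 0:
        raise ValueError("b must be non-negative")
    return 3 * b + 1


def tolerated_byzantine_faults(n):
    """Largest b such that n replicas satisfy n >= 3b + 1."""
    return max((n - 1) // 3, 0)


print(replicas_for_byzantine(1))      # replicas for one byzantine fault
print(tolerated_byzantine_faults(7))  # faults tolerated by seven replicas
```

The inverse function shows why adding one or two replicas to a 3b+1 configuration does not raise the number of tolerated faults: the next step up needs three more.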

Medium Criticality Tasks
[Figure: Task A and Task B run on high integrity processor 1 and on high integrity processor 2; a voter combines the results into the output]
• High integrity hardware that is shared among medium criticality tasks
• Time triggered scheduling and lock-step execution
• Replication for protection against random faults
• Hardware and software diversity for protection against design faults

Low Criticality Tasks
[Figure: Task A and Task B run on Core 1 (scheduler: EDF) and on Core 2 (scheduler: FPS); job outputs A1, A2 and B2, one job with unfinished execution, feed a time aware voter that produces the output]
• COTS hardware, e.g., a multicore processor, that is shared among low criticality tasks
• Time aware voter and loose synchronization: less development effort
• Replication for protection against random faults
• Software diversity for protection against software design faults
Time aware voter:
• Manages outputs delivered at different instants
• Signals early and late timing errors
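
A time aware voter of the kind described on this slide could look roughly as follows. This is a sketch under stated assumptions, not the authors' design: the acceptance window `[earliest, latest]`, the tuple layout of the replica outputs, the function name, and the choice to take a strict majority among timely outputs only are all hypothetical.

```python
from collections import Counter


def time_aware_vote(outputs, earliest, latest):
    """Vote over replica outputs that arrive at different instants.

    outputs: list of (replica_name, value, delivery_time) tuples.
    Outputs delivered before `earliest` or after `latest` are excluded
    from the vote and reported as "early" or "late" timing errors.
    Returns (voted_value_or_None, timing_errors_by_replica).
    """
    timely = []
    timing_errors = {}
    for name, value, delivery_time in outputs:
        if delivery_time < earliest:
            timing_errors[name] = "early"
        elif delivery_time > latest:
            timing_errors[name] = "late"
        else:
            timely.append(value)
    if not timely:
        return None, timing_errors
    value, count = Counter(timely).most_common(1)[0]
    voted = value if count > len(timely) // 2 else None
    return voted, timing_errors


# Two loosely synchronized cores deliver the same value at different instants.
on_time = time_aware_vote([("core1", 5, 12.0), ("core2", 5, 14.5)], 10.0, 15.0)
# Core 2 delivers after the window closes: flagged as late, core 1 still counts.
one_late = time_aware_vote([("core1", 5, 12.0), ("core2", 5, 16.0)], 10.0, 15.0)
print(on_time, one_late)
```

Because the voter only compares delivery times against a window, the two cores do not need lock-step execution and can even run different schedulers, which matches the loose synchronization noted on the slide.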

Non-Critical Tasks
• Scheduled along with low criticality tasks
• Timeliness is guaranteed in the absence of faults
• Discarded upon failures
• Possibility of using existing MC scheduling algorithms
• Guarantees isolation of higher criticality tasks
• Limited form of redundancy can be provided by exploiting spare processing capacity

Mapping Criticalities Based on Fault Coverage
Criticality    Transient Faults     Random Faults        Software Design Faults   Hardware Design Faults   Byzantine Faults
High           redundancy           redundancy           software diversity       hardware diversity       byzantine fault tolerance
Medium         redundancy           redundancy           software diversity       hardware diversity       -
Low            redundancy           redundancy           software diversity       -                        -
Non-critical   limited redundancy   limited redundancy   -                        -                        -
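
The mapping on this slide can be encoded as a small lookup table, for example to check which fault classes a task of a given criticality is protected against. The dictionary keys and the function names are hypothetical; the cell values are taken from the slide.

```python
# Fault tolerance mechanism per criticality level and fault class.
# A missing fault class means it is not covered at that level.
COVERAGE = {
    "high": {
        "transient": "redundancy",
        "random": "redundancy",
        "software design": "software diversity",
        "hardware design": "hardware diversity",
        "byzantine": "byzantine fault tolerance",
    },
    "medium": {
        "transient": "redundancy",
        "random": "redundancy",
        "software design": "software diversity",
        "hardware design": "hardware diversity",
    },
    "low": {
        "transient": "redundancy",
        "random": "redundancy",
        "software design": "software diversity",
    },
    "non-critical": {
        "transient": "limited redundancy",
        "random": "limited redundancy",
    },
}


def mechanism(criticality, fault_class):
    """Return the mechanism covering fault_class, or None if uncovered."""
    return COVERAGE[criticality].get(fault_class)


def covered_fault_classes(criticality):
    """Return the fault classes covered at the given criticality level."""
    return sorted(COVERAGE[criticality])


print(mechanism("high", "byzantine"))
print(mechanism("low", "hardware design"))
print(covered_fault_classes("non-critical"))
```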

Conclusions
• Approach for the design of mixed criticality systems in the context of permanent faults through:
  – Fault coverage based mapping of criticalities
  – Criticality based provisioning of resources
  – Isolation of higher criticality tasks
  – Implicit coverage of WCET overrun faults
• Future Work
  – Methods for efficient allocation of replicas to processors
  – Consideration of safety analysis in the allocation and scheduling of tasks
  – Providing better-than-average service to non-critical tasks

Thank You! Questions?
