Improving the Quality of Error-Handling Code in Systems Software - PowerPoint PPT Presentation

Improving the Quality of Error-Handling Code in Systems Software using Function-Local Information Suman Saha Laboratoire d'Informatique de Paris 6 Regal 25 th March, 2013

Outline ● Motivation ● Contribution 1: Understanding error-handling code in systems software ● Contribution 2: Improving the structure of error-handling code ● Contribution 3: Finding omission faults in error-handling code ● Future work and Conclusion 2

Motivation Research Questions: 1. Why are the faults in error-handling serious? 2. Why is it difficult to identify them? 3

Reliability of Systems Code Reliability of systems code is critical – Handling transient run-time errors is essential – Cause deadlocks, memory leaks and crashes – Key to ensuring reliability 4

Issues Error-handling code is not tested often – Research has shown there are many faults in error-handling code [Weimer OOPSLA:04 ] – Fixing these faults requires knowing what kind of error-handling code is required 5

Existing work on Error-Handling Code ● Proposing new language features [Bruntink ICSE:06 ] – introducing macros ● Finding faults in Error-Handling Code – focused on error-detection and propagation [Gunawi FAST:08 , Banabic EuroSys:12 ] 6

Error-Handling Code in C Programs Error-Handling code handles exceptions. – Returns the system to a coherent state. param = copy_dev_ioctl(user); … err = validate_dev_ioctl(command, param); if ( err ) goto out ; ... fn = lookup_dev_ioctl(cmd); if ( !fn ) { AUTOFS_WARN(“...”, command); return -ENOTTY ; } … out: free_dev_ioctl (param); return err; 7 Autofs4 code containing a fault

Understanding Error-Handling Code in Systems Software Research Questions: 1. How is error-handling code important for systems software? 2. What are the typical ways to write error-handling code in systems software? 8

Considered Systems Software SL Project Lines of code Version Description 1 Linux drivers 4.6 MLoC 2.6.34 Linux device drivers 2 Linux sound 0.4 MLoC 2.6.34 Linux sound drivers 3 Linux net 0.4 MLoC 2.6.34 Linux networking 4 Linux fs 0.7 MLoC 2.6.34 Linux file systems 5 Wine 2.1 MLoC 1.5.0 Windows emulator 6 PostgreSQL 0.6 MLoC 9.1.3 Database 7 Apache httpd 0.1 MLoC 2.4.1 HTTP server 8 Python 0.4 MLoC 2.7.3 Python runtime 9 Python 0.3 MLoC 3.2.3 Python runtime 10 PHP 0.6 MLoC 5.4.0 PHP runtime 9

Error-Handling in Linux 10

Error-Handling in Python 11

Error-Handling in Systems Code Percentage of code found within functions that have 0 or more blocks of error-handling code. 12

Error-Handling: Basic Strategy A typical, initial way to write error-handling code Basic strategy x = alloc(); … if(!y) { free(x); return -ENOMEM; } ... if(!z) { free(x); return -ENOMEM; } ... 13

Basic Strategy: Problems Problems Basic strategy ● Duplicates code x = alloc(); … if(!y) { free(x); return -ENOMEM; } ... if(!z) { free(x); return -ENOMEM; } ... 14

Basic Strategy: Problems Problems Basic strategy ● Duplicates code x = alloc(); ... ● Obscures what error handling if(!y) { free(x); code to use for new operations return -ENOMEM; } if(!m) { free(x); return -ENOMEM; } ... if(!z) { free(x); return -ENOMEM; 15 } ...

Basic Strategy: Problems Basic strategy Problems x = alloc(); ● Duplicates code … n = alloc(); ● Obscures what error handling ... if(!y) { code to use for new operations free(n); free(x); ● Requires lots of changes when return -ENOMEM; } adding a new operation if(!m) { free(n); free(x); return -ENOMEM; } if(!z) { free(n); free(x); 16 return -ENOMEM; }

Error-Handling: Goto-based Strategy Goto-based strategy ● State-restoring operations x = alloc(); appear in a single labelled ... sequence at the end of the if(!y) goto out; function ... if(!z) goto out; … out: free(x); return -ENOMEM; 17

Goto-based Strategy: Benefits Goto-based strategy ● State-restoring operations x = alloc(); appear in a single labelled ... sequence at the end of the if(!y) goto out; function ... if(!z) ● No code duplication goto out; … out: free(x); return -ENOMEM; 18

Goto-based Strategy: Benefits Goto-based strategy ● State-restoring operations x = alloc(); ... appear in a single labelled if(!y) sequence at the end of the goto out; … function if(!m) goto out; ● No code duplication if(!z) goto out; … out: free(x); return -ENOMEM; 19

Goto-based Strategy: Benefits Goto-based strategy ● State-restoring operations x = alloc(); … appear in a single labelled n = alloc(); sequence at the end of the ... if(!y) function goto out; … ● No code duplication if(!m) goto out; if(!z) goto out; … out: free(n); free(x); 20 return -ENOMEM;

Basic vs Goto-based strategies in Linux 21

Basic vs Goto-based strategies in Python 22

Summary ● The number of functions with error-handling code is increasing version by version ● Many more functions use the basic strategy than use the goto strategy – Leads to a lot of duplicate code – Difficult to maintain – Error prone – Hard to debug 23

Improving the Structure of Error-Handling Code in Systems Software [ LCTES11 ] Three steps: 1. Find error handling code 2. Identify operations for sharing 3. Perform transformation 24

1. Find Error Handling Code  No recognizable error handling abstractions in C code  Heuristics: – An if branch ending in a return – An if branch containing at least one non-debugging function call (something to share) Examples: if(ns->bacct == NULL) { ... if(ns->bacct == NULL) { if(acct == NULL) { ns->bacct = acct; filp_close(file, NULL); acct = NULL; return -ENOMEM; } } } 25

2. Identify Operations for Sharing For each branch 1.Extract the code that is specific to the error condition 2.Extract the code that can be shared with other error handling code (cleanup code) if(x) { 1 ret = -ENOMEM; if(x) { h(“s”); ret = -ENOMEM; goto out; h(“s”); } f(a); return ret; out: } f(a); return ret; 26 2

Reasons for no sharing 1. No state restoring operations if(x) { ret = -ENOMEM; h(“s”); if(x) { goto out; ret = -ENOMEM; } h(“s”); return ret; out: } return ret; 27

Reasons for no sharing 2. Only one branch to f() { transform ... x = allocate(); … y = noallocate(); if(y) { ... free(x); return ret; } } 28

Reasons for no sharing … if(y) { 3. Nothing in common with free(x); other error handling code return ret; } ... if(z) { free(x); return ret; } ... free(x); ... if(l) { action(z); 29 return ret; }

3. Transformation Classify the branches according to how difficult they are to transform 1. Simple 2. Hard 3. Harder 4.Hardest 30

3. Transformation: Simple ● Exactly same code in the branch and the label ● Reduce duplicate code by reusing the existing label ... if (!sl->data) { ... clear_bit(n,sbi-symlink_bitmap); if (!sl->data) unlock_kernel(); goto out; return ret; } … … out: out: clear_bit(n,sbi-symlink_bitmap); clear_bit(n,sbi-symlink_bitmap); unlock_kernel(); unlock_kernel(); return ret; return ret; 31

3. Transformation: Hard ● Code in the branch is a subset of the code in the label ● Reduce duplicate code by creating a new label in existing code ... ... if (!sl->data) { if (!sl->data) unlock_kernel(); goto out1; return ret; } … … out: out: clear_bit(n,sbi-symlink_bitmap); clear_bit(n,sbi-symlink_bitmap); out1: unlock_kernel(); unlock_kernel(); return ret; return ret; 32

3. Transformation: Harder ● Branches do have similar code but no label has ● Reduce duplicate code by creating a new label and moving code to that label ... if (!sl->data) { if (!sl->data) { clear_bit(n,sbi-symlink_bitmap); clear_bit(n,sbi-symlink_bitmap); unlock_kernel(); unlock_kernel(); return ret; return ret; } } ... ... if (!ent) if (!ent) { goto out; kfree(sl->data); … clear_bit(n,sbi-symlink_bitmap); out: unlock_kernel(); kfree(sl->data); return ret; clear_bit(n,sbi-symlink_bitmap); 33 } unlock_kernel(); ... return ret;

3. Transformation: Hardest ● Combination of Simple (common code in branch and label) and Harder (noncommon code in them). ... ... if (!ent){ if (!ent) kfree(sl->data); goto out1; clear_bit(n,sbi-symlink_bitmap); … unlock_kernel(); return 0; return ret; out1: } kfree(sl->data); … out: return 0; clear_bit(n,sbi-symlink_bitmap); out: unlock_kernel(); clear_bit(n,sbi-symlink_bitmap); return ret; unlock_kernel(); 34 return ret;

Results ● Applied to 7 widely used systems including Linux, Python, Apache, PHP and PostgreSQL ● 46% of basic strategy functions have only one if . So, those are not transformed ● 54% of basic strategy functions are taken for transformation – 59% of these are not transformed due to lack of sharing – 41% are transformed 35

Summary We proposed an automatic transformation that converts basic strategy error-handling code to the goto-based strategy – The algorithm identifies many opportunities for code sharing What about possible defects in error-handling code? 36

Finding Omission Faults in Error-Handling Code [PLOS11, DSN13] 37

Improving the Quality of Error-Handling Code in Systems Software - PowerPoint PPT Presentation

Improving the Quality of Error-Handling Code in Systems Software using Function-Local Information Suman Saha Laboratoire d'Informatique de Paris 6 Regal 25 th March, 2013 Outline Motivation Contribution 1: Understanding error-handling

Error Handling in RCMS Error Handling in RCMS An Overview Francesco Lelli

llvm::Error Rich Error Handling in LLVM Error Handling History LLVMs APIs historically

Material Handling Chapter 5 Designing material handling systems Overview of material

Compilers Error Handling Alex Aiken Error Handling Purpose of the compiler is To detect

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

Error Handling Marco Chiarandini Department of Mathematics & Computer Science University of

Principles of Programming CM10227 Lecture D.11: Java: Error Handling & Writing Better Code

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits

Functional Error Handling 1 / 13 Whats right with exceptions? Exceptions provide a way to

Control Exception Handling: Exception handling is the control of error conditions or other

ERROR HANDLING IN COCOA AGENDA Objective-Cs NSError Swifts try/catch/throws Swifts

The Capability of Error Correction for Burst-noise Channels Using Error Estimating Code Yaoyu

Error Codes Correcting Gary Lecture 11 toolkit CMU Preliminaries Setting Error of

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Amelioration Initiative Project Mayumi ISHIZUKA Department of Environmental Veterinary Sciences,

Building a Global Asset Management Firm VINCENT DUHAMEL LUCAS PONTILLO Global President and

EMI Modeling of a 32-bit Microcontroller in Wait Mode Jean-Pierre Leca 1,2 , Nicolas Froidevaux 1

HOL O CAUST IPA: /hlkst/ pronounced [holuhkawst, hohluh] noun 1. a great or

Professor Stuart B. Hill Social Ecology PowerPoint presentation ABSTRACTS All are

QCBFD Water System Update June 2020 Take Home Message RIDOH Absolute Mandate April - May 2020

DAMAGE DETECTION AND AMELIORATION BY ELECTRICAL RESISTANCE FOR SMART COMPOSITES T. J. Swait*, F.

Atelier de Renforcement de Capacits en Gestion de Projet Bonnes Pratiques de Gestion de Projet

Improving the Quality of Error-Handling Code in Systems Software - PowerPoint PPT Presentation

Improving the Quality of Error-Handling Code in Systems Software using Function-Local Information Suman Saha Laboratoire d'Informatique de Paris 6 Regal 25 th March, 2013 Outline Motivation Contribution 1: Understanding error-handling

Error Handling in RCMS Error Handling in RCMS An Overview Francesco Lelli

llvm::Error Rich Error Handling in LLVM Error Handling History LLVMs APIs historically

Material Handling Chapter 5 Designing material handling systems Overview of material

Compilers Error Handling Alex Aiken Error Handling Purpose of the compiler is To detect

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

Error Handling Marco Chiarandini Department of Mathematics &amp; Computer Science University of

Principles of Programming CM10227 Lecture D.11: Java: Error Handling &amp; Writing Better Code

ERROR DETECTON &amp; CORRECTION Error Detection EDC= Error Detection and Correction bits

Functional Error Handling 1 / 13 Whats right with exceptions? Exceptions provide a way to

Control Exception Handling: Exception handling is the control of error conditions or other

ERROR HANDLING IN COCOA AGENDA Objective-Cs NSError Swifts try/catch/throws Swifts

The Capability of Error Correction for Burst-noise Channels Using Error Estimating Code Yaoyu

Error Codes Correcting Gary Lecture 11 toolkit CMU Preliminaries Setting Error of

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Amelioration Initiative Project Mayumi ISHIZUKA Department of Environmental Veterinary Sciences,

Building a Global Asset Management Firm VINCENT DUHAMEL LUCAS PONTILLO Global President and

EMI Modeling of a 32-bit Microcontroller in Wait Mode Jean-Pierre Leca 1,2 , Nicolas Froidevaux 1

HOL O CAUST IPA: /hlkst/ pronounced [holuhkawst, hohluh] noun 1. a great or

Professor Stuart B. Hill Social Ecology PowerPoint presentation ABSTRACTS All are

QCBFD Water System Update June 2020 Take Home Message RIDOH Absolute Mandate April - May 2020

DAMAGE DETECTION AND AMELIORATION BY ELECTRICAL RESISTANCE FOR SMART COMPOSITES T. J. Swait*, F.

Atelier de Renforcement de Capacits en Gestion de Projet Bonnes Pratiques de Gestion de Projet

Error Handling Marco Chiarandini Department of Mathematics & Computer Science University of

Principles of Programming CM10227 Lecture D.11: Java: Error Handling & Writing Better Code

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits