AUTOMATIC PROGRAM REPAIR USING GENETIC PROGRAMMING CLAIRE LE - PowerPoint PPT Presentation

AUTOMATIC PROGRAM REPAIR USING GENETIC PROGRAMMING CLAIRE LE GOUES APRIL 22, 2013 1 http://www.clairelegoues.com

GENPROG STOCHASTIC SEARCH + TEST CASE GUIDANCE = AUTOMATIC, EXPRESSIVE, SCALABLE PATCH GENERATION 2 Claire Le Goues http://www.clairelegoues.com

“Everyday, almost 300 Annual cost of bugs appear […] far too many for only the Mozilla software errors in the programmers to handle.” US: $59.5 billion – Mozilla Developer, (0.6% of GDP). 2005 PROBLEM: BUGGY SOFTWARE 10%: Everything Else Average time to fix a security-critical error: 28 days. 90%: Maintenance 3 Claire Le Goues http://www.clairelegoues.com

SOLUTION: AUTOMATE 4 Claire Le Goues http://www.clairelegoues.com

PRIOR ART Self-healing systems, security research: runtime monitors, repair strategies, error preemption. • Designed to address particular types of bugs, (e.g., buffer overruns). • Very successful in that domain (e.g., data execution prevention shipping with Windows 7). But what about generic repair of new real-world bugs as they come in? 5 Claire Le Goues http://www.clairelegoues.com

HOW DO HUMANS FIX NEW BUGS? 6 Claire Le Goues http://www.clairelegoues.com

??! � NOW WHAT? 7 Claire Le Goues http://www.clairelegoues.com

printf transformer 8 Claire Le Goues http://www.clairelegoues.com

Input: 1 2 3 4 5 6 7 Legend: � Likely faulty. probability 8 9 10 � Maybe faulty. probability � Not faulty. 11 12 9 Claire Le Goues http://www.clairelegoues.com

SECRET SAUCES • Test cases are useful. • Existing program • Existing program behavior contains the code and behavior seeds of many contains repairs. • The space of program patches can be searched. 10 Claire Le Goues http://www.clairelegoues.com

THESIS Stochastic search, guided by existing test cases ( GE GENP NPROG OG ), can provide a • scalable • expressive • human competitive …approach for the automated repair of: • many types of defects • in many types of real-world programs. 11 Claire Le Goues http://www.clairelegoues.com

OUTLINE GenProg: automatic program repair using genetic programming. Four overarching hypotheses. Empirical evaluations of: • Expressive power. • Scalability. Contributions/concluding thoughts. 12 Claire Le Goues http://www.clairelegoues.com

APPROACH Given a program and a set of test cases, conduct a biased, random search for a set of edits to a program that fixes a given bug. 13 Claire Le Goues http://www.clairelegoues.com

GENETIC PROGRAMMING: the application of evolutionary or genetic algorithms to program source code. 14 Claire Le Goues http://www.clairelegoues.com

GENETIC PROGRAMMING Population of variants. Fitness function evaluates desirability. Desirable individuals are more likely to be selected for iteration and reproduction. New variants created via: • Mutation • Crossover ABCDEF  ABADEF ABCDEF ABCWVU � ZYXWVU ZYXDEF � 15 Claire Le Goues http://www.clairelegoues.com

CHALLENGES The search is through the space of candidate patches or sets of changes to the input program. Two concerns: 1. Scalability – management, reduction, and traversal of the search space. 2. Correctness – proposed repair should fix the bug while maintaining other required functionality. 16 Claire Le Goues http://www.clairelegoues.com

INSIGHTS Explore coarse-grained edits at the statement level of the abstract syntax tree ([delete; replace; insert]). Use existing test suites as proxies for correctness specifications, and to reduce the search space. • Evaluate intermediate solutions. • Localize the fault, focusing candidate changes. Leverage existing code and behavior. • Do not invent new code; copy code from elsewhere in the same program. 17 Claire Le Goues http://www.clairelegoues.com

INPUT EVALUATE FITNESS DISCARD ACCEPT OUTPUT MUTATE 18 Claire Le Goues

> gcd(4,2) � 1 void gcd(int a, int b) { � 2 if (a == 0) { � > 2 � 3 printf(“%d”, b); � > gcd(0,55) � 4 } � > 55 � 5 while (b > 0) { � 6 if (a > b) � 7 a = a – b; � (looping forever) 8 else � 9 b = b – a; � 10 } � 11 printf(“%d”, a); � 12 return; � 13 } � 22 Claire Le Goues http://www.clairelegoues.com

(a=0; b=55) � 1 void gcd(int a, int b) { � true � 2 if (a == 0) { � > 55 � 3 printf(“%d”, b); � 4 } � (a=0; b=55) true � 5 while (b > 0) { � false � 6 if (a > b) � ! � � 7 a = a – b; � � 8 else � b = 55 - 0 � 9 b = b – a; � 10 } � 11 printf(“%d”, a); � 12 return; � 13 } � 23 Claire Le Goues http://www.clairelegoues.com

Input: {block} � while if(a==0) � printf(a) � return � (b>0) � {block} � {block} � {block} � printf(b) � if(a>b) � {block} � {block} � a = a – b b = b – a 24 Claire Le Goues http://www.clairelegoues.com

Input: {block} � while if(a==0) � printf(a) � return � (b>0) � Legend: {block} � {block} � {block} � � High change probability. printf(b) � if(a>b) � � Low change probability. {block} � {block} � � Not changed. a = a – b b = b – a 25 Claire Le Goues http://www.clairelegoues.com

Input: {block} � while if(a==0) � printf(a) � return � (b>0) � An edit is: {block} � {block} � {block} � • Insert statement X after statement Y printf(b) � if(a>b) � • Replace statement X with statement Y • Delete statement X {block} � {block} � a = a – b b = b – a 26 Claire Le Goues http://www.clairelegoues.com

Input: {block} � while if(a==0) � printf(a) � return � (b>0) � An edit is: {block} � {block} � {block} � • Insert statement X after statement Y printf(b) � if(a>b) � • Replace statement X with statement Y • Delete statement X {block} � {block} � a = a – b b = b – a 27 Claire Le Goues http://www.clairelegoues.com

Input: {block} � while if(a==0) � printf(a) � return � (b>0) � An edit is: {block} � {block} � {block} � • Insert statement X after statement Y printf(b) � if(a>b) � • Replace statement X with statement Y • Delete statement X {block} � {block} � return � a = a – b b = b – a 28 Claire Le Goues http://www.clairelegoues.com

OUTLINE GenProg: automatic program repair using genetic programming. Four overarching hypotheses. Empirical evaluations of: • Expressive power. • Scalability Contributions/concluding thoughts. 30 Claire Le Goues http://www.clairelegoues.com

HUMAN-COMPETITIVE REPAIR Goal: an automatic solution to alleviate a portion of the bug repair burden. Should be competitive with the humans its designed to help. Humans can: • Fix many different kinds of bugs in many different kinds of programs. [expressive power] • Fix bugs in large systems. [scalability] • Produce acceptable patches. [repair quality] 31 Claire Le Goues http://www.clairelegoues.com

HYPOTHESES Without defect- or program- specific information, GenProg can: 1. repair at least 5 different defect types, and can repair defects in at least least 10 different program types. 2. repair at least 50% of defects that humans developers fix in practice. 3. repair bugs in large programs of up to several million lines of code, and associated with up to several thousand test cases, at a time and economic cost that is human competitive. 4. produce patches that maintain existing program functionality; do not introduce new vulnerabilities; and address the underlying cause of a vulnerability. 32 Claire Le Goues http://www.clairelegoues.com

Program Description LOC Bug Type gcd example 22 infinite loop nullhttpd webserver 5575 heap buffer overflow (code) zune example 28 infinite loop uniq text processing 1146 segmentation fault look-u dictionary lookup 1169 segmentation fault look-s dictionary lookup 1363 infinite loop units metric conversion 1504 segmentation fault deroff document processing 2236 segmentation fault indent code processing 9906 infinite loop flex lexical analyzer generator 18774 segmentation fault openldap directory protocol 292598 non-overflow denial of service ccrypt encryption utility 7515 segmentation fault lighttpd webserver 51895 heap buffer overflow (vars) atris graphical game 21553 local stack buffer exploit php scripting language 764489 integer overflow wu-ftpd FTP server 67029 format string vulnerability 33 Claire Le Goues http://www.clairelegoues.com

AUTOMATIC PROGRAM REPAIR USING GENETIC PROGRAMMING CLAIRE LE - PowerPoint PPT Presentation

AUTOMATIC PROGRAM REPAIR USING GENETIC PROGRAMMING CLAIRE LE GOUES APRIL 22, 2013 1 http://www.clairelegoues.com GENPROG STOCHASTIC SEARCH + TEST CASE GUIDANCE = AUTOMATIC, EXPRESSIVE, SCALABLE PATCH GENERATION 2 Claire Le Goues

1 2 Genetic Program Genetic Program Parameter 3 Genetic Program Genetic Program 4 Softcoding

Genetic Programming What is it? Genetic Programming Genetic programming (GP) is an

Model Repair for Markov Decision Model Repair for Markov Decision Model Repair for Markov

The Role of Endovascular The Role of Endovascular Repair Repair Repair Repair John Rose John

Do Automated Program Repair Techniques Repair Hard and Important Bugs? Manish Motwani Sandhya

New Approaches to New Approaches to New Approaches to Repair of Repair of Repair of Spinal

Tissue Repair Kristine Krafts, M.D. Tissue Repair Lecture Objectives Define tissue repair,

Laparoscopic Laparoscopic Ventral vs. Open Hernia Repair Ventral vs. Open Hernia Repair Hernia

ASTOR: A PROGRAM REPAIR LIBRARY FOR JAVA Matias Martinez University of Valenciennes, France

Design of Repair Operators for Automated Program Repair Shin Hwei Tan National University of

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Genetic.io Genetic Algorithms in all their shapes and forms ! Genetic.io Make something of your

Germ- -line Genetic Therapy line Genetic Therapy Germ Munson- -Davis Look Bravely at a Davis

Graph-based, Self-Supervised Program Repair from Diagnostic Feedback ICML 2020 Michihiro

SFH Program 504 Home Repair Program: This program provides loans to very low income homeowners

FY 2021 Highway-Railroad Crossing Surface Repair Program September 9, 2019 2021 Highway-Railroad

B. Religious and cultural developments 1. After the Babylonian captivity, many Jews turned from

L2_PythonCrashCourse August 17, 2017 1 Lecture 2: Python Crash Course CSCI 4360/6360: Data

How to Speak at a Conference Beth Tucker Long - @e3betht Who am I? Beth Tucker Long PHP

#SundayGathering with Shaykh Omar Khan Series 2: Al-Malaa ikah The Angels 14/04/19

Writing a 3 rd or 4 th Year Project Report Dr Maggie Charles, Language Centre 1 Outline of the

Keith Cook Director of Studies (International) Edge Hill University Business School

Mathematics Graduation Spiel 2007 aka . . . Mathematics Graduation Spiel 2007 aka . . .

Outline 0) Course Info 1) Introduction 2) Data Preparation and Cleaning 3) Schema matching and