S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, - PowerPoint PPT Presentation

S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, Jan 2014 H.D.T. Nguyen, Dawei Qi, Abhik Roychoudhury National University of Singapore , & Satish Chandra Samsung 1 Talk given at 30 th CREST Workshop, London, Jan 2014.

W HAT WE HAVE BEEN DISCUSSING Precise debugging is laborious. CREST Workshop, Jan 2014 Specification based repair, Genetic Programming, … Symbolic execution of test cases to extract specifications 2

Test–suite Failing tests T HIS WORK … Suspicions !! – statistical fault localization. CREST Workshop, Jan 2014 Infer intended meaning of suspicious statements - Symbolic execution (SE) Solve constraint from SE to create fixed statement - Program synthesis 3

0. T HE PROBLEM 1 int is_upward( int inhibit, int up_sep, int down_sep){ 2 int bias; 3 if (inhibit) CREST Workshop, Jan 2014 4 bias = down_sep; // bias= up_sep + 100 5 else bias = up_sep ; 6 if (bias > down_sep) 7 return 1; 8 else return 0; 9 } inhibit up_sep down_sep Observed Expected Result output Output 1 0 100 0 0 pass 1 11 110 0 1 fail 0 100 50 1 1 pass 4 1 -20 60 0 1 fail 0 0 10 0 0 pass

1. F IND A S USPECT 1 int is_upward( int inhibit, int up_sep, int down_sep){ 2 int bias; 3 if (inhibit) 4 bias = down_sep; // bias= up_sep + 100 CREST Workshop, Jan 2014 5 else bias = up_sep ; 6 if (bias > down_sep) 7 return 1; 8 else return 0; 9 } Line Score Rank 4 0.75 1 8 0.6 2 3 0.5 3 6 0.5 3 5 0 5 5 7 0 5

2 W HAT IT SHOULD HAVE BEEN 1 int is_upward( int inhibit, int up_sep, int down_sep){ 2 int bias; 3 if (inhibit) 4 bias = down_sep; // bias= up_sep + 100 5 else bias = up_sep ; 6 if (bias > down_sep) CREST Workshop, Jan 2014 7 return 1; 8 else return 0; 9 } inhibit up_sep down_sep Observed Expected Result output Output 1 11 110 0 1 fail Line 4 inhibit = 1, up_sep = 11, down_sep = 110 bias = X, path condition = true Line 7 Line 8 6 inhibit = 1, up_sep = 11, down_sep = 110 inhibit = 1, up_sep = 11, down_sep = 110 bias = X, path condition = X ≤ 110 bias = X, path condition = X> 110

2. W HAT IT SHOULD HAVE BEEN Inhibit up_sep down_sep == 1 == 11 == 110 1 int is_upward( int inhibit, int up_sep, int CREST Workshop, Jan 2014 down_sep){ 2 int bias; 3 if (inhibit) 4 bias = f(inhibit, up_sep, down_sep) 5 else bias = up_sep ; 6 if (bias > down_sep) 7 return 1; 8 else return 0; 9 } Symbolic Execution 7 f(1,11,110) > 110

3. F IX THE SUSPECT  Accumulated constraints  f(1,11, 110) > 110 ∧ CREST Workshop, Jan 2014  f(1,0,100) ≤ 100 ∧  …  Find a f satisfying this constraint  By fixing the set of operators appearing in f  Candidate methods  Search over the space of expressions  Program synthesis with fixed set of operators  More efficient!!  Generated fix 8  f(inhibit,up_sep,down_sep) = up_sep + 100

T O R ECAPITULATE  Ranked Bug report  Hypothesize the error causes – suspect CREST Workshop, Jan 2014  Symbolic execution  Specification of the suspicious statement  Input-output requirements from each test  Repair constraint  Program synthesis  Decide operators which can appear in the fix  Generate a fix by solving repair constraint. 9

P RODUCING R ANKED B UG REPORT  We use the Tarantula toolkit.  Given a test-suite T CREST Workshop, Jan 2014 fail(s) allfail Score(s) = fail(s) pass(s) + allfail allpass  fail(s) ≡ # of failing executions in which s occurs  pass(s) ≡ # of passing executions in which s occurs  allfail ≡ Total # of failing executions  allpass ≡ Total # of passing executions  allfail + allpass = |T|  Can also use other metric like Ochiai. 10

U SAGE OF R ANKED B UG REPORT Buggy Program CREST Workshop, Jan 2014 Test Suite NO -Investigate what this statement should be. - Generate a fixed statement YES 11 Fixed Program

T O R ECAPITULATE  Ranked Bug report  Hypothesize the error causes – suspect CREST Workshop, Jan 2014  Symbolic execution  Specification of the suspicious statement  Input-output requirements from each test  Repair constraint  Program synthesis  Decide operators which can appear in the fix  Generate a fixed statement by solving repair constraint. 12

W HAT IT SHOULD HAVE BEEN Concrete test input CREST Workshop, Jan 2014 Concrete Execution … var = a + b – c; x Buggy Program Symbolic Execution with x as the only unknown Path conditions, 13 Output Expressions

E XAMPLE Inhibit == 1 up_sep == 11 down_sep == 110 1 int is_upward( int inhibit, int up_sep, int down_sep){ CREST Workshop, Jan 2014 2 int bias; 3 if (inhibit) 4 bias = f(inhibit, up_sep, down_sep) // X 5 else bias = up_sep ; 6 if (bias > down_sep) 7 return 1; 8 else return 0; Symbolic Execution 9 } ∨ ( pc j ∧ out j == expected_out(t) ) ( (X >110 ∧ 1 ==1) j ∈ Paths ∨ (X ≤ 110 ∧ 0 == 1) ∧ ) ∧ f(1,11,110) == X f(t) == X 14 14 Repair constraint

O VERALL R EPAIR C ONSTRAINT t1 t2 Repair constraint = ∧ Cons i CREST Workshop, Jan 2014 TS … var = … ; 1. TS = failing tests; 2. Repair based on TS // guaranteed to pass TS 3. New = newly failed tests due to repair 4. If (New == φ ) exit // Got your repair 5. else { TS = TS ∪ New; 6. Go to 2 } Cons1 Cons2 15 Repair Constraint = Cons1 ∧ Cons2

T O R ECAPITULATE  Ranked Bug report  Hypothesize the error causes – suspect CREST Workshop, Jan 2014  Symbolic execution  Specification of the suspicious statement  Input-output requirements from each test  Repair constraint  Program synthesis  Decide operators which can appear in the fix  Generate a fix by solving repair constraint. 16

W HY P ROGRAM SYNTHESIS Repair Constraint:  Instead of solving f(1,11,110) > 110 ∧ f(1,0,100) ≤ 100 ∧ f(1,-20,60) > 60 CREST Workshop, Jan 2014  Select primitive components to be used by the synthesized program based on complexity  Look for a program that uses only these primitive components and satisfy the repair constraint  Where to place each component? int tmp=down_sep + 1; + int tmp = down_sep -1; return tmp- inhibit; return up_sep + tmp; +  What are the parameters? int tmp = down_sep -1; int tmp = down_sep - 1; 17 return tmp + inhibit ; inhibit return tmp + inhibit ; up_sep

L OCATION V ARIABLES  Define location variables for each component  Constraint on location variables solved by SMT.  Well-formed e.g. defined before being used CREST Workshop, Jan 2014  Output constraint from each test (repair constraint)  Meaning of the components  Lines determine the value L x == L y ⇒ x == y  Once locations are found, program is constructed. Components = {+} L in == 0, L out == 1, L out+ == 1, L in1+ == 0, L in2+ == 0 0 r0 = input; 1 r = r0 + r0; 2 return r; 18

E VALUATION  Results from  SIR and GNU CoreUtils CREST Workshop, Jan 2014  Tools  Ranked Bug report (Tarantula)  Symbolic execution (KLEE)  Program synthesis (Own tool + Z3) 19

S UBJECTS U SED SIR programs Subject LoC # Versions Description TCAS 135 41 Air Traffic Control Schedule 304 9 Process scheduler CREST Workshop, Jan 2014 Schedule2 262 9 Process scheduler Replace 518 29 Text processing Grep 9366 2 Text search engine GNU CoreUtils Subject LoC mknod 183 mkdir 159 mkfifo 107 20 cp 2272

S UCCESS OF REPAIR (SIR) 45 Time bound = 4 mins. 40 35 CREST Workshop, Jan 2014 # of programs repaired 30 Total 25 Semfix 20 TCAS GenProg 15 10 5 0 Number of tests 10 20 30 40 50 Overall 90 programs from SIR 21 SemFix repaired 48/90, GenProg repaired 16/90 for 50 tests. GenProg running time is >3 times of SemFix

T YPE OF B UGS (SIR) Total SemFix GenProg CREST Workshop, Jan 2014 Constant 14 10 3 Arithmetic 14 6 0 Comparison 16 12 5 Logic 10 10 3 Code 27 5 3 Missing Redundant 9 5 2 Code ALL 90 48 16 22

GNU C ORE U TILS  9 buggy programs where bug could be reproduced.  Taken from paper on KLEE, OSDI 2008. CREST Workshop, Jan 2014  SemFix succeeded in 4/9 [mkdir, cp, …]  Average time = 3.8 mins.  Average time = 6 mins. [GenProg]  All GenProg experiments using configuration from ICSE 2012 paper by Le Goues et al.  Pop size, # generations, …  Other configurations may lead to success for GP, but then we need a systematic method to determine the configurations. 23

E XPRESSION ENUMERATION  Enumerate all expressions over a given set of components (i.e. operators)  Enforce axioms of the operators CREST Workshop, Jan 2014  If candidate repair contains a constant, solve using SMT  Program synthesis turns out to be faster. Subject TCAS Schedul Schedule replace grep e 2 Ratio 6.9 2.8 2.5 1.36 2.2 24 Enumeration also timed out > 20 minutes. These are not even included .

S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, - PowerPoint PPT Presentation

S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, Jan 2014 H.D.T. Nguyen, Dawei Qi, Abhik Roychoudhury National University of Singapore , & Satish Chandra Samsung 1 Talk given at 30 th CREST Workshop, London, Jan 2014. W

S emantic Web Architecture Vitaly Vlasov inxaoc@ gmail.com Agenda 1. About S emantic Web,

H OW TO R EPAIR V ULNERABILITIES ? Correcting vulnerable logic, e.g. race condition

C LASSIFICATION T REE A NALYSIS : A U SEFUL S TATISTICAL T OOL FOR P ROGRAM E VALUATORS Meredith

R ECAP | P RELIMINARY D ESIGN P ROGRAM (PDP) P RELIMINARY E VALUATION E DUCATIONAL P ROGRAM E

CAMP CAMP ( C C ollege ollege A A ssistance ssistance M M igrant igrant P P rogram) rogram) (

1 O ral O ral Health Health P rogram P rogram to to E ngage E ngage N on N on- - Dental

MORE stands for M aintenance O ptimization R epair E ngineering W e offer high-quality kiln

SMART S tony-brook M illstone: A dvocate, R epair, T ransform Carolyn Wagner, Anna Pimenta, Kate

S EMANTIC -B ASED M ULTILINGUAL D OCUMENT C LUSTERING VIA T ENSOR M ODELING Salvatore Romeo 1 ,

A NALYSIS W HAT IS IT ? Created & Exclusively Owned by: Impact Branding Consulting, Inc

T ACOMA M IXED U SE C ENTERS F EASIBILITY A NALYSIS P REPARED BY P ROPERTY C OUNSELORS M AY 2015 I

T YPE -G UIDED W ORST -C ASE I NPUT G ENERATION Di Wang , Jan Hoffmann Carnegie Mellon

Messengers Program Presentation Guide August 15, 2016 H OCKEY C ANADA I NITIATION P ROGRAM

S emantic A utomated D iscovery and I ntegration http://sadiframework.org Summary SADI is a

WAVES B IG D ATA P LATFORM FOR R EAL - TIME S EMANTIC S TREAM M ANAGEMENT WAVES ATOS SE OUTLINE

Natural S emantics Based Tools for S emantic Web with Application to Product Models CUGS

POSIX IPC: Overview primitive POSIX function description message queues create or access

Public-Key Infrastructure David Basin, Cas Cremers, Tiffany Hyun-Jin Kim, Adrian Perrig, Ralf

Lecture 6 18 September 2018 Admin Matters Common C Mistakes Unit 13: Call Stack Unit 14: Pointer

IE 507, SEM/ HUMAN-CENTERED DESIGN An Introduction and Overview 1 Human-Centered Design

Informatics and Statistics Department Informatics and Statistics Department A SMS Tool for Alerts

Strategic Enrollment Management Retreat (with notes) January 31, 2020 What is Strategic

Mithridates: Peering into the Future with Idle Cores Earl T. Barr Mark Gabel David J.

Individualized, & Data-Informed Networks of Support NACADA Region 1 Conference, March 2018

S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, - PowerPoint PPT Presentation

S EM F IX : P ROGRAM R EPAIR VIA S EMANTIC A NALYSIS CREST Workshop, Jan 2014 H.D.T. Nguyen, Dawei Qi, Abhik Roychoudhury National University of Singapore , & Satish Chandra Samsung 1 Talk given at 30 th CREST Workshop, London, Jan 2014. W

S emantic Web Architecture Vitaly Vlasov inxaoc@ gmail.com Agenda 1. About S emantic Web,

H OW TO R EPAIR V ULNERABILITIES ? Correcting vulnerable logic, e.g. race condition

C LASSIFICATION T REE A NALYSIS : A U SEFUL S TATISTICAL T OOL FOR P ROGRAM E VALUATORS Meredith

R ECAP | P RELIMINARY D ESIGN P ROGRAM (PDP) P RELIMINARY E VALUATION E DUCATIONAL P ROGRAM E

CAMP CAMP ( C C ollege ollege A A ssistance ssistance M M igrant igrant P P rogram) rogram) (

1 O ral O ral Health Health P rogram P rogram to to E ngage E ngage N on N on- - Dental

MORE stands for M aintenance O ptimization R epair E ngineering W e offer high-quality kiln

SMART S tony-brook M illstone: A dvocate, R epair, T ransform Carolyn Wagner, Anna Pimenta, Kate

S EMANTIC -B ASED M ULTILINGUAL D OCUMENT C LUSTERING VIA T ENSOR M ODELING Salvatore Romeo 1 ,

A NALYSIS W HAT IS IT ? Created &amp; Exclusively Owned by: Impact Branding Consulting, Inc

T ACOMA M IXED U SE C ENTERS F EASIBILITY A NALYSIS P REPARED BY P ROPERTY C OUNSELORS M AY 2015 I

T YPE -G UIDED W ORST -C ASE I NPUT G ENERATION Di Wang , Jan Hoffmann Carnegie Mellon

Messengers Program Presentation Guide August 15, 2016 H OCKEY C ANADA I NITIATION P ROGRAM

S emantic A utomated D iscovery and I ntegration http://sadiframework.org Summary SADI is a

WAVES B IG D ATA P LATFORM FOR R EAL - TIME S EMANTIC S TREAM M ANAGEMENT WAVES ATOS SE OUTLINE

Natural S emantics Based Tools for S emantic Web with Application to Product Models CUGS

POSIX IPC: Overview primitive POSIX function description message queues create or access

Public-Key Infrastructure David Basin, Cas Cremers, Tiffany Hyun-Jin Kim, Adrian Perrig, Ralf

Lecture 6 18 September 2018 Admin Matters Common C Mistakes Unit 13: Call Stack Unit 14: Pointer

IE 507, SEM/ HUMAN-CENTERED DESIGN An Introduction and Overview 1 Human-Centered Design

Informatics and Statistics Department Informatics and Statistics Department A SMS Tool for Alerts

Strategic Enrollment Management Retreat (with notes) January 31, 2020 What is Strategic

Mithridates: Peering into the Future with Idle Cores Earl T. Barr Mark Gabel David J.

Individualized, &amp; Data-Informed Networks of Support NACADA Region 1 Conference, March 2018

A NALYSIS W HAT IS IT ? Created & Exclusively Owned by: Impact Branding Consulting, Inc

Individualized, & Data-Informed Networks of Support NACADA Region 1 Conference, March 2018