Checking Inside the Black Box: Regression Testing Based on Value Spectra Differences



SLIDE 1

Checking Inside the Black Box: Regression Testing Based on Value Spectra Differences

Tao Xie David Notkin

  • Dept. of Computer Science & Engineering, University of Washington, Seattle

12 Sept. 2004

ICSM 2004, Chicago

SLIDE 2

Synopsis

  • Goal: Detect and understand behavior deviations inside the black box
  • Context: Traditional regression testing strongly focuses on black-box comparison of program outputs
  • Problem: Behavior deviations (behavior differences) might not propagate to observable outputs

SLIDE 3

Approaches

Existing approaches:

  • Fault propagation models [Thompson et al. 93, Voas 92]
  • Structural program spectra, e.g. branch, path [Ball&Larus 96, Reps et al. 97, Harrold et al. 00]

Our new approach:

  • Compute value spectra from a program execution
    • characterize program states of user functions
  • Compare value spectra from two versions
    • detect and understand deviation propagation
SLIDE 4

Outline

  • Value Spectra
  • Value Spectra Comparison
  • Experiment
  • Conclusion
SLIDE 5

Example Program

int main(int argc, char *argv[]) {
    int i, j;
    if (argc != 3) {
        printf("Wrong arguments!");
        return 1;
    }
    i = atoi(argv[1]);
    j = atoi(argv[2]);
    if (max(i, j) >= 0) {
        if (max(i, j) == 0) {
            printf("0");
        } else {
            printf("1");
        }
    } else {
        printf("-1");
    }
    return 0;
}
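The slide does not show the definition of max. A plausible definition, consistent with the state max(entry(0, 1), exit(0, 1, 1)) shown on later slides, though it never appears on the slides themselves:

```c
/* Hypothetical definition of the max helper called by the example;
   consistent with max(entry(0, 1), exit(0, 1, 1)) on later slides. */
int max(int a, int b) {
    return a > b ? a : b;
}
```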

SLIDE 6

Example Program

int main(int argc, char *argv[]) {
    int i, j;
    if (argc != 3) {
        printf("Wrong arguments!");
        return 1;
    }
    i = atoi(argv[1]);
    j = atoi(argv[2]);
    if (max(i, j) >= 0) {
        if (max(i, j) == 0) {
            printf("0");
        } else {
            printf("1");
        }
    } else {
        printf("-1");
    }
    return 0;
}

Program black-box input: “0 1”
Program black-box output: “1”

[Figure: dynamic call tree; main calls max twice]

SLIDE 7

Dynamic Call Tree

[Figure: dynamic call tree; main calls max twice]

SLIDE 8

Dynamic Call Tree

[Figure: dynamic call tree annotated with the main-entry state: argc = 3, argv[1] = “0”, argv[2] = “1”]

SLIDE 9

Dynamic Call Tree

[Figure: dynamic call tree annotated with the main-entry state (argc = 3, argv[1] = “0”, argv[2] = “1”) and the main-exit state (argc = 3, argv[1] = “0”, argv[2] = “1”, ret = 0)]

SLIDE 10

Dynamic Call Tree

[Figure: the same annotated call tree, with the main-entry and main-exit states written in textual form]

main(entry(3, “0”, “1”), exit(3, “0”, “1”, 0))

SLIDE 11

Dynamic Call Tree

[Figure: dynamic call tree annotated with entry and exit states for all executions; each max execution has entry state (a = 0, b = 1) and exit state (a = 0, b = 1, ret = 1)]

max(entry(0, 1), exit(0, 1, 1))

SLIDE 12

Value Spectra

  • Value hit spectra
    • main(entry(3, “0”, “1”), exit(3, “0”, “1”, 0))
    • max(entry(0, 1), exit(0, 1, 1))
  • Value count spectra
    • include additional count information
  • Value trace spectra
    • include additional sequence order information
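A value hit spectrum can be viewed as the set of distinct function-execution records observed in a run. A minimal C sketch of that idea; all type names, buffer sizes, and function names here are illustrative, not taken from the authors' tool:

```c
#include <stdio.h>
#include <string.h>

/* One record per function execution: the function name plus its
   serialized entry and exit states. */
typedef struct {
    char func[32];
    char entry[64];
    char exit_[64];
} Record;

/* A value hit spectrum keeps each distinct record exactly once. */
typedef struct {
    Record recs[128];
    int n;
} Spectrum;

static int same(const Record *a, const Record *b) {
    return strcmp(a->func, b->func) == 0 &&
           strcmp(a->entry, b->entry) == 0 &&
           strcmp(a->exit_, b->exit_) == 0;
}

/* Add a record; duplicates are ignored, since hit spectra carry no
   count or ordering information. */
void spectrum_add(Spectrum *s, const char *f, const char *en, const char *ex) {
    Record r;
    snprintf(r.func, sizeof r.func, "%s", f);
    snprintf(r.entry, sizeof r.entry, "%s", en);
    snprintf(r.exit_, sizeof r.exit_, "%s", ex);
    for (int i = 0; i < s->n; i++)
        if (same(&s->recs[i], &r)) return;
    s->recs[s->n++] = r;
}
```

A value count spectrum would instead increment a per-record counter, and a value trace spectrum would keep the records in execution order.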
SLIDE 13

Outline

  • Value Spectra
  • Value Spectra Comparison
  • Experiment
  • Conclusion
SLIDE 14

Spectra Comparison

  • State linearization to compare the values of pointer-type variables [Xie, Marinov, Notkin ASE 04]
  • Function execution comparison

[Figure: for each test, a function execution in the old version is matched with one in the new version, and their entry states and exit states are each checked for equality]
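State linearization can be sketched as serializing a reachable structure in a fixed traversal order so that raw pointer addresses never appear in the result; two isomorphic states then serialize identically. A minimal sketch for a singly linked list (illustrative only; the ASE 04 technique handles general heap structures and cycles):

```c
#include <stdio.h>   /* snprintf */
#include <string.h>  /* strcmp, for comparing serialized states */

/* A heap cell for the sketch: an int payload plus a next pointer. */
typedef struct Cell { int val; struct Cell *next; } Cell;

/* Serialize a list as "val@idx;" tokens, replacing each pointer with
   its visit index; ignores cycles for brevity. Returns chars written. */
int linearize(const Cell *c, char *buf, int bufsize, int maxcells) {
    int pos = 0;
    for (int idx = 0; c != NULL && idx < maxcells; c = c->next, idx++)
        pos += snprintf(buf + pos, (size_t)(bufsize - pos),
                        "%d@%d;", c->val, idx);
    return pos;
}
```

Two lists holding the same values at different addresses linearize to the same string, so the corresponding entry or exit states compare equal across versions.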

SLIDE 15

Understanding Deviations

[Figure: old-version and new-version executions compared entry state to entry state and exit state to exit state]

  • Deviation container: entry states are equal (=) but exit states differ (≠); the deviation originates inside this execution
  • Deviation follower: entry states already differ; the deviation propagated into this execution from elsewhere
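These two categories can be sketched as a classification over serialized entry and exit states. A sketch under the assumption that states have already been serialized to strings; this is not the authors' code:

```c
#include <string.h>

/* Classification of one function execution across two versions. */
typedef enum { NONDEVIATED, DEV_CONTAINER, DEV_FOLLOWER } Kind;

/* Compare serialized entry/exit states of the old and new versions. */
Kind classify(const char *old_entry, const char *old_exit,
              const char *new_entry, const char *new_exit) {
    int entry_eq = strcmp(old_entry, new_entry) == 0;
    int exit_eq  = strcmp(old_exit,  new_exit)  == 0;
    if (entry_eq && exit_eq) return NONDEVIATED;
    if (entry_eq)            return DEV_CONTAINER; /* deviation starts here */
    return DEV_FOLLOWER;                           /* deviation flowed in */
}
```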

SLIDE 16

Understanding Deviation Propagations

  • Understand where the deviations start and how they are propagated
  • Deviation root
    • a program change that triggers specific behavior deviations
  • Deviation-root localization
    • two heuristics
SLIDE 17

Heuristic 1: Earliest dev follower’s preceded area

[Figure: executions in the call tree are marked nondeviated, deviation follower, or deviation container; the deviation root is located in the area preceding the earliest deviation follower]
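Given the function executions in time order with deviation-follower flags, heuristic 1 searches for the root among the executions before the earliest follower. A trivial sketch with illustrative names:

```c
/* Return the index of the earliest deviation follower, or -1 if none;
   heuristic 1 looks for the deviation root before this index. */
int earliest_follower(const int *is_follower, int n) {
    for (int i = 0; i < n; i++)
        if (is_follower[i]) return i;
    return -1;
}
```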

SLIDE 18

Deviation Follower Example

main
___initialize
___alt_sep_test
___Non_Crossing_Biased_Climb
___Inhibit_Biased_Climb
___Own_Above_Threat
___Non_Crossing_Biased_Descend
___Inhibit_Biased_Climb
___Own_Below_Threat ------- [dev follower]
___ALIM ------------------- [dev follower]
___Own_Above_Threat

The 58th test on the 9th faulty version of the tcas program

[Hutchins et al. 94]

SLIDE 19

Heuristic 2: Innermost dev container’s enclosed area

[Figure: the deviation root lies within the area enclosed by the innermost deviation container(s) in the call tree]
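Heuristic 2 can be sketched as finding the innermost deviation containers in the dynamic call tree, i.e. containers with no container among their descendants. Types, array limits, and function names here are illustrative, not the authors' implementation:

```c
/* A dynamic-call-tree node for the sketch. */
typedef struct Node {
    const char *name;
    int is_container;            /* marked as a deviation container? */
    struct Node *children[8];
    int nchildren;
} Node;

static int has_container_descendant(const Node *n) {
    for (int i = 0; i < n->nchildren; i++) {
        const Node *c = n->children[i];
        if (c->is_container || has_container_descendant(c)) return 1;
    }
    return 0;
}

/* Collect innermost containers into out[]; returns how many were found. */
int innermost_containers(const Node *n, const Node **out, int max) {
    int k = 0;
    if (n->is_container && !has_container_descendant(n)) {
        if (k < max) out[k++] = n;
        return k;   /* no containers below this node */
    }
    for (int i = 0; i < n->nchildren; i++)
        k += innermost_containers(n->children[i], out + k, max - k);
    return k;
}
```

On the tcas example of the next slide, where alt_sep_test and a nested Non_Crossing_Biased_Descend are both containers, only the inner one would be reported.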

SLIDE 20

Deviation Container Example

main
___initialize
___alt_sep_test --------------- [dev container]
___Non_Crossing_Biased_Climb
___Inhibit_Biased_Climb
___Own_Above_Threat
___ALIM
___Own_Below_Threat
___Non_Crossing_Biased_Descend  [dev container]
___Inhibit_Biased_Climb
___Own_Below_Threat

The 91st test on the 9th faulty version of the tcas program

[Hutchins et al. 94]

SLIDE 21

Outline

  • Value Spectra
  • Value Spectra Comparison
  • Experiment
  • Conclusion
SLIDE 22

Experimental Subjects

program     funcs   loc   tests   vers_used
printtok      18    402    4130       7
printtok2     19    483    4115      10
replace       21    516    5542      12
schedule      18    299    2650       9
schedule2     16    297    2710      10
tcas           9    138    1608       9
totinfo        7    346    1052       6

[Hutchins et al. 94, Rothermel&Harrold 98]

SLIDE 23

Questions to Be Answered

  • How effectively can we use value spectra differences to expose deviations (compared with using output differences)?
  • How accurately can we use the two heuristics to locate deviation roots?

SLIDE 24

Experiment Design

  • Run each test on both the original version and a faulty version (instrumented using the Daikon frontend [Ernst et al. 01])
  • Compute value spectra of the two versions
  • Compare value spectra of the two versions [compare outputs of the two versions]
  • Locate deviation roots from spectra differences

SLIDE 25

Measurements

  • Deviation exposure ratio:
    |tests exhibiting spectra diffs| / |tests covering the changed lines|
  • Deviation-root localization ratio:
    |tests succeeding in locating roots| / |tests exhibiting spectra diffs|
  • The higher, the better
SLIDE 26

Deviation Exposure Ratios

SLIDE 27

What We Have Learned

  • When program outputs are the same for the two versions, deviations can still be detected based on value spectra differences.
  • Value hit spectra seem to be good enough; adding count information or sequence information does not improve much.

SLIDE 28

Deviation-Root Localization Ratios

(Value Hit Spectra)

SLIDE 29

What We Have Learned

  • Identified deviation roots are accurate for most programs.
  • The exceptional case is schedule2, whose state changes lie in deep parts of a linked list.
  • By default, the Daikon frontend looks into state information of complex data structures only to a depth of three.

SLIDE 30

Threats to Validity

  • Representative of true practice?
    • Subject programs, faults, and tests
  • Instrumentation effects that bias the results
    • Faults in tools (analysis scripts, Daikon frontend)
    • Use of approximate state information
SLIDE 31

Conclusion

  • Checking only black-box outputs is limited in regression testing
  • Value spectra enrich the existing program spectra family
  • Comparing value spectra helps detect and understand deviation propagation
  • The experimental results have shown:
    • Comparing value spectra is effective in detecting deviations
    • Two heuristics are effective in locating deviation roots
SLIDE 32

Questions?

SLIDE 33

Scalability

  • Cost = O(|vars| × |userfuncs| × |testsuite|)

program     funcs   loc   tests   trace size/test (KB)
printtok      18    402    4130     36
printtok2     19    483    4115     50
replace       21    516    5542     71
schedule      18    299    2650    982
schedule2     16    297    2710    272
tcas           9    138    1608      8
totinfo        7    346    1052     27

  • Execution slowdown: 2x–7x for most programs (schedule2: 31x; schedule: 48x)
  • Analysis time: 7–30 ms/test for most programs (schedule2: 137 ms/test; schedule: 201 ms/test)
SLIDE 34

Related Work

  • Structural program spectra [Ball&Larus 96, Reps et al. 97, Harrold et al. 00]
  • GUI test oracles based on GUI states [Memon et al. 03]
  • Relative debugging [Abramson et al. 96]
  • Comparison checking [Jaramillo et al. 02]
  • RELAY model [Thompson et al. 93]
  • PIE (Propagation, Infection, and Execution) model [Voas 92]

SLIDE 35

Representation of Function Execution

  • Function-entry state
    • Argument values
    • Global variable values
  • Function-exit state
    • Updated argument values
    • Updated global variable values
    • Return value
  • funcname(entry(argvals), exit(argvals, ret))

main(entry(3, “0”, “1”), exit(3, “0”, “1”, 0))
max(entry(0, 1), exit(0, 1, 1))