Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? - PowerPoint PPT Presentation

Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? Alexandre Perez alexandre.perez@fe.up.pt Nuno Cardoso, Jos´ e Campos, Rui Abreu nunopcardoso@gmail.com, jose.carlos.campos@fe.up.pt, rui@computer.com Department of Informatics Engineering Faculty of Engineering, University of Porto November 19, 2013 1 / 19

Table of Contents 1 Spectrum-based Reasoning 2 Redundancy at the Test Level: Impacts? 3 Minimising Coincidental Correctness 4 Conclusions 2 / 19

Spectrum-based Fault Localisation A hit spectra is a pair ( obs, e ) : obs i obs i Activity of components in i e i 1 2 3 transaction i . 1 1 1 0 1 e i Outcome of transaction i 2 1 0 1 1 (pass or fail). 3 / 19

Spectrum-based Fault Localisation A hit spectra is a pair ( obs, e ) : obs i obs i Activity of components in i e i 1 2 3 transaction i . 1 1 1 0 1 e i Outcome of transaction i 2 1 0 1 1 (pass or fail). Spectrum-based Reasoning : • Different from statistical fault localisation approaches. • Generate sets of components that would explain the observed erroneous behaviour. • Rank the candidates according to their likelihood of being faulty. 3 / 19

Diagnostic Candidate Generation obs i i e i • Generate sets of components that 1 2 3 would explain the observed erroneous 1 1 1 0 1 behaviour. 2 1 0 1 1 { 3 } { 2 , 3 } {} { 2 } { 1 , 3 } { 1 , 2 , 3 } { 1 } { 1 , 2 } 1 Rui Abreu and Arjan J. C. van Gemund. “A Low-Cost Approximate Minimal Hitting Set Algorithm and its Application to Model-Based Diagnosis”. In: SARA . 2009. 2 Nuno Cardoso and Rui Abreu. “MHS2: A Map-Reduce Heuristic-Driven Minimal Hitting Set Search Algorithm”. In: MUSEPAT . 2013, pp. 25–36. 4 / 19

Diagnostic Candidate Generation obs i i e i • Generate sets of components that 1 2 3 would explain the observed erroneous 1 1 1 0 1 behaviour. 2 1 0 1 1 { 3 } { 2 , 3 } { 3 } { 2 , 3 } {} { 2 } { 1 , 3 } { 1 , 2 , 3 } {} { 2 } { 1 , 3 } { 1 , 2 , 3 } { 1 } { 1 , 2 } { 1 } { 1 , 2 } • A minimal candidate is a set of components that cover all failing transactions. • Staccato 1 & MHS2 2 . 1 Rui Abreu and Arjan J. C. van Gemund. “A Low-Cost Approximate Minimal Hitting Set Algorithm and its Application to Model-Based Diagnosis”. In: SARA . 2009. 2 Nuno Cardoso and Rui Abreu. “MHS2: A Map-Reduce Heuristic-Driven Minimal Hitting Set Search Algorithm”. In: MUSEPAT . 2013, pp. 25–36. 4 / 19

Diagnostic Candidate Ranking Barinel 3 approach: • For each candidate d under a set of observations ( obs, e ) , the ıve Bayes rule 4 . posterior probability is calculated using Na¨ Pr ( obs | d ) Pr ( d | obs ) = Pr ( d ) × Pr ( obs ) 3 Rui Abreu, Peter Zoeteweij, and Arjan J. C. van Gemund. “Spectrum-Based Multiple Fault Localization”. In: ASE . 2009, pp. 88–99. 4 Conditional independence is assumed throughout the process. 5 / 19

Diagnostic Candidate Ranking Barinel 3 approach: • For each candidate d under a set of observations ( obs, e ) , the ıve Bayes rule 4 . posterior probability is calculated using Na¨ • Pr ( d ) is used to make larger candidates less probable. Pr ( obs | d ) Pr ( d | obs ) = Pr ( d ) Pr ( d ) × Pr ( obs ) 3 Rui Abreu, Peter Zoeteweij, and Arjan J. C. van Gemund. “Spectrum-Based Multiple Fault Localization”. In: ASE . 2009, pp. 88–99. 4 Conditional independence is assumed throughout the process. 5 / 19

Diagnostic Candidate Ranking Barinel 3 approach: • For each candidate d under a set of observations ( obs, e ) , the ıve Bayes rule 4 . posterior probability is calculated using Na¨ • Pr ( d ) is used to make larger candidates less probable. Pr ( obs | d ) Pr ( d | obs ) = Pr ( d ) × Pr ( obs ) Pr ( obs ) • Pr ( obs ) is not considered for ranking purposes (does not depend on d ). 3 Rui Abreu, Peter Zoeteweij, and Arjan J. C. van Gemund. “Spectrum-Based Multiple Fault Localization”. In: ASE . 2009, pp. 88–99. 4 Conditional independence is assumed throughout the process. 5 / 19

Diagnostic Candidate Ranking Barinel 3 approach: • For each candidate d under a set of observations ( obs, e ) , the ıve Bayes rule 4 . posterior probability is calculated using Na¨ • Pr ( d ) is used to make larger candidates less probable. Pr ( obs | d ) Pr ( obs | d ) Pr ( d | obs ) = Pr ( d ) × Pr ( obs ) • Pr ( obs ) is not considered for ranking purposes (does not depend on d ). • Pr ( obs | d ) is used to bias the probability based on the run-time observations. 3 Rui Abreu, Peter Zoeteweij, and Arjan J. C. van Gemund. “Spectrum-Based Multiple Fault Localization”. In: ASE . 2009, pp. 88–99. 4 Conditional independence is assumed throughout the process. 5 / 19

Diagnostic Candidate Ranking  if e i = 0 G ( obs i , d )  � Pr ( obs | d ) = 1 − G ( obs i , d ) if e i = 1  obs i ∈ obs G ( obs i , d ) is estimated: • Using maximum likelihood estimation under parameters { g j | j ∈ d } 5 . • NFGE 6 : uses a feedback loop to update the health estimates of each component. 5 Rui Abreu, Peter Zoeteweij, and Arjan J. C. van Gemund. “Spectrum-Based Multiple Fault Localization”. In: ASE . 2009, pp. 88–99. 6 Nuno Cardoso and Rui Abreu. “A Kernel Density Estimate-Based Approach to Component Goodness Modeling”. In: AAAI . 2013, pp. 152–158. 6 / 19

Redundant Test Cases At the spectra level of abstraction: • Tests are redundant if they share similar activity patterns. • Can exonerate faulty components. 7 Wes Masri and Rawad Abou Assi. “Cleansing Test Suites from Coincidental Correctness to Enhance Fault-Localization”. In: ICST . 2010, pp. 165–174. 8 George K. Baah, Andy Podgurski, and Mary Jean Harrold. “Mitigating the confounding effects of program dependences for effective fault localization”. In: FSE . 2011, pp. 146–156. 8 / 19

Redundant Test Cases At the spectra level of abstraction: • Tests are redundant if they share similar activity patterns. • Can exonerate faulty components. Coincidental correctness 7 , 8 : • Occurs when passing test cases execute faulty components and no failure is triggered. • Can be caused by incorrect or relaxed test oracles. • Can occur due to the abstraction of program traces used. 7 Wes Masri and Rawad Abou Assi. “Cleansing Test Suites from Coincidental Correctness to Enhance Fault-Localization”. In: ICST . 2010, pp. 165–174. 8 George K. Baah, Andy Podgurski, and Mary Jean Harrold. “Mitigating the confounding effects of program dependences for effective fault localization”. In: FSE . 2011, pp. 146–156. 8 / 19

Redundant Test Cases – Example Consider the following hit-spectra matrix: obs i i e i 1 2 3 1 1 1 0 1 2 0 1 1 1 3 1 0 1 1 4 1 0 0 0 • After candidate generation: D = �{ 1 , 2 } , { 1 , 3 } , { 2 , 3 }� • Diagnostic Ranking: • Pr ( { 2 , 3 }| obs ) = 0 . 66 • Pr ( { 1 , 2 }| obs ) = 0 . 17 • Pr ( { 1 , 3 }| obs ) = 0 . 17 9 / 19

Redundant Test Cases – Example After a redundant test case: obs i i e i 1 2 3 1 1 1 0 1 2 0 1 1 1 3 1 0 1 1 4 1 0 0 0 • Pr ( { 2 , 3 }| obs ) = 0 . 66 • Pr ( { 1 , 2 }| obs ) = 0 . 17 • Pr ( { 1 , 3 }| obs ) = 0 . 17 10 / 19

Redundant Test Cases – Example After a redundant test case: obs i obs i i e i i e i 1 2 3 1 2 3 1 1 1 0 1 1 1 1 0 1 2 0 1 1 1 2 0 1 1 1 3 1 0 1 1 3 1 0 1 1 4 1 0 0 0 4 1 0 0 0 5 1 0 1 0 • Pr ( { 2 , 3 }| obs ) = 0 . 66 • Pr ( { 2 , 3 }| obs ) = 0 . 59 • Pr ( { 1 , 2 }| obs ) = 0 . 17 • Pr ( { 1 , 2 }| obs ) = 0 . 35 • Pr ( { 1 , 3 }| obs ) = 0 . 17 • Pr ( { 1 , 3 }| obs ) = 0 . 06 10 / 19

Redundant Test Cases – Example After a redundant test case: obs i i e i 1 2 3 1 1 1 0 1 2 0 1 1 1 3 1 0 1 1 4 1 0 0 0 • Pr ( { 2 , 3 }| obs ) = 0 . 66 • Pr ( { 1 , 2 }| obs ) = 0 . 17 • Pr ( { 1 , 3 }| obs ) = 0 . 17 11 / 19

Redundant Test Cases – Example After a redundant test case: obs i obs i i e i i e i 1 2 3 1 2 3 1 1 1 0 1 1 1 1 0 1 2 0 1 1 1 2 0 1 1 1 3 1 0 1 1 3 1 0 1 1 4 1 0 0 0 4 1 0 0 0 5 0 1 1 0 • Pr ( { 2 , 3 }| obs ) = 0 . 66 • Pr ( { 1 , 2 }| obs ) = 0 . 33 • Pr ( { 1 , 2 }| obs ) = 0 . 17 • Pr ( { 1 , 3 }| obs ) = 0 . 33 • Pr ( { 1 , 3 }| obs ) = 0 . 17 • Pr ( { 2 , 3 }| obs ) = 0 . 33 11 / 19

Minimising Coincidental Correctness – Related work Marsi et al. 9 remove coincidentally correct test cases by: • Selecting a set of suspicious statements executed by all failing tests (called CCEs); • Clustering tests into two groups based on the similarity of the executed statements to the CCEs. 9 Wes Masri and Rawad Abou Assi. “Cleansing Test Suites from Coincidental Correctness to Enhance Fault-Localization”. In: ICST . 2010, pp. 165–174. 13 / 19

Minimising Coincidental Correctness – Related work Miao et al. 10 use a similar clustering approach: • Uses hard k-Means clustering with k = | T | × p . • If a passing test is in the same cluster as a failing one, it is labeled as coincidentally correct. 10 Yi Miao et al. “Identifying Coincidental Correctness for Fault Localization by Clustering Test Cases”. In: SEKE . 2012, pp. 267–272. 14 / 19

Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? - PowerPoint PPT Presentation

Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? Alexandre Perez alexandre.perez@fe.up.pt Nuno Cardoso, Jos e Campos, Rui Abreu nunopcardoso@gmail.com, jose.carlos.campos@fe.up.pt, rui@computer.com Department of

Evolving Fault Localisation Shin Yoo, University College London, UK Human Competitive Award,

P i Paired Redundant IOCs Paired Redundant IOCs d R d d t IOC with Redundant Hardware with

The Multilingual Web Where Are We? Next Generation Localisation Josef van Genabith, CNGL

Lecture 10: Fault Tolerance Fault Tolerant Concurrent Computing The main principles of fault

Using Information Theory to Guide Fault Localisation Shin Yoo (joint work with Mark Harman &

Chapt er 14: Redundant Arit hmet ic Keshab K. Parhi A non-redundant radix-r number has

Model-Based Testing (ISTQB Chapter 4) Arie van Deursen 1 4.1 ISTQB Test Design Test Scripts

Localisation, Mapping and the Simultaneous Localisation and Mapping (SLAM) Problem Hugh

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Fault Detective Test Analyzer 2004 Board Test Workshop Doug Manley Loveland, CO Gray Fault

Cost-efficient localisation system for agricultural use cases Gbor Paller, Sbastien Aubin,

Distributed Systems 5. Fault Tolerant Systems Fault-Tolerance - 1 Lszl Bszrmnyi

JUST ONE FAULT Persistent Fault Analysis on Block Ciphers Shivam Bhasin Temasek Labs @ NTU ASK

THE RESEARCH Assault 82,235 cases of assault on 64,519 cases Women Kidnapping women /

Use Cases Use Cases Use Use cases cases 2003 Giorgini Information Acquisition -- 1 Basi

Redundant Feature Elimination Redundant Feature Elimination for Multi-Class Problems for

MAESTRO Project Update 3 Charu Dwivedi Fidelia Lam Da Fang Nilay Muchhala October 11, 2016

The Arpeggigon: A Functional Reactive Musical Automaton Demo, FARM 2017, 9 Sept., Oxford Henrik

+ String Conditioning: Ten Minutes of Exercises 2011-2013 Daily Practice on the Violin, Viola,

Admissible Digit Sets Jesse Hughes 1 , 2 Milad Niqui 2 1 Technical University of Eindhoven 2

Physical Modeling meets Machine Learning: Teaching Bow Control to a Virtual Violinist Graham

1 Relational Database Domains SNOOPYFAMILY

Templates D&D Chapter 12 Introduction Templates - easily create a large range of related

Faster, Smarter, Greener Intelligent Reflecting Surface for 6G Communications Qingqing Wu

Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? - PowerPoint PPT Presentation

Exploiting Redundant Test Cases in Fault Localisation: Good or Bad? Alexandre Perez alexandre.perez@fe.up.pt Nuno Cardoso, Jos e Campos, Rui Abreu nunopcardoso@gmail.com, jose.carlos.campos@fe.up.pt, rui@computer.com Department of

Evolving Fault Localisation Shin Yoo, University College London, UK Human Competitive Award,

P i Paired Redundant IOCs Paired Redundant IOCs d R d d t IOC with Redundant Hardware with

The Multilingual Web Where Are We? Next Generation Localisation Josef van Genabith, CNGL

Lecture 10: Fault Tolerance Fault Tolerant Concurrent Computing The main principles of fault

Using Information Theory to Guide Fault Localisation Shin Yoo (joint work with Mark Harman &amp;

Chapt er 14: Redundant Arit hmet ic Keshab K. Parhi A non-redundant radix-r number has

Model-Based Testing (ISTQB Chapter 4) Arie van Deursen 1 4.1 ISTQB Test Design Test Scripts

Localisation, Mapping and the Simultaneous Localisation and Mapping (SLAM) Problem Hugh

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Fault Detective Test Analyzer 2004 Board Test Workshop Doug Manley Loveland, CO Gray Fault

Cost-efficient localisation system for agricultural use cases Gbor Paller, Sbastien Aubin,

Distributed Systems 5. Fault Tolerant Systems Fault-Tolerance - 1 Lszl Bszrmnyi

JUST ONE FAULT Persistent Fault Analysis on Block Ciphers Shivam Bhasin Temasek Labs @ NTU ASK

THE RESEARCH Assault 82,235 cases of assault on 64,519 cases Women Kidnapping women /

Use Cases Use Cases Use Use cases cases 2003 Giorgini Information Acquisition -- 1 Basi

Redundant Feature Elimination Redundant Feature Elimination for Multi-Class Problems for

MAESTRO Project Update 3 Charu Dwivedi Fidelia Lam Da Fang Nilay Muchhala October 11, 2016

The Arpeggigon: A Functional Reactive Musical Automaton Demo, FARM 2017, 9 Sept., Oxford Henrik

+ String Conditioning: Ten Minutes of Exercises 2011-2013 Daily Practice on the Violin, Viola,

Admissible Digit Sets Jesse Hughes 1 , 2 Milad Niqui 2 1 Technical University of Eindhoven 2

Physical Modeling meets Machine Learning: Teaching Bow Control to a Virtual Violinist Graham

1 Relational Database Domains SNOOPYFAMILY

Templates D&amp;D Chapter 12 Introduction Templates - easily create a large range of related

Faster, Smarter, Greener Intelligent Reflecting Surface for 6G Communications Qingqing Wu

Using Information Theory to Guide Fault Localisation Shin Yoo (joint work with Mark Harman &

Templates D&D Chapter 12 Introduction Templates - easily create a large range of related