An Empirical Study on Reducing Omission Errors in Practice - PowerPoint PPT Presentation

ASE ¡2014 An ¡Empirical ¡Study ¡on ¡Reducing ¡ Omission ¡Errors ¡in ¡Practice Jihun ¡Park 1 , ¡Miryung Kim 2 , ¡Doo-‑Hwan ¡Bae 1 1. KAIST, ¡South ¡Korea 2. University ¡of ¡California, ¡Los ¡Angeles ¡(UCLA), ¡USA

Predicting ¡co-‑changed ¡entities A.java B.java … … C.java Version ¡history Revision ¡7365 Can ¡we ¡predict ¡an ¡additional ¡change ¡location ¡in ¡ a ¡transaction? Change coupling (mining SW repositories) : Zimmermann et al., • Ying et al., Hassan and Holt, Herzig and Zeller Structural dependency : Robillard, Saul et al. • Cloning-‑based relationship : Nguyen et al. • 2

Predicting ¡omission ¡errors Initial ¡change Supplementary ¡change A.java A.java B.java D.java … … … C.java E.java Version ¡history Revision ¡101 Revision ¡125 Log: ¡Fix ¡bug ¡#10000 Log: ¡Patch ¡bug ¡#10000 A ¡developer ¡missed ¡to ¡update ¡D ¡and ¡E ¡( omission ¡error ) How ¡can ¡we ¡predict ¡the ¡supplementary ¡change ¡ location, ¡given ¡the ¡initial ¡change ¡location? 3

Key ¡contributions • To systematically investigate a real-‑world supplementary patch data set, we suggest a graph representation change relationship graph (CRG) . 1. While a single trait is inadequate, combining multiple traits is limited as well. 2. A boosting approach does not significantly improve the accuracy. 3. There is no package or developer specific pattern. 4. There is no repeated mistake. 4

Change ¡Relationship ¡Graph ¡(CRG) Study ¡subjects: ¡Eclipse ¡JDT ¡core, ¡Eclipse ¡SWT, ¡and ¡Equinox ¡p2 • Graph ¡Nodes • Classes • Methods • Graph ¡Edges Class Class • Extends contains contains • Contains • Method ¡invocation Code ¡clone (calls, ¡called ¡by) Method Method • Historical ¡ co-‑change An ¡initial ¡change ¡location calls • Code ¡clone • Name ¡similarity Method The ¡supplementary ¡change ¡location 5 * ¡M.K. ¡Ripon ¡Saha et al. ¡A ¡graph-‑based ¡framework ¡for ¡reasoning ¡about ¡ relationships ¡among ¡software ¡modifications. ¡TR ¡2014

Observation ¡1: ¡While ¡a ¡single ¡trait ¡is ¡inadequate, ¡ combining ¡multiple ¡traits ¡is ¡limited ¡as ¡well. ¡ • Only 10% to 20% of supplementary change locations can be connected with one edge from initial change location. • Combining multiple traits as a prediction rule shows at most 10% accuracy Supplem Code ¡clone calls X ¡ Initial (arbitrary) entary Combining ¡multiple ¡traits ¡does ¡not ¡predict ¡ supplementary ¡change ¡locations ¡accurately 6

Observation ¡2: ¡A ¡boosting ¡approach ¡does ¡not ¡ improve ¡the ¡accuracy. • We design a boosting approach that sums up trained accuracy of rules connecting initial and supplementary change locations to calculate prediction score Sums ¡up ¡trained ¡accuracy ¡of ¡these ¡rules. calls, ¡called ¡by Prediction ¡score Method ¡2 Method ¡1 Supplementary Predict ¡locations ¡which ¡have ¡ Initial Co-‑change high ¡prediction ¡scores • This approach cannot accurately predict supplementary change location (at most 7% precision). Boosting ¡approach ¡based ¡on ¡the ¡past ¡prediction ¡accuracy ¡also ¡ cannot ¡accurately ¡predict ¡supplementary ¡change ¡locations. 7

Observation ¡3: ¡There ¡is ¡no ¡package ¡or ¡developer ¡ specific ¡pattern. • Package or developer specific rules might improve the prediction accuracy. Accuracy ¡of ¡ code ¡clone : ¡40% Package ¡A Accuracy ¡of ¡ co-‑change : ¡10% • We make boosting approaches based on package and developer specific prediction rules. • The improvements is negligible; the highest accuracy improvementis only 1.2% No ¡package ¡or ¡developer ¡specific ¡pattern ¡between ¡initial ¡and ¡ supplementary ¡change ¡locations ¡exists. 8

Observation ¡4: ¡There ¡is ¡no ¡repeated ¡mistake. • There might be an uncovered relationship which can result in repeated patterns . Initial Supplementary Initial A.java B.java A.java … … Version ¡history Rev. ¡100 Rev. ¡109 Rev. ¡200 • The majority of patterns (78% ~ 96%) appear only once. • 69% to 84% of initial change locations appear only once. Developers ¡rarely ¡make ¡repeated ¡ mistakes ¡at ¡the ¡same ¡location 9

Conclusion • We systematically study omission errors using a real-‑ world supplementary patch data set. • Version history based pattern mining cannot be accurate at finding supplementarychange locations. • Past prediction accuracy, and package or developer specific information does not help. • We share our skepticism that reducing real-‑world omission errors is inherently challenging. 10

ASE ¡2014 Thank you for listening An ¡Empirical ¡Study ¡on ¡Reducing ¡ Omission ¡Errors ¡in ¡Practice Jihun ¡Park 1 , ¡Miryung Kim 2 , ¡Doo-‑Hwan ¡Bae 1 1. KAIST, ¡South ¡Korea 2. University ¡of ¡California, ¡Los ¡Angeles ¡(UCLA), ¡USA

Supplementary ¡Data ¡Set The ¡bug ¡IDs ¡that ¡were ¡ The ¡bug ¡IDs ¡that ¡were ¡mentioned ¡ mentioned ¡only ¡one ¡commit. in ¡multiple ¡fix ¡revisions. Type ¡2 ¡bug Type ¡1 ¡bug Supplementary … … Bug ¡reports Bug 31 Bug 22 patches Fix ¡#22 Fix ¡#31 Fix ¡#31 Fix ¡#31 Fix ¡commits ü ü ü ü An ¡initial ¡ Development ¡ (incomplete) ¡ history patch • We use Eclipse JDT core, Eclipse SWT, and Equinox p2 • Total 16 years, 13259 bugs (24.8% are Type 2 bugs on average) 12

Subject ¡projects Eclipse ¡JDT ¡core Eclipse ¡SWT Equinox ¡p2 Study ¡period 2001/06 ~ ¡2007/12 2001/05 ¡~ ¡2008/12 2006/01 ¡~ ¡2009/12 Total revisions 17009 revisions 21530 ¡revisions 6761 ¡revisions # ¡of ¡bugs 1812 1256 1783 Type ¡1 ¡bugs 2930 ¡(77.04%) 3458 ¡(74.00%) 1328 ¡(74.48%) Type ¡2 ¡bugs 873 ¡(22.96%) 1215 ¡(26.00%) 455 ¡(25.52%) 13

Evaluating ¡a ¡prediction ¡method • Precision, recall, and f-‑score – Predicted set 𝑄 and Suggested set 𝑇 – 𝑄𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 = ¡ |-∩/| |-| , 𝑆𝑓𝑑𝑏𝑚𝑚 = ¡ |-∩/| |/| – 𝐺 − 𝑡𝑑𝑝𝑠𝑓 = 2 ∗ 𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 ∗ 𝑠𝑓𝑑𝑏𝑚𝑚/(𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 + 𝑠𝑓𝑑𝑏𝑚𝑚) • Feedback – What portion of initial changes can obtain at least one suggestion? > is derived using a predictionmethod m for bug b, – 𝑄 = N ¡} ¡| | ¡ ¡=∈CDEFGG=HIJ ¡ KL - M – 𝐺𝑓𝑓𝑒𝑐𝑏𝑑𝑙 = ¡ | ¡ CDEFGG=HIJ | 14

An Empirical Study on Reducing Omission Errors in Practice - PowerPoint PPT Presentation

ASE 2014 An Empirical Study on Reducing Omission Errors in Practice Jihun Park 1 , Miryung Kim 2 , Doo-Hwan Bae 1 1. KAIST, South Korea 2. University of California,

Poetic Figures 1 THE OMISSION OF CERTAIN WORDS Ellipsis : omission of words necessary in

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors

Poetic Figures 1 THE OMISSION OF CERTAIN WORDS Ellipsis: leaving out omission of

EECS 591 D ISTRIBUTED S YSTEMS Manos Kapritsos Fall 2020 Slides by: Lorenzo Alvisi B YZANTINE F

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

(2) The overwhelming majority of children's spontaneous errors involve omission, not co -mission

Case 2: Reducing Cardiovascular Risk Type 2 Diabetes Management Case 1: Reducing Hypoglycemic

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors

Treasurers Institute Sun, Nov. 17, 2019 Property Tax Errors Property Tax Errors Property Tax

NMVTIS INFORMATION FOR TACA MARCH 2019 NMVTIS ERRORS Odometer Reading Discrepancies

GENIE Systematic Errors GENIE Systematic Errors GENIE Systematic Errors Hugh Gallagher, Tufts

Unforced Errors Unforced Errors My mother taught me that in polite society, we do not talk

Exceptions Introduction to Computing Using Python Types of errors We saw different types of

Empirical research on economic inequality: Normative considerations and empirical practice.

Functional Principal Component Analysis May 14, 2018 Empirical Principal Component FPC for the

Spectral Approximate Inference Speaker: Sejun Park 1 Joint work with Eunho Yang 1,2 , Se-Young Yun

Structured Policy Iteration for Linear Quadratic Regulator Youngsuk Park 1 with R. Rossi 2 , Z.

Hibachi: A Cooperative Hybrid Cache with NVRAM and DRAM for Storage Arrays Ziqi Fan, Fenggang Wu,

FAST: Quick Application Launch on Solid-State Drives Yongsoo Joo 1 , Junhee Ryu 2 , Sangsoo Park

Evidence for Episodic Accretion in Class I Source, IRAS 16316-1540 Sung-Yong Yoon 1 , Jeong-Eun

Charm++ as an Energy Efficient Runtime 1 4/18/17 BILGE ACUN - CHARM++ WORKSHOP 2017 Interaction

LOBBY 10 1 PAINLESS ADVOCACY: The Art of Successfully Engaging with Your Elected officials

Industrial REIT C E N T U R I A I N D U S T R I A L R E I T A S X : C I P 1 69 STUDLEY COURT,