Exploring the Integration of User Feedback in Automated Testing of - PowerPoint PPT Presentation

Exploring the Integration of User Feedback in Automated Testing of Android Applications G. Grano, A. Ciurumelea, S. Panichella, F. Palomba, H. Gall SANER 2018, 20-23 March, Campobasso (Italy) grano@ifi.uzh.ch giograno90

149 billions of apps 12 millions of devs 60 billions

Competition Satisfaction Quality

Testing tools Plethora of Android testing tools: > Monkey: state of the practice > Sapienz: now in Facebook > Dynodroid > ... > and a lot of others! 4 — Giovanni Grano @ s.e.a.l.

Limitations They are not suited for generating inputs that require human intelligence Redundancy of generated input sequences 5 — Giovanni Grano @ s.e.a.l.

Tools behavior 1. Stack Trace 2. Sequence of inputs

Stack Trace // CRASH: com.danvelazco.fbwrapper (pid 4302) // Short Msg: java.lang.NullPointerException // Long Msg: java.lang.NullPointerException // Build Label: samsung/espressowifixx/espressowifi:4.2.2/JDQ39/P3110XXDMH1:user/release-keys // Build Changelist: 8291 // Build Time: 1419156873000 // java.lang.NullPointerException // at com.danvelazco.fbwrapper.activity.BaseFacebookWebViewActivity .onKeyDown(BaseFacebookWebViewActivity.java:649) // at com.danvelazco.fbwrapper.FbWrapper.onKeyDown(FbWrapper.java:429) // at android.view.KeyEvent.dispatch(KeyEvent.java:2640) // at android.app.Activity.dispatchKeyEvent(Activity.java:2433) // at com.android.internal.policy.impl.PhoneWindow$DecorView.dispatchKeyEvent(PhoneWindow.java:2021) // at android.view.ViewRootImpl$ViewPostImeInputStage.processKeyEvent(ViewRootImpl.java:3845) // at android.view.ViewRootImpl$ViewPostImeInputStage.onProcess(ViewRootImpl.java:3819) // at android.view.ViewRootImpl$InputStage.deliver(ViewRootImpl.java:3392) // at android.view.ViewRootImpl$InputStage.onDeliverToNext(ViewRootImpl.java:3442) // at android.view.ViewRootImpl$InputStage.forward(ViewRootImpl.java:3411) // at android.view.ViewRootImpl$AsyncInputStage.forward(ViewRootImpl.java:3518) 7 — Giovanni Grano @ s.e.a.l.

Sequence of Inputs type= raw events count= -1 speed= 1.0 start data >> LaunchActivity(com.ringdroid,com.ringdroid.RingdroidSelectActivity) DispatchKey(223989,223989,0,23,0,0,-1,0) DispatchKey(224204,224204,1,23,0,0,-1,0) DispatchPointer(224346,224347,0,479.0,774.0,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224351,2,479.60635,797.5855,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224353,2,482.31937,814.9475,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224357,2,483.44247,829.02045,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224359,2,486.9434,848.0035,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224361,2,490.1806,859.495,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224364,2,497.59595,872.6837,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224367,2,500.53647,894.2986,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224346,224369,1,503.94815,896.686,0.0,0.0,0,1.0,1.0,0,0) DispatchPointer(224374,224374,0,166.0,4.0,0.0,0.0,0,1.0,1.0,0,0) 8 — Giovanni Grano @ s.e.a.l.

Can we make it easier?

History of success > release planning 1 2 > change localization 3 2 > user feedback categorization 4 1 Villaroel et al - Release planning of mobile apps based on user reviews 3 Palomba et al - Recommending and localizing change requests for mobile apps based on user reviews 2 Ciurumelea et al - Analyzing reviews and code of mobile apps for better release planning 4 Panichella et al - How can i improve my app? classifying user reviews for software maintenance and evolution 10 — Giovanni Grano @ s.e.a.l.

Concrete Example

A Stack Trace Long Msg: java.lang.NumberFormatException: Invalid int: "/" java.lang.RuntimeException: An error occurred while executing doInBackground() at android.os.AsyncTask$3.done(AsyncTask.java:300) at java.util.concurrent.FutureTask.finishCompletion(FutureTask.java:355) ... at com.amaze.filemanager.services.asynctasks.LoadList.doInBackground(LoadList.java:120) at com.amaze.filemanager.services.asynctasks.LoadList.doInBackground(LoadList.java:50) at android.os.AsyncTask$2.call(AsyncTask.java:288) at java.util.concurrent.FutureTask.run(FutureTask.java:237) ... 3 more 12 — Giovanni Grano @ s.e.a.l.

An User Review "Love the idea of this app but anytime I leave the page the screen goes completely white and won’t come back until force-stopped. Update: I thought the white screen was because my phone was so outdated but it still does it on my Nexus 6 ...." 13 — Giovanni Grano @ s.e.a.l.

Underline idea User reviews might be helpful for: > comprehending the causes behind a failure > easing the debugging phase > discovering errors that tools cannot reveal 14 — Giovanni Grano @ s.e.a.l.

Research Questions

> RQ1: What type of user feedback can we leverage to detect bugs and support testing activities of mobile apps? > RQ2: How complementary is user feedback information with respect to the outcomes of automated testing tools? > RQ3: To what extent can we automatically link the crash- related information reported in both user feedback and testing tools? 16 — Giovanni Grano @ s.e.a.l.

RQ1: which reviews can we use? Data collection 1 Data Collection 2 Classification 6,600 reviews > Reviews Crawler for Google Play Store user reviews golden set > Manually validated from an external ML tools validator external 8 apps validator > Run our apps against Monkey and Sapienz HLT & LLT stack traces Output > Machine Learning classifier > Two (high and low) level taxonomy 17 — Giovanni Grano @ s.e.a.l.

RQ1: Results Category Precision Recall F1 Score crashes Bugs features & UI bugs feature additions Features 0.83 0.82 0.83 Feature Requests feature improvements & UI Bugs Taxonomy Usability performance Resources battery Request Information Crashes 0.91 0.94 0.92 Compatibility & Update Issues 18 — Giovanni Grano @ s.e.a.l.

We are able to predict with good precision reviews claminig about bugs

RQ2: complementarity We gave to an external inspector: 3 Complementarity > stack traces ML > event logs for crashes golden set crash-related > crash-related reviews > apk and source external validator > emulator Goal: establish manually validated links between reviews and stack traces stack traces 20 — Giovanni Grano @ s.e.a.l.

RQ2: Results App Common Only Reviews Only Tools app 1 13.6% 68.2% 18.2% app 2 23.1% 69.2% 7.7% ... ... ... ... Average 16% 62% 22% 21 — Giovanni Grano @ s.e.a.l.

Testing tools potentially miss several failures experienced by users

RQ3: linking 4 Linking Goal: automatically link stack traces with user reviews crash related stack traces Steps source > Augmenting stack trace with source code IR information > Preprocessing for both source bag of words bag of words > 2 bags of word for each source > 3 different IR techniques: Dice, Jaccard, VSM 23 — Giovanni Grano @ s.e.a.l.

RQ3: results App Precision Recall F1 Score app 1 67% 57% 62% app 2 62% 68% 65% ... ... ... ... Average 82% 75% 78% 24 — Giovanni Grano @ s.e.a.l.

good performances in linking crash-related user reviews and stack traces

Future work User-oriented testing > summarization > prioritization > generation 26 — Giovanni Grano @ s.e.a.l.

Exploring the Integration of User Feedback in Automated Testing of - PowerPoint PPT Presentation

Exploring the Integration of User Feedback in Automated Testing of Android Applications G. Grano, A. Ciurumelea, S. Panichella, F. Palomba, H. Gall SANER 2018, 20-23 March, Campobasso (Italy) grano@ifi.uzh.ch giograno90 149 billions of

Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring

Improving Automated Feedback Building a Rule Feedback Generator Eric Bouwers September 27, 2007

Automated Design of Digital Automated Design of Digital Automated Design of Digital Automated

Automated integration tests for AJAX applications Overview CEE-SECR 2011 October 31

Systems Systems Systems Integration Systems Integration Systems Systems Integration Systems

Overview of Automated Bus Consortium Program Accelerating automated technology for transit

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Week 3 Video 4 Automated Feature Generation Automated Feature Selection Automated Feature

Automated Reasoning Course Presentation Summary Automated Reasoning Motivations Course Plan

Fortify Integration & User Experience Fortify Partner Integration Integration with both

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian

Exploring The Value of Energy Disaggregation through actionable feedback Nipun Batra , Amarjeet

Baby Got Feedback: How to Give and Take Feedback Like A Boss Sarah Hagan @thesarahhagan Sarah

Exploring the use of computational linguistics for automated formative feedback in the Humanities

Enterprise Automated Testing System Integration North American Distributor ATaaS Technology

Research Integration Model Codes Looking Forward Integration Bim Ex Plan Research

Learning and Inference in Markov Logic Networks CS 786 University of Waterloo Lecture 24: July

Please Advise: The 10 Most Common Underwriter Questions Bill Pratt June 21, 2018 In order to

Application Process for the Provider Relief Fund Phase 2 General Distribution for Medicaid

PIT Count Office Hours Topic of Focus: 2021 Unsheltered PIT Count Flexibilities November 24, 2020

Outcomes Driven User Story Mapping Sharjeel Aleemullah & Sylvia Yanto BEWARE OF MAPS BEWARE

Australias Primary Health Care System Challenges and Government Reform Mark Booth First

IHST Regional Partner Panel Heli-Expo 4 March 2015 Emergency Evacuation Insert diagram of the

Health & NDIS: Interface in the age of COVID-19 Steven Davison - Director, Social Policy

Sambuz

Useful Links

Newsletter

Mail Us

Exploring the Integration of User Feedback in Automated Testing of - PowerPoint PPT Presentation

Exploring the Integration of User Feedback in Automated Testing of Android Applications G. Grano, A. Ciurumelea, S. Panichella, F. Palomba, H. Gall SANER 2018, 20-23 March, Campobasso (Italy) grano@ifi.uzh.ch giograno90 149 billions of

Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring the IPY with NOAA Exploring

Improving Automated Feedback Building a Rule Feedback Generator Eric Bouwers September 27, 2007

Automated Design of Digital Automated Design of Digital Automated Design of Digital Automated

Automated integration tests for AJAX applications Overview CEE-SECR 2011 October 31

Systems Systems Systems Integration Systems Integration Systems Systems Integration Systems

Overview of Automated Bus Consortium Program Accelerating automated technology for transit

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Week 3 Video 4 Automated Feature Generation Automated Feature Selection Automated Feature

Automated Reasoning Course Presentation Summary Automated Reasoning Motivations Course Plan

Fortify Integration &amp; User Experience Fortify Partner Integration Integration with both

RUN groupadd -r user &amp;&amp; useradd -r -g user user USER user $ docker run --read-only debian

Exploring The Value of Energy Disaggregation through actionable feedback Nipun Batra , Amarjeet

Baby Got Feedback: How to Give and Take Feedback Like A Boss Sarah Hagan @thesarahhagan Sarah

Exploring the use of computational linguistics for automated formative feedback in the Humanities

Enterprise Automated Testing System Integration North American Distributor ATaaS Technology

Research Integration Model Codes Looking Forward Integration Bim Ex Plan Research

Learning and Inference in Markov Logic Networks CS 786 University of Waterloo Lecture 24: July

Please Advise: The 10 Most Common Underwriter Questions Bill Pratt June 21, 2018 In order to

Application Process for the Provider Relief Fund Phase 2 General Distribution for Medicaid

PIT Count Office Hours Topic of Focus: 2021 Unsheltered PIT Count Flexibilities November 24, 2020

Outcomes Driven User Story Mapping Sharjeel Aleemullah &amp; Sylvia Yanto BEWARE OF MAPS BEWARE

Australias Primary Health Care System Challenges and Government Reform Mark Booth First

IHST Regional Partner Panel Heli-Expo 4 March 2015 Emergency Evacuation Insert diagram of the

Health &amp; NDIS: Interface in the age of COVID-19 Steven Davison - Director, Social Policy

Sambuz

Useful Links

Newsletter

Mail Us

Fortify Integration & User Experience Fortify Partner Integration Integration with both

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian

Outcomes Driven User Story Mapping Sharjeel Aleemullah & Sylvia Yanto BEWARE OF MAPS BEWARE

Health & NDIS: Interface in the age of COVID-19 Steven Davison - Director, Social Policy