

SLIDE 1

TOWARDS AUTOMATED SUPPORTS FOR CODE REVIEWS USING REVIEWER RECOMMENDATION AND REVIEW QUALITY MODELLING

Mohammad Masudur Rahman, Chanchal K. Roy, Raula G. Kula, Jason Collins, and Jesse Redl

University of Saskatchewan, Canada; Osaka University, Japan; Vendasta Technologies, Canada
56th COW: Code Review and Continuous Inspection/Integration

SLIDE 2

CODE REVIEW


Code review could be unpleasant.

SLIDE 3

RECAP ON CODE REVIEW

 Formal inspection
 Peer code review
 Modern code review (MCR)

Code review is a systematic examination of source code for detecting bugs or defects and coding rule violations.


 Early bug detection
 Prevention of coding rule violations
 Enhanced developer skills

SLIDE 4

TODAY’S TALK OUTLINE

Part I: Code Reviewer Recommendation System (ICSE-SEIP 2016)
Part II: Prediction Model for Review Usefulness (MSR 2017)

SLIDE 5

TODAY’S TALK OUTLINE

Part III: Impact of Continuous Integration on Code Reviews (MSR 2017 Challenge)

SLIDE 6

Part I: Code Reviewer Recommendation (ICSE-SEIP 2016)

SLIDE 7


Selecting appropriate code reviewers is challenging, especially for novice developers and in distributed software development; reviews can be delayed by 12 days when appropriate reviewers are not found promptly.

(Thongtanunam et al, SANER 2015)

SLIDE 8

EXISTING LITERATURE

 Line Change History (LCH): ReviewBot (Balachandran, ICSE 2013)
 File Path Similarity (FPS): RevFinder (Thongtanunam et al, SANER 2015), FPS (Thongtanunam et al, CHASE 2014), Tie (Xia et al, ICSME 2015)
 Code Review Content and Comments: Tie (Xia et al, ICSME 2015), SNA (Yu et al, ICSME 2014)


 Issues & Limitations: existing techniques mine a developer's contributions from within a single project only.
 This study: Library & Technology Similarity across projects.

SLIDE 9

OUTLINE OF THIS STUDY

Study roadmap: Exploratory study (3 research questions, Vendasta codebase) → CORRECT → Evaluation using Vendasta codebase → Comparative study → Evaluation using open source projects → Conclusion

SLIDE 10

EXPLORATORY STUDY (3 RQS)

RQ1: How frequently do the commercial software projects reuse external libraries from within the codebase?

RQ2: Does the experience of a developer with such libraries matter in code reviewer selection by other developers?

RQ3: How frequently do the commercial projects adopt specialized technologies (e.g., taskqueue, mapreduce, urlfetch)?

SLIDE 11

DATASET: EXPLORATORY STUDY


 Each project has at least 750 closed pull requests.
 Each library is used at least 10 times on average.
 Each technology is used at least 5 times on average.

Dataset: 10 commercial projects (Vendasta), 10 utility libraries (Vendasta), and 10 Google App Engine technologies.

SLIDE 12

LIBRARY USAGE IN COMMERCIAL PROJECTS (ANSWERED: EXP-RQ1)

 Empirical library usage frequency in 10 projects
 Mostly used: vtest, vauth, and vapi
 Least used: vlogs, vmonitor

SLIDE 13

LIBRARY USAGE IN PULL REQUESTS (ANSWERED: EXP-RQ2)

 30%-70% of pull requests used at least one of the 10 libraries  87%-100% of library authors recommended as code reviewers

in the projects using those libraries

 Library experience really matters!


[Chart: % of PRs using selected libraries vs. % of library authors selected as code reviewers, per project]

SLIDE 14

SPECIALIZED TECHNOLOGY USAGE IN PROJECTS (ANSWERED: EXP-RQ3)

 Empirical technology usage frequency in the top 10 commercial projects
 Champion technology: mapreduce

SLIDE 15

TECHNOLOGY USAGE IN PULL REQUESTS (ANSWERED: EXP-RQ3)

 20%-60% of the pull requests used at least one of the

10 specialized technologies.

 Mostly used in: ARM, CS and VBC 15

SLIDE 16

SUMMARY OF EXPLORATORY FINDINGS


About 50% of the pull requests use one or more of the selected libraries. (Exp-RQ1)
About 98% of the library authors were later recommended as pull request reviewers. (Exp-RQ2)
About 35% of the pull requests use one or more specialized technologies. (Exp-RQ3)
Library experience and specialized technology experience really matter in code reviewer selection/recommendation.

SLIDE 17

CORRECT: CODE REVIEWER RECOMMENDATION IN GITHUB USING CROSS-PROJECT & TECHNOLOGY EXPERIENCE

SLIDE 18

CORRECT: CODE REVIEWER RECOMMENDATION


[Diagram: the new PR is compared with the past PRs reviewed by reviewers R1, R2, and R3; each candidate is scored by review similarity.]
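In other words, CORRECT scores each candidate by accumulating the similarity between the new PR and every past PR that candidate reviewed. A minimal Python sketch of that ranking idea, assuming each PR has already been reduced to a bag of library/technology names (the token extraction and exact scoring in CORRECT may differ):

```python
from collections import defaultdict

def similarity(new_pr_tokens, old_pr_tokens):
    """Jaccard similarity between two PRs' library/technology token sets."""
    new_set, old_set = set(new_pr_tokens), set(old_pr_tokens)
    if not new_set or not old_set:
        return 0.0
    return len(new_set & old_set) / len(new_set | old_set)

def recommend_reviewers(new_pr_tokens, past_reviews, k=5):
    """past_reviews: (reviewer, pr_tokens) pairs from the review history."""
    scores = defaultdict(float)
    for reviewer, pr_tokens in past_reviews:
        scores[reviewer] += similarity(new_pr_tokens, pr_tokens)
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [reviewer for reviewer, _ in ranked[:k]]

# Hypothetical history: R2 reviewed the PRs sharing the most libraries.
history = [("R1", ["vauth", "vapi"]),
           ("R2", ["vauth", "mapreduce", "taskqueue"]),
           ("R3", ["vlogs"]),
           ("R2", ["urlfetch", "mapreduce"])]
print(recommend_reviewers(["vauth", "mapreduce"], history, k=3))
```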

SLIDE 19

OUR CONTRIBUTIONS


State-of-the-art (Thongtanunam et al, SANER 2015) matches the source files of a new PR against those of reviewed PRs; our proposed technique, CORRECT, matches their external libraries & specialized technologies instead.

[Diagram legend: new PR, reviewed PR, source file, external library & specialized technology]

SLIDE 20

EVALUATION OF CORRECT

 Two evaluations using (1) the Vendasta codebase and (2) open source software projects.

RQ1: Are library experience and technology experience useful proxies for code review skills?
RQ2: Does CORRECT outperform the baseline technique for reviewer recommendation?
RQ3: Does CORRECT perform equally/comparably for both private and public codebases?
RQ4: Does CORRECT show bias to any of the development frameworks?

SLIDE 21

EXPERIMENTAL DATASET

 Sliding window of 30 past requests for learning.
 Metrics: Top-K Accuracy, Mean Precision (MP), Mean Recall (MR), and Mean Reciprocal Rank (MRR) (see the sketch below).

Dataset: 10 Python projects (Vendasta, 13,081 pull requests) and 2 Python, 2 Java & 2 Ruby projects (OSS, 4,034 pull requests); past code reviews and code reviewers form the gold set.
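For reference, Top-K accuracy counts a recommendation as a hit when at least one gold-set reviewer appears in the top K, and MRR averages the reciprocal rank of the first correct reviewer. An illustrative sketch of these two metrics (not the study's evaluation code):

```python
def top_k_accuracy(recommendations, gold_sets, k=5):
    """Fraction of cases where a gold reviewer appears in the top k."""
    hits = sum(1 for recs, gold in zip(recommendations, gold_sets)
               if set(recs[:k]) & set(gold))
    return hits / len(gold_sets)

def mean_reciprocal_rank(recommendations, gold_sets):
    """Mean of 1/rank of the first correct reviewer (0 when absent)."""
    total = 0.0
    for recs, gold in zip(recommendations, gold_sets):
        for rank, reviewer in enumerate(recs, start=1):
            if reviewer in gold:
                total += 1.0 / rank
                break
    return total / len(gold_sets)

recs = [["R2", "R1", "R3"], ["R3", "R2", "R1"]]
gold = [{"R1"}, {"R9"}]
print(top_k_accuracy(recs, gold, k=2))   # 0.5
print(mean_reciprocal_rank(recs, gold))  # (1/2 + 0) / 2 = 0.25
```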

SLIDE 22

LIBRARY EXPERIENCE & TECHNOLOGY EXPERIENCE (ANSWERED: RQ1)

Metric   | Library Similarity (Top-3 / Top-5) | Technology Similarity (Top-3 / Top-5) | Combined Similarity (Top-3 / Top-5)
Accuracy | 83.57% / 92.02%                    | 82.18% / 91.83%                       | 83.75% / 92.15%
MRR      | 0.66 / 0.67                        | 0.62 / 0.64                           | 0.65 / 0.67
MP       | 65.93% / 85.28%                    | 62.99% / 83.93%                       | 65.98% / 85.93%
MR       | 58.34% / 80.77%                    | 55.77% / 79.50%                       | 58.43% / 81.39%


[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]

 Both library experience and technology experience are found to be good proxies, providing over 90% accuracy.
 Combined experience provides the maximum performance: 92.15% recommendation accuracy with 85.93% precision and 81.39% recall.
 Evaluation results align with the exploratory study findings.

SLIDE 23

COMPARATIVE STUDY FINDINGS (ANSWERED: RQ2)

 CORRECT performs better than the competing technique in all metrics (p-value = 0.003 < 0.05 for Top-5 accuracy).
 It performs better both on average and on individual projects.
 RevFinder computes PR similarity using source file name and file directory matching.


Metric   | RevFinder (Thongtanunam et al. SANER 2015), Top-5 | CORRECT, Top-5
Accuracy | 80.72% | 92.15%
MRR      | 0.65   | 0.67
MP       | 77.24% | 85.93%
MR       | 73.27% | 81.39%

[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]

SLIDE 24

COMPARISON ON OPEN SOURCE PROJECTS (ANSWERED: RQ3)

 In OSS projects, CORRECT also performs better than the baseline technique.
 85.20% accuracy with 84.76% precision and 78.73% recall; not significantly different from the earlier results (p-value = 0.239 > 0.05 for precision).
 Results for private and public codebases are quite close.

Metric   | RevFinder, Top-5 | CORRECT (OSS), Top-5 | CORRECT (VA), Top-5
Accuracy | 62.90% | 85.20% | 92.15%
MRR      | 0.55   | 0.69   | 0.67
MP       | 62.57% | 84.76% | 85.93%
MR       | 58.63% | 78.73% | 81.39%

[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]

SLIDE 25

COMPARISON ON DIFFERENT PLATFORMS (ANSWERED: RQ4)

Metric   | Python: Beets / St2 / Avg. | Java: OkHttp / Orientdb / Avg. | Ruby: Rubocop / Vagrant / Avg.
Accuracy | 93.06% / 79.20% / 86.13%   | 88.77% / 81.27% / 85.02%       | 89.53% / 79.38% / 84.46%
MRR      | 0.82 / 0.49 / 0.66         | 0.61 / 0.76 / 0.69             | 0.76 / 0.71 / 0.74
MP       | 93.06% / 77.85% / 85.46%   | 88.69% / 81.27% / 84.98%       | 88.49% / 79.17% / 83.83%
MR       | 87.36% / 74.54% / 80.95%   | 85.33% / 76.27% / 80.80%       | 81.49% / 67.36% / 74.43%

[ MP = Mean Precision, MR = Mean Recall, MRR = Mean Reciprocal Rank ]

 In OSS projects, results for the different platforms look surprisingly close, except for recall.
 Accuracy and precision are close to 85% on average.
 CORRECT does NOT show any bias toward a particular platform.

SLIDE 26

THREATS TO VALIDITY

 Threats to Internal Validity
   Skewed dataset: each of the 10 selected projects is medium sized (i.e., ~1.1K PRs) except CS.
 Threats to External Validity
   Limited OSS dataset: only 6 OSS projects considered, not sufficient for generalization.
   Issue of heavy PRs: PRs containing hundreds of files can make the recommendation slower.
 Threats to Construct Validity
   Top-K Accuracy: does the metric represent the effectiveness of the technique? It is widely used in the relevant literature (Thongtanunam et al, SANER 2015).

SLIDE 27

TAKE-HOME MESSAGES (PART I)


[Diagram: six numbered take-home messages shown as callouts]

SLIDE 28

Part II: Prediction Model for Code Review Usefulness (MSR 2017)

SLIDE 29

RESEARCH PROBLEM: USEFULNESS OF CODE REVIEW COMMENTS


 What makes a review comment useful or non-useful?
 34.5% of review comments are non-useful at Microsoft (Bosu et al., MSR 2015).
 No automated support so far to detect or improve such comments.
SLIDE 30

STUDY METHODOLOGY


[Pipeline: 1,482 review comments (4 systems) → manual tagging following Bosu et al., MSR 2015 → 880 useful and 602 non-useful comments → (1) comparative study, (2) prediction model]

SLIDE 31

COMPARATIVE STUDY: VARIABLES

Independent variables (8):
 Textual: Reading Ease, Stop Word Ratio, Question Ratio, Code Element Ratio, Conceptual Similarity
 Experience: Code Authorship, Code Reviewership, External Library Experience
Response variable (1): Comment Usefulness (Yes / No)


 Contrast between useful and non-useful comments.
 Two paradigms: comment texts and the commenter's/developer's experience.
 Answers two RQs, one for each paradigm.

SLIDE 32

ANSWERING RQ1: READING EASE

 Flesch-Kincaid Reading Ease applied.  No significant difference between useful and

non-useful review comments.
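As a side note, the Flesch Reading Ease score can be computed with an off-the-shelf package such as textstat; a quick sketch, assuming the package is installed (its syllable counter is heuristic, so scores are approximate):

```python
import textstat  # pip install textstat

comment = "Please extract this logic into a helper method to avoid duplication."
# Flesch Reading Ease = 206.835 - 1.015*(words/sentences) - 84.6*(syllables/words);
# higher scores (roughly 0-100) mean easier-to-read text.
print(textstat.flesch_reading_ease(comment))
```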

SLIDE 33

ANSWERING RQ1: STOP WORD RATIO

 Used the Google stop word list and Python keywords.
 Stop word ratio = #stop words or keywords / #all words in a review comment (see the sketch below).
 Non-useful comments contain significantly more stop words than useful comments.
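A minimal sketch of the stop word ratio, with a small stand-in stop word list (the study used the Google list, loaded from a file; Python's keyword module supplies the language keywords):

```python
import keyword

STOP_WORDS = {"the", "a", "an", "is", "are", "to", "of", "and", "or", "in"}  # stand-in for the Google list
PY_KEYWORDS = set(keyword.kwlist)

def stop_word_ratio(comment: str) -> float:
    words = comment.lower().split()
    if not words:
        return 0.0
    stops = sum(1 for w in words if w in STOP_WORDS or w in PY_KEYWORDS)
    return stops / len(words)

print(stop_word_ratio("Could you rename this for clarity?"))
```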

SLIDE 34

ANSWERING RQ1: QUESTION RATIO

 Developers treat clarification questions as non-useful review comments.
 Question ratio = #questions / #sentences in a comment (see the sketch below).
 No significant difference between useful and non-useful comments in question ratio.
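Question ratio can be approximated by counting sentences that end with a question mark; a rough sketch with naive sentence splitting:

```python
import re

def question_ratio(comment: str) -> float:
    # Naive split: break after ., ! or ? followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", comment.strip()) if s]
    if not sentences:
        return 0.0
    return sum(1 for s in sentences if s.endswith("?")) / len(sentences)

print(question_ratio("Why is this cast needed? Please add a comment."))  # 0.5
```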

SLIDE 35

ANSWERING RQ1: CODE ELEMENT RATIO

 Important code elements (e.g., identifiers) in the comment text possibly trigger the code change.
 Code element ratio = #source code tokens / #all tokens (see the sketch below).
 Useful comments have a significantly higher code element ratio than non-useful comments.
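Spotting source code tokens in free text needs a heuristic; camelCase, snake_case, dotted names, and trailing parentheses are common cues. An illustrative sketch (the study's actual token detector may differ):

```python
import re

# Heuristic cues for code: camelCase, snake_case, dotted names, call parentheses.
CODE_TOKEN = re.compile(r"[a-z]+[A-Z]\w*|\w+_\w+|\w+\.\w+|\w+\(\)")

def code_element_ratio(comment: str) -> float:
    tokens = comment.split()
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if CODE_TOKEN.search(t)) / len(tokens)

print(code_element_ratio("getUser() should reuse the cached_session here"))  # 2/6
```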

SLIDE 36

ANSWERING RQ1: CONCEPTUAL SIMILARITY BETWEEN COMMENTS & CHANGED CODE

 How relevant is the comment to the changed code?
 Do comments & changed code share vocabularies?
 Yes, and useful comments share significantly more vocabulary than non-useful ones (see the sketch below).
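One plausible way to measure that vocabulary sharing is cosine similarity over token counts; a small sketch under that assumption (the paper's exact similarity measure may differ):

```python
import math
from collections import Counter

def cosine_similarity(text_a: str, text_b: str) -> float:
    a, b = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) \
         * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

comment = "rename temp_list and return early"
changed_code = "temp_list = [] ... return early"
print(cosine_similarity(comment, changed_code))
```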

SLIDE 37

ANSWERING RQ2: CODE AUTHORSHIP

 File level authorship did not make much difference, which is a bit counter-intuitive.
 Project level authorship differs between useful and non-useful comments, mostly for Q2 and Q3.

SLIDE 38

ANSWERING RQ2: CODE REVIEWERSHIP

 Does reviewing experience matter in providing useful comments?
 Yes, it does. File level reviewing experience matters, especially for Q2 and Q3.
 Experienced reviewers provide more useful comments than non-useful ones.

SLIDE 39

ANSWERING RQ2: EXT. LIB. EXPERIENCE

 Familiarity with the libraries used in the changed code on which the comment is posted.
 Significantly higher for the authors of useful comments, but only for Q3.

SLIDE 40

SUMMARY OF COMPARATIVE STUDY


RQ  | Independent Variable     | Useful vs. Non-useful Difference
RQ1 | Reading Ease             | Not significant
RQ1 | Stop Word Ratio          | Significant
RQ1 | Question Ratio           | Not significant
RQ1 | Code Element Ratio       | Significant
RQ1 | Conceptual Similarity    | Significant
RQ2 | Code Authorship          | Somewhat significant
RQ2 | Code Reviewership        | Significant
RQ2 | External Lib. Experience | Somewhat significant

SLIDE 41

EXPERIMENTAL DATASET & SETUP


[Pipeline: 1,482 code review comments → evaluation set (1,116) for model training & cross-validation; validation set (366) for validation with unseen comments]

SLIDE 42

REVHELPER: USEFULNESS PREDICTION MODEL

[Pipeline: review comments → manual classification using Bosu et al. → useful & non-useful comments → model training → prediction model → applied to new review comments]

 Predicts the usefulness of a new review comment at submission time.
 Applied three ML algorithms: NB, LR, and RF (see the sketch below).
 Evaluation & validation with different data sets.
 Answered 3 RQs: RQ3, RQ4 and RQ5.
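With the eight features extracted per comment, training and cross-validating the three classifiers is straightforward in scikit-learn; a hedged sketch with random stand-in data in place of the real feature vectors:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB

rng = np.random.default_rng(0)
X = rng.random((1116, 8))           # 8 features per comment (stand-in values)
y = rng.integers(0, 2, size=1116)   # 1 = useful, 0 = non-useful (stand-in labels)

for name, model in [("NB", GaussianNB()),
                    ("LR", LogisticRegression(max_iter=1000)),
                    ("RF", RandomForestClassifier(n_estimators=100, random_state=0))]:
    scores = cross_val_score(model, X, y, cv=10)  # 10-fold cross-validation
    print(name, round(scores.mean(), 3))
```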

SLIDE 43

ANSWERING RQ3: MODEL PERFORMANCE

Learning Algorithm  | Useful: Precision / Recall | Non-useful: Precision / Recall
Naïve Bayes         | 61.30% / 66.00%            | 53.30% / 48.20%
Logistic Regression | 60.70% / 71.40%            | 54.60% / 42.80%
Random Forest       | 67.93% / 75.04%            | 63.06% / 54.54%


 The Random Forest based model performs the best.
 Both F1-score and accuracy are 66%.
 Comment usefulness and the features are not linearly correlated.
 As a primer, this prediction could be useful.

SLIDE 44

ANSWERING RQ4: ROLE OF PARADIGMS

SLIDE 45

ANSWERING RQ4: ROLE OF PARADIGMS

SLIDE 46

ANSWERING RQ5: COMPARISON WITH BASELINE (VALIDATION)

SLIDE 47

ANSWERING RQ5: COMPARISON WITH BASELINE (ROC)

SLIDE 48

TAKE-HOME MESSAGES (PART II)

 Usefulness of review comments is complex but much needed information.
 No automated support has been available so far to instantly predict the usefulness of review comments.
 Non-useful comments differ significantly from useful comments in several textual features (e.g., conceptual similarity).
 Reviewing experience matters in providing useful review comments.
 Our prediction model can predict the usefulness of a new review comment.
 RevHelper performs better than random guessing and the available alternatives.

SLIDE 49

Part III: Impact of Continuous Integration on Code Reviews (MSR 2017 Challenge)

SLIDE 50

TAKE-HOME MESSAGE (PART III)

 Automated builds might influence manual code reviews, since the two interleave in modern pull-based development.
 Passed builds are more associated with review participation and with new code reviews.
 Frequently built projects received more review comments than less frequently built ones.
 Code review activity is steady over time in frequently built projects; not so in their counterparts.
 Our prediction model can predict whether a build will trigger a new code review or not.

SLIDE 51

REPLICATION PACKAGES

 CORRECT, RevHelper & Travis CI Miner
   http://www.usask.ca/~masud.rahman/correct/
   http://www.usask.ca/~masud.rahman/revhelper/
   http://www.usask.ca/~masud.rahman/msrch/travis/

Please contact Masud Rahman (masud.rahman@usask.ca) for further details about these studies and replications.

SLIDE 52

PUBLISHED PAPERS

[1] M.M. Rahman, C.K. Roy, and J. Collins, "CORRECT: Code Reviewer Recommendation in GitHub Based on Cross-Project and Technology Experience", In Proceedings of the 38th International Conference on Software Engineering Companion (ICSE-C 2016), pp. 222--231, Austin, Texas, USA, May 2016.

[2] M.M. Rahman, C.K. Roy, J. Redl, and J. Collins, "CORRECT: Code Reviewer Recommendation at GitHub for Vendasta Technologies", In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016), pp. 792--797, Singapore, September 2016.

[3] M.M. Rahman, C.K. Roy, and R.G. Kula, "Predicting Usefulness of Code Review Comments using Textual Features and Developer Experience", In Proceedings of the 14th International Conference on Mining Software Repositories (MSR 2017), pp. 215--226, Buenos Aires, Argentina, May 2017.

[4] M.M. Rahman and C.K. Roy, "Impact of Continuous Integration on Code Reviews", In Proceedings of the 14th International Conference on Mining Software Repositories (MSR 2017), pp. 499--502, Buenos Aires, Argentina, May 2017.

SLIDE 53

THANK YOU!! QUESTIONS?


Email: chanchal.roy@usask.ca or masud.rahman@usask.ca