Toward Predicting Architectural Significance of Implementation - - PowerPoint PPT Presentation

toward predicting architectural significance of
SMART_READER_LITE
LIVE PREVIEW

Toward Predicting Architectural Significance of Implementation - - PowerPoint PPT Presentation

MSR 2018 Toward Predicting Architectural Significance of Implementation Issues Arman Shahbazian, Daye Nam, and Nenad Medvidovic University of Southern California Mo Motivation Mo Motivation Mo Motivation B A C Mo Motivation B A C


slide-1
SLIDE 1

Toward Predicting Architectural Significance of Implementation Issues

Arman Shahbazian, Daye Nam, and Nenad Medvidovic University of Southern California

MSR 2018

slide-2
SLIDE 2

Mo Motivation

slide-3
SLIDE 3

Mo Motivation

slide-4
SLIDE 4

A

B

C

Mo Motivation

slide-5
SLIDE 5

A

B

C

Mo Motivation

slide-6
SLIDE 6

A

B

C

Mo Motivation

slide-7
SLIDE 7

Numerous Design Decisions Inadvertent Architectural Changes

A

B

C

Mo Motivation

slide-8
SLIDE 8

Numerous Design Decisions

  • Accumulation of Technical Debt
  • Deterioration of Software Quality

Inadvertent Architectural Changes

A

B

C

Mo Motivation

slide-9
SLIDE 9

A

B

C

Mo Motivation

slide-10
SLIDE 10

A

B

C

Architecturally Significant

Mo Motivation

slide-11
SLIDE 11

A

B

C

Architecturally Significant

Mo Motivation

slide-12
SLIDE 12

Da Dataset Cl Classifier De Detect ction

Con Contribution

  • ns
slide-13
SLIDE 13

Da Dataset Cl Classifier De Detect ction

Con Contribution

  • ns
slide-14
SLIDE 14

Da Dataset Cl Classifier De Detect ction

Con Contribution

  • ns
slide-15
SLIDE 15

Da Dataset Cl Classifier De Detect ction

Con Contribution

  • ns
slide-16
SLIDE 16

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

De Detectio tion + + Da Datas aset

slide-17
SLIDE 17

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

De Detectio tion + + Da Datas aset

slide-18
SLIDE 18

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

De Detectio tion + + Da Datas aset

slide-19
SLIDE 19

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

De Detectio tion + + Da Datas aset

slide-20
SLIDE 20

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

0.12.0 0.12.1

De Detectio tion + + Da Datas aset

slide-21
SLIDE 21

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

Recover

release-0.12.0 release-0.12.1

De Detectio tion + + Da Datas aset

slide-22
SLIDE 22

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

Before After

release-0.12.0 release-0.12.1

De Detectio tion + + Da Datas aset

slide-23
SLIDE 23

release-0.12.0 release-0.12.1

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

Before After

Metric

De Detectio tion + + Da Datas aset

slide-24
SLIDE 24

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

Before After

release-0.12.0 release-0.12.1

Metric

De Detectio tion + + Da Datas aset

slide-25
SLIDE 25

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

De Detectio tion + + Da Datas aset

slide-26
SLIDE 26

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

https://softarch.usc.edu/predictar

De Detectio tion + + Da Datas aset

slide-27
SLIDE 27

Feature Vectors Significant Issues Unsignificant Issues One-Hot Encoding TF (Term Frequencies) Non-Textual Contents Textual Contents

Classifier

Con Contribution

  • ns
slide-28
SLIDE 28

Precision Recall Hadoop 0.838 0.592 Nutch 0.946 0.247 Wicket 0.761 0.537 Cxf 0.865 0.538 OpenJpa 0.934 0.451 Cross-Project 0.811 0.583

Ev Evaluation

slide-29
SLIDE 29

Precision Recall Hadoop 0.838 0.592 Nutch 0.946 0.247 Wicket 0.761 0.537 Cxf 0.865 0.538 OpenJpa 0.934 0.451 Cross-Project 0.811 0.583

Ev Evaluation

slide-30
SLIDE 30

Precision Recall Hadoop 0.838 0.592 Nutch 0.946 0.247 Wicket 0.761 0.537 Cxf 0.865 0.538 OpenJpa 0.934 0.451 Cross-Project 0.811 0.583

Ev Evaluation

slide-31
SLIDE 31

Precision Recall Hadoop 0.838 0.592 Nutch 0.946 0.247 Wicket 0.761 0.537 Cxf 0.865 0.538 OpenJpa 0.934 0.451 Cross-Project 0.811 0.583

Ev Evaluation

slide-32
SLIDE 32
  • Automatically detecting architecturally significant issues,
  • A reusable dataset of over 21,000 issues,
  • Classifying them based on information contained in each issue.

Su Summar ary

  • Expand to more systems by adding the support for other issue trackers,
  • Improve the performance by adding more data,
  • Improve the performance by adapting new model in Machine Learning.

Fu Future Wo Work

Con Conclusion

  • n
slide-33
SLIDE 33

THANK YOU

(armansha@usc.edu, dayenam@usc.edu, neno@usc.edu)

slide-34
SLIDE 34

Data Collection

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

  • ACDC: Algorithm for Comprehension-Driven Clustering
  • Structural pattern-based clustering
  • ARC: Architecture Recovery using Concerns
  • Concern-based hierarchical clustering based on similarity measure
slide-35
SLIDE 35

Evaluation

ARC ACDC Precision Recall Precision Recall Hadoop 0.793 0.637 0.883 0.547 Nutch 0.941 0.276 0.951 0.217 Wicket 0.843 0.657 0.678 0.417 Cxf 0.801 0.698 0.928 0.468 OpenJpa 0.965 0.503 0.903 0.399 Cross-Project 0.816 0.592 0.806 0.573

slide-36
SLIDE 36

Evaluation

ARC ACDC Precision Recall Precision Recall Hadoop 0.793 0.637 0.883 0.547 Nutch 0.941 0.276 0.951 0.217 Wicket 0.843 0.657 0.678 0.417 Cxf 0.801 0.698 0.928 0.468 OpenJpa 0.965 0.503 0.903 0.399 Cross-Project 0.816 0.592 0.806 0.573

slide-37
SLIDE 37

Evaluation

ARC ACDC Precision Recall Precision Recall Hadoop 0.793 0.637 0.883 0.547 Nutch 0.941 0.276 0.951 0.217 Wicket 0.843 0.657 0.678 0.417 Cxf 0.801 0.698 0.928 0.468 OpenJpa 0.965 0.503 0.903 0.399 Cross-Project 0.816 0.592 0.806 0.573

slide-38
SLIDE 38

Evaluation

ARC ACDC Precision Recall Precision Recall Hadoop 0.793 0.637 0.883 0.547 Nutch 0.941 0.276 0.951 0.217 Wicket 0.843 0.657 0.678 0.417 Cxf 0.801 0.698 0.928 0.468 OpenJpa 0.965 0.503 0.903 0.399 Cross-Project 0.816 0.592 0.806 0.573

slide-39
SLIDE 39

ARMAN SHAHBAZIAN, DAYE NAM, NENAD MEDVIDOVIC UNIVERSITY OF SOUTHERN CALIFORNIA

Toward Predicting Architectural Significance of Implementation Issues

MSR 2018

slide-40
SLIDE 40

Da Dataset

A Dataset of 21,062 Issues Identified Across 5 Large OSSs

De Detect ction

Automatic Detection of Architecturally Significant Issues

Cl Classifier

A Classifier Architectural Significance

  • f New Issue

Contributions

slide-41
SLIDE 41

Data Collection

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

slide-42
SLIDE 42

Future Works

slide-43
SLIDE 43

Motivation

slide-44
SLIDE 44

Data Collection

Commit Analysis Architecture Recovery Change Detection For each issue: Significant Issues Issues After Before System Architectures

slide-45
SLIDE 45

Contributions Da Dataset

A Dataset of 21,062 Issues Identified Across 5 Large OSSs

De Detect ction

Automatic Detection of Architecturally Significant Issues

Cl Classifier

A Classifier Architectural Significance

  • f New Issue