DerivingCouplingMetricsfromCall Graphs - PowerPoint PPT Presentation

Deriving Coupling Metrics from Call  Graphs  Simon Allier, Stéphane Vaucher,  Bruno Dufour, Houari Sahraoui  DIRO,   Université de Montréal 

So@ware metrics   So@ware metrics are widely used for:  • QuanEfying so@ware quality using models  • PredicEng so@ware aIributes (e.g. fault‐proneness)  • Summarizing complex systems  • Studying the evoluEon of so@ware systems over Eme  • …   Metrics are o@en defined in high‐level, language‐ agnosEc ways 

Ambiguity in metric definiEons   Metric definiEons use high‐level concepts that leave  room for different interpretaEons  • e.g. “class  c  uses class  d ”   Even aIempts to formalize metric definiEons usually  result in ambiguity  • e.g. “methods from class  c ”   The same metric definiEon can lead to different tool  implementaEons   Different choices to resolve ambiguity can lead to  wide variaEons in metric values 

Example ‐ Coupling Between Objects (CBO)   Two disEnct classes  c  and  d  are  coupled  if either  • c  uses  d , or  • d  uses  c    A class  c  uses  a class  d  if either  •   c  calls at least one method from  d , or   • c  reads or writes at least one field from  d   Q: How to compute the set of classes used by  c  without  execuEng the program? 

How exisEng tools compute CBO  Tool  Considers method invoca9ons?  Together  ✓ Uses declared targets  CKJM  ✓ Uses declared targets  MASU  ✓ Uses declared targets  POM  ✓ Uses declared targets  Aivosto  ✓ Uses declared types  Jhawk  ✗  Counts referenced types  Powertools  ✗  Counts associaEon types  McCabe IQ  ✗  Counts external references  The tools exhibit a wide number of  variaEons on the same definiEon  

Goals   Study several factors that can vary between metric  implementaEons for a sample of exisEng metrics  • In this talk, we use CBO as a running example   Evaluate the impact of these factors on computed  metric result  • We focus on two factors: polymorphism and dynamic class  loading (other factors are fixed) 

Outline   FormalizaEon of CBO definiEon for dynamic language  features   Empirical study   Related work & conclusions 

A more precise definiEon of CBO   Recall that two disEnct classes  c  and  d  are  coupled  if  either  • c  uses  d , or  • d  uses  c    A class  c  uses  a class  d  if either  • c   polymorphically invokes  at least one method  implemented  in  d , or   • c  reads or writes at least one field  implemented  in  d  (Note: « implemented in  d  » excludes superclasses) 

Polymorphically invoked methods   Given a call in method  m , how to determine the set  of all methods that can be invoked at runEme?  • This is a well‐studied problem in program analysis, i.e. call  graph construcEon  • Several algorithms exist that make various tradeoffs  between cost and precision 

Call graph construcEon  void main() { void useA(A a) { a.m(); B b1 = new B(); } C c = new C(); useA(b1); void useB(B b2) { useB(c); b2.m() } } A  m()  main  B  m()  useA  useB  m()  m()  C  D  A.m  B.m  C.m  D.m 

Call graph construcEon  void main() { void useA(A a) { a.m(); B b1 = new B(); } C c = new C(); useA(b1); void useB(B b2) { useB(c); b2.m() } } A  m()  main  B  m()  useA  useB  m()  m()  C  D  A.m  B.m  C.m  D.m  Declared Target (DT) 

Call graph construcEon  void main() { void useA(A a) { a.m(); B b1 = new B(); } C c = new C(); useA(b1); void useB(B b2) { useB(c); b2.m() } } A  m()  main  B  m()  useA  useB  m()  m()  C  D  A.m  B.m  C.m  D.m  Class Hierarchy Analysis (CHA) 

Call graph construcEon  void main() { void useA(A a) { a.m(); B b1 = new B(); } C c = new C(); useA(b1); void useB(B b2) { useB(c); b2.m() } } A  m()  main  B  m()  useA  useB  m()  m()  C  D  A.m  B.m  C.m  D.m  Rapid Type Analysis (RTA) 

Call graph construcEon  void main() { void useA(A a) { a.m(); B b1 = new B(); } C c = new C(); useA(b1); void useB(B b2) { useB(c); b2.m() } } A  m()  main  B  m()  useA  useB  m()  m()  C  D  A.m  B.m  C.m  D.m  Variable Type Analysis (VTA) 

Dynamic class loading  void foo() { Class c = Class.forName("MyClass"); MyClass obj = (MyClass) c.newInstance(); obj.m(); // Use the object ... }  Two main strategies:  • Ignore dynamic class loading  • Assume all applicaEon classes can be loaded reflecEvely   To avoid imprecision, we ignore calls to no‐arg  constructors from  newInstance

Experiments 

Experimental sefng  Benchmark  Classes  Interfaces  ArgoUML 0.18.1  1237  100  Azureus 2.1.0.0  1232  250   5 call graph algorithms implemented using Soot:  • DT, CHA, RTA  • VTA (no dynamic class loading)  • VTAd (supports dynamic class loading)   IBM JVM 6.0, Opteron 2Ghz, 8GB RAM, FC7 Linux 

Call graph sizes  ArgoUML  Azureus  Algorithm  Nodes  Edges  Nodes  Edges  CHA  36 872  1 113 377  27 825  384 330  RTA  36 642  1 102 549  27 749  383 650  VTA  32 085  715 109  25 377  279 392  VTAd  36 632  1 858 348  27 076  613 025 

Dead code   ConservaEve algorithms (CHA and VTAd) can underesEmate  the amount of dead code   Unsafe algorithms (DT) can both underapproximate and  overapproximate the amount of dead code 

Polymorphism   DT algorithm can underapproximate the coupling as  compared to VTAd for both CBO‐In and CBO‐Out   CHA can mainly overapproximate CBO‐In 

Dynamic class loading   Very significant difference in CBO between VTA and VTAd due  to a non‐trivial use of dynamic loading 

Related work   StaEc coupling metrics  • e.g. Chidamber and Kemerer, Briand et al., Briand & Wüst   Dynamic coupling metrics  • e.g. Arisholm  et al.,  Yacoub  et al.   Metrics & program analysis  • e.g. Harman  et al.,  Myers & Binkley   Comparing so@ware metrics tools  • e.g. Lincke  et al.  

Conclusions   SophisEcated computaEon methods are necessary  when capturing coupling in the presence of dynamic  features   For programs with a non‐trivial class hierarchy and a  significant use of polymorphism, the choice of CG  building algorithm can have an important impact on  the computed coupling   When deciding how to implement a metric tool, one  needs to consider how the metrics will be used  • e.g. program understanding vs. change impact 

Addi9onal slides 

Running Emes  ArgoUML  Azureus  Algorithm  CG  Metrics  Total  CG  Metrics  Total  DT  0:00  0:49  0:49  0:00  0:48  0:48  CHA  5:11  3:59  9:10  3:15  2:28  5:43  RTA  35:43  4:03  39:46  23:46  2:21  26:07  VTA  12:42  2:31  15:13  7:30  0:50  8:20  VTAd  14:47  2:55  17:42  11:44  1:28  13:12 

DerivingCouplingMetricsfromCall Graphs - PowerPoint PPT Presentation

DerivingCouplingMetricsfromCall Graphs SimonAllier,StphaneVaucher, BrunoDufour,HouariSahraoui DIRO, UniversitdeMontral So@waremetrics

A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE

Module 1: Introduction Deriving Business Information Deriving meaningful information from

Deriving Filtering Algorithms Deriving Filtering Algorithms from Constraint Checkers from

Deriving Consensus for Multi-Parallel Corpora: An English Bible Study Patrick Xia David

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Coupling On-line and Off-line Random Graphs Woojin Kim March 1st Introduction Preliminary

PWSCF and new charge density PWSCF call read_input_file (input.f90) call run_pwscf call setup

Day Ahead Coupling Oct 3 rd 2011 Workshop Day Ahead Coupling Implicit auctions; Single

THE BREADTH & WIDTH OF THE WFD REGULATING ADAPTIVE WATER MANAGEMENT TIINA PALONIITTY /

Modeling Limits Jaroslav Neetil Patrice Ossona de Mendez Charles University CAMS, CNRS/EHESS

Hybrid Reduced-Order Modeling and Particle-Kalman Filtering for the Health Monitoring of Flexible

Adaptive model-based dose selection methods Francois Vandenhende, Ph.D. CEO, Clinbay

How Good Are the Specs? A Study of the Bug-Finding Effectiveness of Existing Java API

Production of Benzene, Toluene, and Xylenes from Natural Gas via Methanol: A Process Synthesis

Mind The Gap! Setting Up A Code Structure Building Bridges Representation Of

s t sss r

DerivingCouplingMetricsfromCall Graphs - PowerPoint PPT Presentation

DerivingCouplingMetricsfromCall Graphs SimonAllier,StphaneVaucher, BrunoDufour,HouariSahraoui DIRO, UniversitdeMontral So@waremetrics

A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE TO CALL HOME A PLACE

Module 1: Introduction Deriving Business Information Deriving meaningful information from

Deriving Filtering Algorithms Deriving Filtering Algorithms from Constraint Checkers from

Deriving Consensus for Multi-Parallel Corpora: An English Bible Study Patrick Xia David

Graphs () Graphs () Graphs Graphs Graphs are collections of nodes

Weighted graphs Weighted graphs Weighted graphs Weighted graphs Graphs with numbers, called

Week 4 Kullmann Graphs and directed graphs Elementary Graph Algorithms Representing graphs

On some classes of Deza graphs Deza graphs without 3-cocliques Line graphs V.V. Kabanov 1 Deza

Graphs Graphs Examples Definitions Implementation/Representation of graphs Graphs

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Coupling On-line and Off-line Random Graphs Woojin Kim March 1st Introduction Preliminary

PWSCF and new charge density PWSCF call read_input_file (input.f90) call run_pwscf call setup

Day Ahead Coupling Oct 3 rd 2011 Workshop Day Ahead Coupling Implicit auctions; Single

THE BREADTH &amp; WIDTH OF THE WFD REGULATING ADAPTIVE WATER MANAGEMENT TIINA PALONIITTY /

Modeling Limits Jaroslav Neetil Patrice Ossona de Mendez Charles University CAMS, CNRS/EHESS

Hybrid Reduced-Order Modeling and Particle-Kalman Filtering for the Health Monitoring of Flexible

Adaptive model-based dose selection methods Francois Vandenhende, Ph.D. CEO, Clinbay

How Good Are the Specs? A Study of the Bug-Finding Effectiveness of Existing Java API

Production of Benzene, Toluene, and Xylenes from Natural Gas via Methanol: A Process Synthesis

Mind The Gap! Setting Up A Code Structure Building Bridges Representation Of

s t sss r

THE BREADTH & WIDTH OF THE WFD REGULATING ADAPTIVE WATER MANAGEMENT TIINA PALONIITTY /