Dependence Communities in Software Crest COW UCL Sebastian Danicic - PowerPoint PPT Presentation

Dependence Communities in Software Crest COW UCL Sebastian Danicic and James Hamilton Goldsmiths, University of London 30th April 2012 1 / 58

Communities in Graphs A network is said to have community structure if the nodes of the network can be easily grouped into (potentially overlapping) sets of nodes such that each set of nodes is densely connected internally, with few connections to the rest of the network. 2 / 58

Communities in Real World Graphs Many real-world networks are known to have community structure. 3 / 58

Communities in Real World Graphs Many real-world networks are known to have community structure. Social networks 4 / 58

Communities in Real World Graphs Many real-world networks are known to have community structure. Social networks Biological networks 5 / 58

Communities in Real World Graphs Many real-world networks are known to have community structure. Social networks Biological networks Computer networks 6 / 58

Communities in Real World Graphs Many real-world networks are known to have community structure. Social networks Biological networks Computer networks Not all networks have community structure e.g. random graphs 7 / 58

Communities in Real World Graphs “Graphical representation of the network of communities extracted from a Belgian mobile phone network. About 2M customers are represented on this network. The size of a node is proportional to the number of individuals in the corresponding community and its colour on a red-green scale represents the main language spoken in the community (red for French and green for Dutch). Only the communities composed of more than 100 customers have been plotted. Notice the intermediate community of mixed colours between the two main language clusters. A zoom at higher resolution reveals that it is made of several sub-communities with less apparent language separation.” (Blondel, V. D., Guillaume, J.-L., Lambiotte, R., & Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008. doi:10.1088/1742-5468/2008/10/P10008 ) 8 / 58

Does Software have Community Structure? 9 / 58

Does Software have Community Structure? It depends how you turn the software into a graph. 10 / 58

Does Software have Community Structure? It depends how you turn the software into a graph. Consider, the graph: G 1 ( P ) = n 1 → n 2 if and only if n 1 and n 2 are in the same function in program P . 11 / 58

Does Software have Community Structure? It depends how you turn the software into a graph. Consider, the graph: G 1 ( P ) = n 1 → n 2 if and only if n 1 and n 2 are in the same function in program P . Clearly G 1 has community structure but it’s not very interesting! 12 / 58

Does Software have Community Structure? It depends how you turn the software into a graph. Consider, the graph: G 1 ( P ) = n 1 → n 2 if and only if n 1 and n 2 are in the same function in program P . Clearly G 1 has community structure but it’s not very interesting! Previous work has shown community structure exists in class dependence graphs. 13 / 58

‘Interesting’ Communities in Software We are looking for communites which reflect semantic properties of programs. 14 / 58

‘Interesting’ Communities in Software We are looking for communites which reflect semantic properties of programs. Where do we start? 15 / 58

‘Interesting’ Communities in Software We are looking for communites which reflect semantic properties of programs. Where do we start? We have to choose graphs which reflect semantic properties of programs. 16 / 58

‘Interesting’ Communities in Software We are looking for communites which reflect semantic properties of programs. Where do we start? We have to choose graphs which reflect semantic properties of programs. We then find communites in these graphs. 17 / 58

‘Interesting’ Communities in Software We are looking for communites which reflect semantic properties of programs. Where do we start? We have to choose graphs which reflect semantic properties of programs. We then find communites in these graphs. Finally we see if these communities reflect anything semantic. 18 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . 19 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . In other words, n 1 → n 2 if and only if n 1 depends on n 2 in P . 20 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . In other words, n 1 → n 2 if and only if n 1 depends on n 2 in P . Clearly, slice graphs can be considered ‘semantic’. 21 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . In other words, n 1 → n 2 if and only if n 1 depends on n 2 in P . Clearly, slice graphs can be considered ‘semantic’. Question: Do slice graphs have community structure, and if so are the communites ‘interesting’ or ‘useful’? 22 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . In other words, n 1 → n 2 if and only if n 1 depends on n 2 in P . Clearly, slice graphs can be considered ‘semantic’. Question: Do slice graphs have community structure, and if so are the communites ‘interesting’ or ‘useful’? Intuitively, a community in a slice graph is a part of a program where there is strong internal inter-dependence. 23 / 58

Slice Graphs S ( P ) = n 1 → n 2 if and only if n 2 is in the slice of P with respect to n 1 . In other words, n 1 → n 2 if and only if n 1 depends on n 2 in P . Clearly, slice graphs can be considered ‘semantic’. Question: Do slice graphs have community structure, and if so are the communites ‘interesting’ or ‘useful’? Intuitively, a community in a slice graph is a part of a program where there is strong internal inter-dependence. Perhaps dependence communities will highlight different semantic concerns within a program. 24 / 58

Modularity Given a partition of a network, modularity is a measure of the ‘strength’ of the community structure of this partition. (fraction of edges that fall (expected number of edges Q = (1) within communities in the − within those communities given graph) in the null model ) 25 / 58

Modularity Given a partition of a network, modularity is a measure of the ‘strength’ of the community structure of this partition. (fraction of edges that fall (expected number of edges Q = (1) within communities in the − within those communities given graph) in the null model ) Modularity, of a weighted undirected graph, is defined as Q = 1 � � � δ ( c i , c j ) (2) A ij − E ij 2 m i , j where A ij is the weight of the edge incident to i and j , k i = � j A ij is the sum of the weights of the edges incident to vertex i , c i is the community to which vertex i is assigned, δ ( u , v ) is 1 if i and j are in the same community and 0 otherwise and m = 1 � i , j A ij . E ij is the expected 2 number of edges between i and j in a random graph of the same degree distribution which can be calculated as k i k j 2 m . 26 / 58

Algorithms for Finding Comunities Finding partitions with the best modularity is NP-hard but tractable algortihms exist for aproximation best possible exist. 27 / 58

Algorithms for Finding Comunities Finding partitions with the best modularity is NP-hard but tractable algortihms exist for aproximation best possible exist. The Louvain method is a fast algorithm for detecting communities in large networks based upon modularity maximisation. 28 / 58

Algorithms for Finding Comunities Finding partitions with the best modularity is NP-hard but tractable algortihms exist for aproximation best possible exist. The Louvain method is a fast algorithm for detecting communities in large networks based upon modularity maximisation. The algorithm combines neighbouring nodes until a local maximum of modularity is reached and then creates a new network of communities; these two steps are repeated until there is no further increase in modularity. 29 / 58

Algorithms for Finding Comunities Finding partitions with the best modularity is NP-hard but tractable algortihms exist for aproximation best possible exist. The Louvain method is a fast algorithm for detecting communities in large networks based upon modularity maximisation. The algorithm combines neighbouring nodes until a local maximum of modularity is reached and then creates a new network of communities; these two steps are repeated until there is no further increase in modularity. This creates a hierarchical decomposition of the network - at the lowest level all nodes are in their own community, and at the highest level nodes are in communities which gives the highest gain in modularity. 30 / 58

Dependence Communities in Software Crest COW UCL Sebastian Danicic - PowerPoint PPT Presentation

Dependence Communities in Software Crest COW UCL Sebastian Danicic and James Hamilton Goldsmiths, University of London 30th April 2012 1 / 58 Communities in Graphs A network is said to have community structure if the nodes of the network can

Measuring Dependence and Conditional Dependence with Kernels Kenji Fukumizu The Institute of

Linear dependence and independence Linear dependence 1 Definition (linear (in)dependence) Let {

Treating Tobacco Treating Tobacco Treating Tobacco Treating Tobacco Dependence and Providing

Control-dependence Analysis 2 Control-dependence Analysis 1. Introduction (motivation, overview)

More refined representations Control dependence graph Problem: control-flow edges in CFG

Unusual compositional dependence of the Unusual compositional dependence of the exciton reduced

Data Dependence in Data Dependence in Combining Classifiers Combining Classifiers Mohamed

Local Dependence and Persistence in Discrete Sliding Window Processes Ohad N. Feldheim Joint

From Data to Effects Dependence Graphs: Source-to-Source Transformations for C CPC 2015 Nelson

Energy Dependence of Multiplicity Fluctuations in Heavy Ion Collisions Benjamin Lungwitz, IKF

The Expressive Power of Backround Modal Dependence Logic Modal logic Team semantics Modal

Dependence in Games & Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Priority and Particle Physics: structure, dependence, and moderation in all things Kerry McKenzie

Sensitive dependence on initial conditions in maps Marc R. Roussel November 28, 2019 Marc R.

Chapter 3 Structural breaks for models with path dependence 2 Chapter 3 Path dependence (p.

Dependence: Theory and Practice Introduction to loop dependence and loop transformation 1 The

Things break. riak bends. Justin Sheehy justin@basho.com Perfection is Unattainable A system

Marco Baldi & Kazuya Koyama CSWG Task-6: Numerical tools for non-standard cosmological

Adiabatic vs Sudden Flux Insertion and Nonlinear Electric Conduction Talk 2 by Masaki Oshikawa

Spatial Collectives and Causality Antony Galton Department of Mathematics and Computer Science

rss trs rttrs r

Fractions, Decimals, Ratios and Percentages Fraction Line Denominator

Introduction to C Programming Basics of Programming (2) Variables Standard Input/Output (2)

Finishing Point and PointTest Formal vs. Actual Parameters formal parameter: in declaration of

Dependence Communities in Software Crest COW UCL Sebastian Danicic - PowerPoint PPT Presentation

Dependence Communities in Software Crest COW UCL Sebastian Danicic and James Hamilton Goldsmiths, University of London 30th April 2012 1 / 58 Communities in Graphs A network is said to have community structure if the nodes of the network can

Measuring Dependence and Conditional Dependence with Kernels Kenji Fukumizu The Institute of

Linear dependence and independence Linear dependence 1 Definition (linear (in)dependence) Let {

Treating Tobacco Treating Tobacco Treating Tobacco Treating Tobacco Dependence and Providing

Control-dependence Analysis 2 Control-dependence Analysis 1. Introduction (motivation, overview)

More refined representations Control dependence graph Problem: control-flow edges in CFG

Unusual compositional dependence of the Unusual compositional dependence of the exciton reduced

Data Dependence in Data Dependence in Combining Classifiers Combining Classifiers Mohamed

Local Dependence and Persistence in Discrete Sliding Window Processes Ohad N. Feldheim Joint

From Data to Effects Dependence Graphs: Source-to-Source Transformations for C CPC 2015 Nelson

Energy Dependence of Multiplicity Fluctuations in Heavy Ion Collisions Benjamin Lungwitz, IKF

The Expressive Power of Backround Modal Dependence Logic Modal logic Team semantics Modal

Dependence in Games &amp; Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Priority and Particle Physics: structure, dependence, and moderation in all things Kerry McKenzie

Sensitive dependence on initial conditions in maps Marc R. Roussel November 28, 2019 Marc R.

Chapter 3 Structural breaks for models with path dependence 2 Chapter 3 Path dependence (p.

Dependence: Theory and Practice Introduction to loop dependence and loop transformation 1 The

Things break. riak bends. Justin Sheehy justin@basho.com Perfection is Unattainable A system

Marco Baldi &amp; Kazuya Koyama CSWG Task-6: Numerical tools for non-standard cosmological

Adiabatic vs Sudden Flux Insertion and Nonlinear Electric Conduction Talk 2 by Masaki Oshikawa

Spatial Collectives and Causality Antony Galton Department of Mathematics and Computer Science

rss trs rttrs r

Fractions, Decimals, Ratios and Percentages Fraction Line Denominator

Introduction to C Programming Basics of Programming (2) Variables Standard Input/Output (2)

Finishing Point and PointTest Formal vs. Actual Parameters formal parameter: in declaration of

Dependence in Games & Dependence Games Davide Grossi (ILLC, University of Amsterdam) Paolo

Marco Baldi & Kazuya Koyama CSWG Task-6: Numerical tools for non-standard cosmological