707 000 web science and web technology gy network
play

707.000 Web Science and Web Technology gy Network Evolution and - PowerPoint PPT Presentation

Knowledge Management Institute 707.000 Web Science and Web Technology gy Network Evolution and Processes Markus Strohmaier Univ. Ass. / Assistant Professor Knowledge Management Institute Graz University of Technology, Austria e-mail:


  1. Knowledge Management Institute 707.000 Web Science and Web Technology gy „Network Evolution and Processes“ Markus Strohmaier Univ. Ass. / Assistant Professor Knowledge Management Institute Graz University of Technology, Austria e-mail: markus.strohmaier@tugraz.at web: http://www.kmi.tugraz.at/staff/markus Markus Strohmaier 2011 1

  2. Knowledge Management Institute Overview A Agenda d • Network Creation and Evolution – Random Networks, Configuration Model, Barabasi and Albert • Network Processes • Network Processes – The SIR Model Markus Strohmaier 2011 2

  3. Knowledge Management Institute Motivation With demos from http://www-personal.umich.edu/~ladamic/NetLogo/ Examples of network evolution: • „Invites“ to join GMail • „Invites“ to buy Chumby • „Invites“ to join Joost I it “ t j i J t • Vaccination strategies for epidemics • • … Markus Strohmaier 2011 3

  4. Knowledge Management Institute Background Background [Newman 2003] • • First example of a scale free network (Price): First example of a scale-free network (Price): – Network of citations between scientific papers – Both in- and out-degrees had power-law distributions • Answered the question: How do power law distributions Answered the question: How do power law distributions emerge? – “the rich get richer” – In other words: the amount you get goes up with the amount you already have • The “Matthew affect” – “For to every one that hath shall be given” (Matthew 25:29) – – (in german ~ “wer hat dem wird gegeben”) (in german wer hat dem wird gegeben ) • Other labels – Cumulative advantage – Preferential attachment • Evident in scientific paper citations – The rate at which a paper gets new citations is proportional to the number that it already has Markus Strohmaier 2011 6

  5. Knowledge Management Institute Giant Components - Demo • When do Giant Components emerge? Wh d Gi t C t ? http://ccl.northwestern.edu/netlogo/models/GiantComponent Markus Strohmaier 2011 7

  6. Knowledge Management Institute Two Assumptions Two Assumptions [Leskovec 2006] “Conventional Wisdom” that networks that evolve are characterized by • Constant average degree – Edges grow linearly with edges • Sl Slowly growing diameter l i di t – Growing diameter with the addition of new nodes Empirical observations show that • Networks are becoming denser over time (densification power laws) laws) • Effective diameter is in many cases decreasing as networks grow (shrinking diameter) Markus Strohmaier 2011 8

  7. Knowledge Management Institute Empirical Observation: Densification Empirical Observation: Densification [Leskovec 2006] Markus Strohmaier 2011 9

  8. Knowledge Management Institute Empirical Observation: Densification Empirical Observation: Densification [Leskovec 2006] Markus Strohmaier 2011 10

  9. Knowledge Management Institute Empirical Observation: Effective Diameter Empirical Observation: Effective Diameter [Leskovec 2006] Eff Effective diameter: ti di t The minimum distance d such that at least 90% such that at least 90% of the connected node pairs are at distance at pairs are at distance at most d Decreasing Decreasing diameter over time Markus Strohmaier 2011 11

  10. Knowledge Management Institute Motivation Motivation [Leskovec 2006] What underlying processes cause a graph to 1. systematically densify? 2. experience a decrease in effective diameter even as it its size increases? i i ? But first, let’s take a step back Markus Strohmaier 2011 12

  11. Knowledge Management Institute Graph Generators Graph Generators [Leskovec 2006] “What if we could develop algorithms that are capable of constructing networks that exhibit similar characteristics as g observed in “real-world” networks?” We could do interesting things, such as: • E t Extrapolations l ti – predicting future network development • Sampling p g – Drawing a sample and generalizing to the entire population • Abnormality detection – – Identifying deviations from “normal” network behaviour Identifying deviations from normal network behaviour • Simulation – Exploring “what if” scenarios, e.g. deletion of hubs, network resilience Markus Strohmaier 2011 13

  12. Knowledge Management Institute Simple Graph Generators Simple Graph Generators [Newman 2003] Can we develop an algorithm that constructs random graphs? Algorithm : Take some number n of vertices and connect each pair (or not) with probability p (or 1-p) with probability p (or 1 p). The Erdos-Renyi / Poisson random Graph G( G(n,m) the set of all graphs having n vertices and m edges, each ) th t f ll h h i ti d d h possible graph appearing with equal probability For example: G(3,2) is the set of all three graphs having 3 vertices p ( ) g p g and 2 edges, each graph has probability 1/3 ->Does not mimic reality Markus Strohmaier 2011 14

  13. Knowledge Management Institute Faloutsos / Leskovec Faloutsos / Leskovec ECML/PKDD 2007 Markus Strohmaier 2011 15

  14. Knowledge Management Institute Random Graphs Random Graphs [Faloutsos / Leskovec ECML/PKDD 2007] � Pros: – Simple model Simple model – Phase transitions (giant component with avg. degree >1) – Giant component � Cons: – Degree distribution – No community structure No comm nit str ct re – No degree correlations � � Extensions: Extensions: Configuration model – Random graphs with arbitrary degree sequence Markus Strohmaier 2011 16

  15. Knowledge Management Institute The Configuration Model The Configuration Model C Consider the model defined in the following way. id th d l d fi d i th f ll i We specify a degree distribution p k , such that p k is the fraction of vertices in the network having degree k. We choose a degree sequence, which is a set of n values of the degrees k of vertices i = 1 values of the degrees k i of vertices i = 1 . . . n, from n from this distribution. We can think of this as giving each vertex i in our graph k i “stubs” or “spokes” sticking vertex i in our graph k i stubs or spokes sticking out of it, which are the ends of edges-to-be. [Newman 2003] Markus Strohmaier 2011 17

  16. Knowledge Management Institute The Configuration Model The Configuration Model Th Then we choose pairs of stubs at random from the h i f t b t d f th network and connect them together. It is straightforward to demonstrate that this process straightforward to demonstrate that this process generates every possible topology of a graph with the given degree sequence with equal probability. g g q q p y The configuration model is defined as the ensemble of g graphs so produced, with each having equal weight. [Newman 2003] Markus Strohmaier 2011 18

  17. Knowledge Management Institute The Configuration Model: The Configuration Model: Example 1 1. Define a degree distribution (e.g. 3,2,1,1,1) D fi d di t ib ti ( 3 2 1 1 1) 2. Specify degrees for each node, based on the degree distribution (e.g. A->3, B->2, C->1, D->1, E->1) ( g , , , , ) 3. Insert an edge between two arbitrary nodes in your node set that have not satisfied their specified degree yet. 4 4. Repeat step 3 until all node degrees are satisfied. R t t 3 til ll d d ti fi d 1 1 1 1 1 1 1 1 1 1 E E D D E E D D E E D D E E D D E E D D 1 1 1 1 3 3 3 3 3 1 A B A B A B A B A B 2 2 2 2 2 2 2 2 2 2 C C C C C Example S Specified degree satisfied ifi d d ti fi d S Specified f node degree Markus Strohmaier 2011 19

  18. Knowledge Management Institute The Configuration Model: The Configuration Model: Example II A Another perspective: th ti Faloutsos / Leskovec ECML/PKDD 2007 Example Markus Strohmaier 2011 20

  19. Knowledge Management Institute The Configuration Model • Can reproduce networks with power-law distributions C d t k ith l di t ib ti – Accepts arbitrary degree distributions as input • Does not explain the natural emergence of power law networks networks • Does not explain network growth / evolution Markus Strohmaier 2011 21

  20. Knowledge Management Institute Generating Scale Free Networks Generating Scale Free Networks [ Barabasi and Albert 1999] T To incorporate the growing character of the network , starting with a small number i t th t ti ith ll b i h t f th t k ( m 0 ) of vertices, at every time step we add a new vertex with m( ≤ m 0 ) edges that link the new vertex to m different vertices already present in the system. To incorporate preferential attachment, we assume that the probability Π that a new vertex will be connected to vertex i depends on the connectivity k i of that vertex, so that Degree of g vertex i Probability of a new vertex attaching to a The sum of Π ( k i ) = k i / ∑ j k j vertex i with degree k all vertices‘ degrees In other words: the probability is the degree of vertex i divided by the sum of all nodes’ degrees After t time steps the model leads to a random network with t + m vertices and mt After t time steps, the model leads to a random network with t + m 0 vertices and mt edges. This network evolves into a scale-invariant state following a power law (satisfies the two conditions: Growth and Preferential Attachment). two conditions: Growth and Preferential Attachment). Markus Strohmaier 2011 22

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend