L ECTURE 35: N ETWORKS 2 T EACHER : G IANNI A. D I C ARO I MPORTANCE - - PowerPoint PPT Presentation

l ecture 35 n etworks 2
SMART_READER_LITE
LIVE PREVIEW

L ECTURE 35: N ETWORKS 2 T EACHER : G IANNI A. D I C ARO I MPORTANCE - - PowerPoint PPT Presentation

15-382 C OLLECTIVE I NTELLIGENCE S19 L ECTURE 35: N ETWORKS 2 T EACHER : G IANNI A. D I C ARO I MPORTANCE / P OWER IN NETWORKS Certain positions within the network give nodes more impor portanc nce / / pow power o Directly affect/influence


slide-1
SLIDE 1

LECTURE 35: NETWORKS 2

TEACHER: GIANNI A. DI CARO

15-382 COLLECTIVE INTELLIGENCE – S19

slide-2
SLIDE 2

IMPORTANCE / POWER IN NETWORKS

Certain positions within the network give nodes more impor portanc nce / / pow power

  • Directly affect/influence others
  • Control the flow of information
  • Avoid control of others

2

Influencers in social networks, vehicles for disease spreading, hubs in road networks, key infrastructures on the Internet, leaders in animal societies, …

This and fol

  • llow
  • wing

g slides are adapted from

  • m

Kri Kristina Le Lerman’s sl slides s

slide-3
SLIDE 3

3

IMPORTANCE OF A NODE: NETWORK CENTRALITY

§ Networ

  • rk Centrality indices provide

answers to the question by defining a mapping that attributes a real-valued number to each node § These number can be used to determine a ranking among the nodes of a network

"What characterizes an important node?"

slide-4
SLIDE 4

CENTRALITY IN NETWORKS

§ Ce Centrality encodes the relationship between structure (topology) and importance/power (flows of information and control) in interconnected systems à Certain positions within the network give nodes more power or importance § How do we measure importance?

  • Who can directly affect/influence others? [direct information transfer]
  • Highest degree nodes are “in the thick of it”
  • Who controls information flows? [relaying information transfer]
  • Nodes that fall on shortest paths between others can disrupt the flow of

information between them

  • Who can quickly inform most others? [multicasting, few hops dissemination]
  • Nodes who are close to other nodes can quickly get/give information to them

4

slide-5
SLIDE 5

DEGREE CENTRALITY

§ Degr gree Cent ntrality of node 𝑜 is the number of other nodes 𝑜 is connected to

  • A node with high degree has high potential communication activity

1 2 3 4 5

node In- degree Out- degree Total degree 1 1 1 2 3 2 5 3 1 3 4 4 2 1 3 5 2 1 3

1 2 3 4 5

5

slide-6
SLIDE 6

MATHEMATICAL REPRESENTATION USING ADJACENCY MATRIX

1 1 1 1 1 1 1 1

1 2 3 4 5

1 2 3 4 5 1 2 3 4 5 Adjacency matrix A

di

  • ut =

Aij

j

å

di

in =

A ji

j

å

node In- degree Out- degree Total degree 1 1 1 2 3 2 5 3 1 3 4 4 2 1 3 5 2 1 3 Out-degree: row sum In-degree: column sum

6

slide-7
SLIDE 7

BETWEENNESS CENTRALITY

§ Betweenne nness Cent ntrality of node 𝑜 is the number of shortest paths (geodesics) connecting all pairs of other nodes that pass through 𝑜

  • Node with highest betweenness can potentially control or distort

communication between a large number of nodes

1 2 3 4 5 2 3 4 5

1

7

1à2 1à2à3 1à2à4 1à2à4à5

Paths from 1

2à3 2à4 2à4à5

Paths from 2

3à4 3à5 3à2

Paths from 3

4à5à2 4à5à2à3 4à5

Paths from 4

5à2 5à2à3 5à2à4

Paths from 5

slide-8
SLIDE 8

CLOSENESS CENTRALITY

§ Node that is closer to all other nodes can reach all other nodes in shortest amount of time (on average)

  • This node should best avoid being controlled by others!

§ Clos

  • sene

ness cent ntrality is the reciprocal of the sum of geodesic distances from a node to all other nodes

8

𝐷 𝑜 = 1 ∑& 𝑒(𝑛, 𝑜) 𝐷 𝑜 = 𝑂 ∑& 𝑒(𝑛, 𝑜) In not strongly connected graphs, the distance between two nodes that are not connected is set to 1/∞=0

slide-9
SLIDE 9

SELF-CONSISTENT MEASURES OF CENTRALITY

§ Katz (1953): Katz scor

  • re
  • “not only on how many others a person is connected to, but who he

connects to”

  • One’s status is determined by the status of the people s/he is

connected to

9

§ Bonacich (1972): Eige genvector

  • r centrality
  • Node’s centrality is the sum of the centralities of its connections
  • Relative centrality can be computed as an average over the

centralities of the directly connected nodes

  • We can use the degree centrality as a reference centrality for the 𝑑

/

𝑑0 = 1 𝜇 2

/∈𝒪(0)

𝑑/

slide-10
SLIDE 10

EIGENVECTOR CENTRALITY

10

c is the eigenvector of A associated to the largest eigenvalue 𝜇 (only 𝒅 ≥ 0 are of interest + Perron-Frobenius theorem)

§ Bonacich (1972): Eige genvector

  • r centrality
  • Node’s centrality is the sum of the centralities of its connections

𝑑8 = 1 𝜇 2

/∈𝒪(0)

𝑑/ Using the adjacency matrix 𝐵 for degree centrality: 𝑑8 = 1 𝜇 2

/∈:

𝐵8/𝑑/ Where 𝐻 is the entire network graph, 𝒪 is 𝑜’s neighborhood 𝜇 𝑑8 = 2

/∈:

𝐵8/𝑑/ This is an eige genvector

  • r equation
  • n!

𝜇𝒅 = 𝐵𝒅

slide-11
SLIDE 11

EIGENVECTOR CENTRALITY

11

𝜇 𝑑8 = 2

/∈:

𝐵8/𝑑/

𝜇𝒅 = 𝐵𝒅

  • The value of the 𝑜-th component of the eigenvector provides the

value of the relative centrality of node 𝑜

  • There’s a scale factor, such that only the relative values are

meaningful (for ranking the nodes)

  • Computational aspects: find the largest eigenvalue, compute the

eigenvector (for very large matrices)

slide-12
SLIDE 12

EIGENVECTOR CENTRALITY

12

  • Me

Message ge passing g computation by estimate bootstrapping:

  • Start with an initial guess for the centrality of a node (e.g., number of

neighbors), then iterate for each node 𝑗:

  • Get centrality values from neighbors and update 𝑑8 according to

𝑑8 ← >

? ∑/∈𝒪 𝐵8/𝑑/

  • Over time, estimates become more and more accurate, converging to the

correct estimates in the limit

  • Convergence can be assessed by any L1, L2, L∞ metric of choice for

detecting that the vector of centralities undergo no significant changes

  • Any synchronous or asynchronous scheme can be adopted for selecting

the nodes performing the update, examples from algorithms for asynchronous value iteration (dynamic programming) can be useful to devise computationally-effective schemes

slide-13
SLIDE 13

MESSAGE PASSING BY PUSH-PULL GOSSIP

13

  • Me

Message ge passing g computation or estimation of network-level node-local properties can be realized according to a general scheme based on pus push-pull gos gossip of information data, which is based on a model of epidemic spreading

  • The message passing algorithm algorithm can be implemented in a fully distributed

and decentralized way, including running it on the network itself

  • Active: runs periodically, selects a neighbor 𝑞 (peer), sends (pus

push) its state value 𝑡 to 𝑞, receives (pull pull) 𝑞’s state value, use it to update its local state 𝑡

  • Passive: listen to requests from neighbors, when contacted receives state value from

requester, sends its state value, updates its local state Ø 𝑡 is the state value held by a node, that can be the centrality value, as well as any

  • ther state value of interest related to the network

Active thread Passive thread

Each node has two running threads

slide-14
SLIDE 14

EIGENVECTOR CENTRALITY: POWER METHOD

14

  • Pow
  • wer method
  • d (Pow
  • wer iteration
  • n): form of centralized numeric implementation
  • f the message passing approach for computing centrality values, that iterates

the following equation: 𝒅BC> = 𝐵𝒅B 𝐵𝒅B Ø This method is a general approach for the numeric calculation

  • n of
  • f the large

gest eige genvalue of a diagonalizable matrix 𝐵 and, accordingly, of the eige genvector

  • r

assoc

  • ciated to
  • the large

gest (dom

  • minant) eige

genvalue

  • Assumptions (for convergence): matrix 𝐵 is diagonalizable, has a dominant

eigenvalue, and the starting vector 𝒅D has a non-zero component in the direction of an eigenvector associated to the dominant eigenvalue

  • It’s a simple and possibly slow to converge algorithm
  • Appropriate for large and sparse matrices
slide-15
SLIDE 15

ALPHA-CENTRALITY (BONACICH, 1987)

§ Alp Alpha Centralit lity: Similar to eigenvector centrality, but the degree to which a node centrality contributes to the centralities of other nodes depends on a parameter 𝛽 § Mathematical interpretation:

  • 𝑑8(𝛽) is the expected number of paths activated directly or indirectly by node 𝑗

ci(a) = (1+ ac j(a))Aij

j

å

ci(a) = A + aA2 + a 2A3 + ...

( )

j

å

15

slide-16
SLIDE 16

A CLOSER LOOK AT ALPHA-CENTRALITY

§ Alpha-Centrality matrix § 1st term: number of paths of length 1 (edges) between i and j § Contribution of this term to ci(a) is SjAij

c(a) = A + aA2 + a 2A3 + ...

1 1 1 1 1 1 1 1

1 2 3 4 5

16

slide-17
SLIDE 17

A CLOSER LOOK AT ALPHA-CENTRALITY

§ Alpha-Centrality matrix § 2nd term: number of paths of length 2 between i and j c(a) = A + aA2 + a 2A3 + ...

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

1 2 3 4 5

1 1 1 1 2 1 1 1 1 1 1 1 x =

17

slide-18
SLIDE 18

A CLOSER LOOK AT ALPHA-CENTRALITY

§ Alpha-Centrality matrix § 3rd term: number of paths of length 3 between i and j c(a) = A + aA2 + a 2A3 + ...

1 1 1 1 1 1 1 1

1 2 3 4 5

1 1 1 1 2 1 1 1 1 1 1 1 x = 1 1 2 2 1 1 1 2 1 2 2 1 1 1 1 2

18

slide-19
SLIDE 19

A CLOSER LOOK AT ALPHA-CENTRALITY

§ Alpha-Centrality matrix § Number of paths of diverges as length of the path k grows § To keep the infinite sum finite, a < 1/l1, where l1 is the largest eigenvalue of A (also called radius of centrality § Interpretation: Node’s centrality is the sum of paths of any length connecting it to other nodes, exponentially attenuated by length of the path, so that longer paths contribute less than shorter paths

c(a) = A + aA2 + a 2A3 + ...= a kAk+1

k= 0 ¥

å

c(a) = A a kAk

k= 0 ¥

å

= (I -aA)-1A

19

slide-20
SLIDE 20

RADIUS OF CENTRALITY

Parameter a sets the length scale of communication or interactions. § For a = 0, only local interactions (with neighbors) are considered

  • Only loc
  • cal structure is important
  • centrality is same as degree centrality

20

slide-21
SLIDE 21

RADIUS OF CENTRALITY

Parameter a sets the length scale of communication or interactions. § As a grows, the length of interaction grows

  • Globa
  • bal structure becomes more important
  • Centrality depends on node’s position within a larger

structure, e.g., a community

21

slide-22
SLIDE 22

RADIUS OF CENTRALITY

Parameter a sets the length scale of communication or interactions. § As a à 1/l1, length of interactions becomes infinite

  • Glob
  • bal structure is important
  • Centrality is same as eigenvector centrality

22

slide-23
SLIDE 23

NORMALIZED ALPHA-CENTRALITY [GHOSH & LERMAN 2011]

§ Alpha-Centrality diverges for a > 1/l1 § Solution: Normalized Alpha-Centrality

  • Holds for

n(a) = c(a) cij(a)

i, j N

å

0 £ a £1

23

slide-24
SLIDE 24

MULTI-SCALE ANALYSIS WITH ALPHA-CENTRALITY

§ Parameter a allows for multi-scale analysis of networks

  • Differentiate between local and global structures

§ Study how rankings change with a

  • Leaders: high influence on group members
  • Nodes with high centrality for small values of a
  • Bridges: mediate communication between groups
  • Nodes with low centrality for small values of a
  • But high centrality for large values of a
  • Peripherals: poorly connected to everyone
  • Nodes with low centrality for any value of a

24

slide-25
SLIDE 25

KARATE CLUB NETWORK [ZACHARY, 1977]

instructor administrator

25

slide-26
SLIDE 26

RANKING KARATE CLUB MEMBERS

Centrality scores of nodes vs. a

26

slide-27
SLIDE 27

FLORENTINE FAMILIES IN 15TH CENTURY ITALY

27

slide-28
SLIDE 28

RANKING FLORENTINE FAMILIES

28

slide-29
SLIDE 29

SUMMARY

§ Network position confers advantages or disadvantages to a node, but how you measure it depends on what you mean by advantage

  • Ability to directly reach many nodes à degree centrality
  • Ability to control information à betweenness centrality
  • Ability to avoid control à closeness centrality

§ Self-consistent (bootstrapping) definitions of centrality

  • Node’s centrality depends on centrality of those it is connected to, directly
  • r indirectly, but contribution of distant nodes is attenuated by how far

they are

  • Message passing and Power methods for computing centrality values
  • Attenuation parameter sets the length scale of interactions
  • Can probe structure at different scales by varying this parameter

29

slide-30
SLIDE 30

PAGERANK: STANDING ON THE SHOULDERS OF GIANTS [FRANCESCHET]

Key insights § Analyzes the structure of the web of hyperlinks to determine importance score

  • f web pages
  • A web page is important if it is pointed to by other important pages

§ An algorithm with deep mathematical roots

  • Random walks
  • Social network theory

30

slide-31
SLIDE 31

PAGERANK AND THE RANDOM SURFER

Random Surfer § Starts at arbitrary page

I H L M G E F B C D A

31

slide-32
SLIDE 32

PAGERANK AND THE RANDOM SURFER

Random Surfer § Starts at arbitrary page § Bounces from page to page by following links randomly

I H L M G E F B C D A

32

slide-33
SLIDE 33

PAGERANK AND THE RANDOM SURFER

I H L M G E F B C D A

Random Surfer

  • Starts at arbitrary page
  • Bounces from page to

page by following links randomly

  • PageRank score of a

web page is the relative number of time it is visited by the Random Surfer

33

slide-34
SLIDE 34

MATHEMATICS OF PAGERANK

§ PageRank is a solution to a random walk on a graph

I H L M G E F B C D A A B C D E F G H I L M A B 1 C 1 D 1 1 E 1 1 1 F 1 1 G 1 1 H 1 1 I 1 1 L 1 M 1

Adjacency matrix of the graph A

34

slide-35
SLIDE 35

MATHEMATICS OF PAGERANK

§ PageRank is a solution to a random walk on a graph

I H L M G E F B C D A A B C D E F G H I L M A B 1 C 1 D 2 E 3 F 2 G 2 H 2 I 2 L 1 M 1

(Diagonal) Out-degree matrix D

35

slide-36
SLIDE 36

MATHEMATICS OF PAGERANK

§ PageRank is a solution to a random walk on a graph

  • hij is probability to go from node i to node j

I H L M G E F B C D A A B C D E F G H I L M A B 1 C 1 D .5 .5 E .3 .3 .3 F .5 1 .5 G .5 .5 H .5 .5 I .5 .5 L 1 M 1

hij=1/di à H=D-1A

36

slide-37
SLIDE 37

MATHEMATICS OF PAGERANK

§ PageRank of page j is defined recursively as

  • pj=Sipihij
  • Or in matrix form p=pH

§ What contributes to PageRank score?

  • Number of links page j receives
  • Cf B and D
  • Number of outgoing links of linking pages
  • Cf E’s effect on F and B’s effect on C
  • PageRank scores of linking pages
  • Cf E and B

I H L M G E F B C D A

37

slide-38
SLIDE 38

.09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 1 1 .5 .5 .3 .3 .3 .5 1 .5 .5 .5 .5 .5 .5 .5 1 1

… BUT THERE ARE PROBLEMS

§ Random Surfer gets trapped by dangling nodes! (no outlinks) § Solution: matrix S

  • replace zero rows in H with u=[0.9,0.9, …, 0.9]
  • From dangling node, surfer jumps to any other node

I H L M G E F B C D A

38

slide-39
SLIDE 39

STILL PROBLEMS

§ Random Surfer gets trapped in buckets

  • Reachable strongly connected component without outlinks

§ Solution: teleportation matrix E

  • Matrix of u

I H L M G E F B C D A .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09 .09

39

slide-40
SLIDE 40

… FINALLY

§ Google matrix G = aS + (1-a) E

  • Where a is the damping factor

§ Interpretation of G

  • With probability a, Random Surfer follows a hyperlink from a page

(selected at random)

  • With probability 1-a, Random Surfer jumps to any page (e.g., by

entering a new URL in the browser) § PageRank scores are the solution of self-consistent equation p =pG =apS + (1-a)u

40

slide-41
SLIDE 41

PAGERANK SCORES

I 1.6 H 1.6 L 1.6 M 1.6 G 1.6 E 8.1 F 3.9 B 38.4 C 34.3 D 3.9 A 3.3

41

slide-42
SLIDE 42

SUMMARY

§ Recursive (or self-consistent) nature of PageRank has roots in social network analysis metrics § PageRank is fundamentally related to random walks on graphs

  • Lots of research to compute it efficiently
  • Huge economic and social impact!

42