Network Science Barab asi: Ch. 2 Graph Theory Lecture 2 Joao - - PowerPoint PPT Presentation

network science
SMART_READER_LITE
LIVE PREVIEW

Network Science Barab asi: Ch. 2 Graph Theory Lecture 2 Joao - - PowerPoint PPT Presentation

Network Science Barab asi: Ch. 2 Graph Theory Lecture 2 Joao Meidanis University of Campinas, Brazil September 26, 2020 Summary Brief Statistics Review 1 Paths and Distances 2 Breadth First Search (BFS) 3 Connectivity 4


slide-1
SLIDE 1

Network Science

Barab´ asi: Ch. 2 — Graph Theory — Lecture 2 Joao Meidanis

University of Campinas, Brazil

September 26, 2020

slide-2
SLIDE 2

Summary

1

Brief Statistics Review

2

Paths and Distances

3

Breadth First Search (BFS)

4

Connectivity

5

Clustering coefficients

Meidanis (Unicamp) Network Science September 26, 2020 2 / 22

slide-3
SLIDE 3

Brief Statistics Review

Meidanis (Unicamp) Network Science September 26, 2020 3 / 22

slide-4
SLIDE 4

Average, moments, standard deviation

For a sample of N values x1, x2, . . . , xN: Average (mean): x = x1 + x2 + . . . + xN N = 1 N

N

  • i=1

xi The nth moment: xn = xn

1 + xn 2 + . . . + xn N

N = 1 N

N

  • i=1

xn

i

Standard deviation: σx =

  • 1

N

N

  • i=1

(xi − x)2

Meidanis (Unicamp) Network Science September 26, 2020 4 / 22

slide-5
SLIDE 5

Distributions

For a sample of N values x1, x2, . . . , xN: Distribution: px = 1 N

N

  • i=1

δ(x, xi) where the Kronecker δ is defined as δ(a, b) = 1 if a = b

  • therwise

We have:

  • x

px = 1 Continuous case (density function f ): ∞

−∞

f (x)dx = 1

Meidanis (Unicamp) Network Science September 26, 2020 5 / 22

slide-6
SLIDE 6

Paths and Distances

Meidanis (Unicamp) Network Science September 26, 2020 6 / 22

slide-7
SLIDE 7

Paths and Length

Physical distance usually irrelevant in networks:

a webpage can link to others very far away two neighbors may not know each other

Definition: a path is a route following network links (some texts require distinct nodes) Path length: number of links traversed

Meidanis (Unicamp) Network Science September 26, 2020 7 / 22

slide-8
SLIDE 8

Shortest Paths, Distance, Diameter

Shortest path from i to j: smallest number of links dij = distance from i to j = length of a shortest path from i to j Undireted network: dij = dji Directed network: often dij = dji Directed network: existence of i → j path does not guarantee existence of j → i path Computing distances:

powers of adjacency matrix — good to know BFS (breadth first search) algorithm — fast — good to run

dmax = diameter = maximum distance in network Average distance (connected graph): d = 1 N(N − 1)

  • i=j

dij = 1 2Lmax

  • i=j

dij

Meidanis (Unicamp) Network Science September 26, 2020 8 / 22

slide-9
SLIDE 9

Number of Paths

N(k)

ij

= number of length-k paths from i to j Can be computed from adjacency matrix Aij There is a link from i to j if and only if Aij = 1 Then N(1)

ij

= Aij There is a length-2 path from i to j if and only if there is k such that AikAkj = 1 The number of such paths is N(2)

ij

=

k AikAkj = A2 ij

And so on. In general N(k)

ij

= Ak

ij

Meidanis (Unicamp) Network Science September 26, 2020 9 / 22

slide-10
SLIDE 10

Breadth First Search (BFS)

Meidanis (Unicamp) Network Science September 26, 2020 10 / 22

slide-11
SLIDE 11

Breadth First Search (BFS)

algorithm: step 0

Meidanis (Unicamp) Network Science September 26, 2020 11 / 22

slide-12
SLIDE 12

Breadth First Search (BFS)

algorithm: step 1

Meidanis (Unicamp) Network Science September 26, 2020 12 / 22

slide-13
SLIDE 13

Breadth First Search (BFS)

algorithm: step 2

Meidanis (Unicamp) Network Science September 26, 2020 13 / 22

slide-14
SLIDE 14

Breadth First Search (BFS)

algorithm: step 3

Meidanis (Unicamp) Network Science September 26, 2020 14 / 22

slide-15
SLIDE 15

Breadth First Search (BFS)

algorithm: step 4

Meidanis (Unicamp) Network Science September 26, 2020 15 / 22

slide-16
SLIDE 16

Connectivity

Meidanis (Unicamp) Network Science September 26, 2020 16 / 22

slide-17
SLIDE 17

Connectivity for Undirected Graphs

Connected graph: any two nodes can be joined by a path Disconnected graph: two or more connected components Giant component: the largest connected component Isolates: the other connected components Bridge: link whose removal increases the number of components

Meidanis (Unicamp) Network Science September 26, 2020 17 / 22

slide-18
SLIDE 18

Connectivity for Directed Graphs

Strongly Connected graph: has paths back and forth from every node to every other node (e.g., AB path and BA path) Weakly connected graph: connected if we disregard link orientations Strongly connected components: can be identified; sometimes a single node In-component: nodes that reach a s.c.c. Out-component: nodes reachable from a s.c.c.

Meidanis (Unicamp) Network Science September 26, 2020 18 / 22

slide-19
SLIDE 19

Clustering coefficients

Meidanis (Unicamp) Network Science September 26, 2020 19 / 22

slide-20
SLIDE 20

Clustering coefficient

What fraction of the possible links exist among my neighbors? Ci = 2Li ki(ki − 1), where:

Li = number of links between node i’s neighbors ki = degree of node i

Ci ∈ [0, 1] Ci = 1 Ci = 1/2 Ci = 0

Meidanis (Unicamp) Network Science September 26, 2020 20 / 22

slide-21
SLIDE 21

Clustering coefficient for the entire network

Average clustering coefficient C = 1 N

N

  • i=1

Ci Global clustering coefficient C∆ = 3 × #Triangles #Connected Triplets connected triplet: path ABC, but ABC and CBA are considered to be the same triplet. a triangle contributes 3 triplets to the denominator a path ABC without link AC contributes 1 triplet to the denominator both C, C∆ ∈ [0, 1], not necessarily equal

Meidanis (Unicamp) Network Science September 26, 2020 21 / 22

slide-22
SLIDE 22

Clustering coefficients: Example

C = 13 42 ∼ 0.310 C∆ = 6 16 = 0.375

Meidanis (Unicamp) Network Science September 26, 2020 22 / 22