Hierarchical cl u stering N E TW OR K AN ALYSIS IN TH E TIDYVE R - - PowerPoint PPT Presentation

hierarchical cl u stering
SMART_READER_LITE
LIVE PREVIEW

Hierarchical cl u stering N E TW OR K AN ALYSIS IN TH E TIDYVE R - - PowerPoint PPT Presentation

Hierarchical cl u stering N E TW OR K AN ALYSIS IN TH E TIDYVE R SE Massimo Franceschet Prof . of Data Science , Uni v ersit y of Udine ( Ital y) NETWORK ANALYSIS IN THE TIDYVERSE NETWORK ANALYSIS IN THE TIDYVERSE The similarit y meas u re


slide-1
SLIDE 1

Hierarchical clustering

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE

Massimo Franceschet

  • Prof. of Data Science, University of

Udine (Italy)

slide-2
SLIDE 2

NETWORK ANALYSIS IN THE TIDYVERSE

slide-3
SLIDE 3

NETWORK ANALYSIS IN THE TIDYVERSE

slide-4
SLIDE 4

NETWORK ANALYSIS IN THE TIDYVERSE

The similarity measure

Single-linkage: the similarity between two groups is the maximum of the similarities between nodes of dierent groups. Complete-linkage: the similarity between two groups is the minimum of the similarities between nodes of dierent groups. Average-linkage: the similarity between two groups is the average of the similarities between nodes of dierent groups.

slide-5
SLIDE 5

NETWORK ANALYSIS IN THE TIDYVERSE

The clustering algorithm

  • 1. Evaluate the similarity measures for all node pairs.
  • 2. Assign each node to a group of its own.
  • 3. Find the pair of groups with the highest similarity and join them together into a single

group.

  • 4. Calculate the similarity between the new composite group and all others.
  • 5. Repeat steps 3 and 4 until all nodes have been joined into a single group.
slide-6
SLIDE 6

NETWORK ANALYSIS IN THE TIDYVERSE

Hierarchical clustering in R

# distance matrix from similarity matrix D <- 1-S # distance object from distance matrix d <- as.dist(D) # average-linkage clustering method cc <- hclust(d, method = "average") # cut dendrogram at 4 clusters hclust(d, method = "average") [1] 1 1 1 2 2 2 1 1 1 1 1 1 1 1 1 1 1 2 2 3 2 2 2 1 4 2 2 2 [29] 2 2 2 3 2 2 4 1 1 2 2 2 1 3 1 1 2 3 1 1 4 4 1 1 1 4 1 2 [57] 3 3 3 3 3 1 1 3

slide-7
SLIDE 7

Let's cluster our network!

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE

slide-8
SLIDE 8

Interactive visualizations with visNetwork

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE

Massimo Franceschet

  • Prof. of Data Science, University of

Udine (Italy)

slide-9
SLIDE 9

NETWORK ANALYSIS IN THE TIDYVERSE

Different layouts

slide-10
SLIDE 10

NETWORK ANALYSIS IN THE TIDYVERSE

slide-11
SLIDE 11

NETWORK ANALYSIS IN THE TIDYVERSE

slide-12
SLIDE 12

NETWORK ANALYSIS IN THE TIDYVERSE

slide-13
SLIDE 13

Let's interact!

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE

slide-14
SLIDE 14

Congratulations!

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE

Massimo Franceschet

  • Prof. of Data Science, University of

Udine (Italy)

slide-15
SLIDE 15

NETWORK ANALYSIS IN THE TIDYVERSE

Deeper inside network science

You now know how to: Analyze any network with basic centrality and similarity measures Produce beautiful network visualizations, including interactive ones For more information: University of Udine Network Science Course

slide-16
SLIDE 16

Continue the journey!

N E TW OR K AN ALYSIS IN TH E TIDYVE R SE