Clustering Relational Data using the Infinite Relational Model
Ana Daglis
Supervised by: Matthew Ludkin
September 4, 2015
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 1 / 29
Clustering Relational Data using the Infinite Relational Model Ana - - PowerPoint PPT Presentation
Clustering Relational Data using the Infinite Relational Model Ana Daglis Supervised by: Matthew Ludkin September 4, 2015 Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 1 / 29 Outline Clustering 1 Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 1 / 29
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 2 / 29
Clustering
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 3 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 4 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 5 / 29
Model
1 added to an existing block with probability |b|/(n + A), where |b| is
2 creates a completely new block with probability A/(n + A).
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 6 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 7 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 8 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 9 / 29
Model
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 10 / 29
Gibbs Sampling Methodology
1 Initialize with θ = (θ(0)
2 For i = 1, 2, . . . , n,
1
2
d
2
1 , θ(i−1) 3
d
d
1 , θ(i) 2 , . . . , θ(i) d−1).
3 Discard the first k iterations and estimate the posterior distribution
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 11 / 29
Gibbs Sampling Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 12 / 29
Gibbs Sampling Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 13 / 29
Gibbs Sampling Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 14 / 29
Gibbs Sampling Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 15 / 29
Gibbs Sampling Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 16 / 29
Gibbs Sampling Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 17 / 29
Split-Merge Algorithm Methodology
1 Select two distinct nodes, i and j, uniformly at random. 2 If i and j belong to the same cluster, split that cluster into two by
3 If i and j belong to different clusters, merge those clusters. 4 Evaluate Metropolis-Hastings acceptance probability. If accepted,
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 18 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 19 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 20 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 21 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 22 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 23 / 29
Split-Merge Algorithm Methodology
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 24 / 29
Split-Merge Algorithm Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 25 / 29
Split-Merge Algorithm Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 26 / 29
Split-Merge Algorithm Results
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 27 / 29
Future Work
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 28 / 29
Future Work
Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015 29 / 29