Variational Greedy Algorithm for Clustering of Grouped Data
Linda S. L. Tan (Joint work with A/Prof. David J. Nott)
National University of Singapore
20–23 Dec ICSA 2013
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 1 / 15
Variational Greedy Algorithm for Clustering of Grouped Data Linda - - PowerPoint PPT Presentation
Variational Greedy Algorithm for Clustering of Grouped Data Linda S. L. Tan (Joint work with A/Prof. David J. Nott) National University of Singapore 2023 Dec ICSA 2013 Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 1 / 15
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 1 / 15
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 2 / 15
5 10 15 −2 −1 1 2 3
Time points Gene expression levels
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 3 / 15
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 4 / 15
ajI) and bj ∼ N(0, σ2 bjI) are random effects
i γj)
l=1 exp(uT i γl)
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 5 / 15
1
2
i=1 qi(θi) for θ = {θ1, . . . , θm} (Variational Bayes)
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 6 / 15
n
k
aj)q(σ2 bj) g
jl)
ai, Σq ai), q(βj) = N(µq βj, Σq βj), q(bj) = N(µq bj, Σq bj),
aj) = IG(αq aj, λq aj), q(σ2 bj) = IG(αq bj, λq bj), q(σ2 jl) = IG(αq jl, λq jl)
k
γ} (for tractable L).
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 7 / 15
1 Partial centering (Xi = Wi):
2 Full centering (Xi = Wi = Vi): Introduce νj = βj + bj ∼ N(βj, σ2
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 8 / 15
1
2
3
4
5
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 9 / 15
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 10 / 15
5 10 15 −2 −1 1 2 3
Time course data (Spellman et al. 1998)
Time points Gene expression levels
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 11 / 15
40 80 −2 1 3 cluster 1 (37 genes) 40 80 −2 1 3 cluster 2 (105 genes) 40 80 −2 1 3 cluster 3 (41 genes) 40 80 −2 1 3 cluster 4 (20 genes) 40 80 −2 1 3 cluster 5 (8 genes) 40 80 −2 1 3 cluster 6 (64 genes) 40 80 −2 1 3 cluster 7 (65 genes) 40 80 −2 1 3 cluster 8 (79 genes) 40 80 −2 1 3 cluster 9 (25 genes) 40 80 −2 1 3 cluster 10 (17 genes) 40 80 −2 1 3 cluster 11 (15 genes) 40 80 −2 1 3 cluster 12 (49 genes) 40 80 −2 1 3 cluster 13 (13 genes) 40 80 −2 1 3 cluster 14 (37 genes) 40 80 −2 1 3 cluster 15 (31 genes) 40 80 −2 1 3 cluster 16 (6 genes)
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 12 / 15
5 10 15 20 −2 1 2 3 400 gene expressions (4 repeated measurements)
5 10 15 20 −2 2
cluster 1: 67 genes
5 10 15 20 −2 2
cluster 2: 67 genes
5 10 15 20 −2 2
cluster 3: 67 genes
5 10 15 20 −2 2
cluster 4: 67 genes
5 10 15 20 −2 2
cluster 5: 66 genes
5 10 15 20 −2 2
cluster 6: 66 genes
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 13 / 15
cluster 1 (99)
0.5 6 12 18 28 30
cluster 2 (113)
0.5 6 12 18 28 30
cluster 3 (48)
0.5 6 12 18 28 30
cluster 4 (30)
0.5 6 12 18 28 30
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 14 / 15
Linda Tan (NUS) Variational Greedy Algorithm ICSA 2013 15 / 15