Discovery of Latent Factors in High-dimensional Data Using Tensor Methods
Furong Huang
University of California, Irvine
Machine Learning Conference 2016, New York City
Machine Learning - Modern Challenges: Big Data, Challenging Tasks
Cell Types
Communities
Topics: Education, Crime, Sports
[Figure: perplexity vs. running time (s), Tensor vs. Variational]
Facebook: n ∼ 20k, Yelp: n ∼ 40k, DBLP: n ∼ 1 million
[Figure: error per group and running times (s) on FB, YP, DBLPsub, DBLP]
“Online Tensor Methods for Learning Latent Variable Models”, by F. Huang, U. Niranjan, M. Hakeem, A. Anandkumar, JMLR14.
“Tensor Methods on Apache Spark”, by F. Huang, A. Anandkumar, Oct. 2015.
[Figure: Spatial point process (ours) vs. Average expression level (previous); axis label k]
“Discovering Neuronal Cell Types and Their Gene Expression Profiles Using a Spatial Point Process Mixture Model”, by F. Huang, A. Anandkumar, C. Borgs, J. Chayes, E. Fraenkel, M. Hawrylycz, E. Lein, A. Ingrosso, S. Turaga, NIPS 2015 BigNeuro workshop.
football soccer tree
The weather is good. Her life spanned years of incredible change for women. Mary lived through an era of liberating reform for women.
“Convolutional Dictionary Learning through Tensor Factorization”, by F. Huang, A. Anandkumar, JMLR Workshop and Conference Proceedings, vol. 44, Dec. 2015.
“Scalable Latent Tree Model and its Application to Health Analytics”, by F. Huang, N. U. Niranjan, I. Perros, R. Chen, J. Sun,
[Figure: topic model diagram. Choice variable h; topics k1, k2, k3, k4, k5; matrix A linking topics to words; words: life, gene, data, DNA, RNA]
Unlabeled data → Probabilistic admixture model → Learning Algorithm → Inference
MCMC
◮ Exponential mixing time
Likelihood Methods
◮ Exponential critical points
Tensor Decomposition
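The admixture model on this slide can be illustrated with a minimal generative sketch. Everything below is an assumption made for illustration and is not taken from the talk: the Dirichlet prior on the choice variable h, the random topic-word matrix A, the helper name `sample_document`, and the parameter `alpha`. Only the vocabulary and the count of five topics mirror the slide labels.

```python
import numpy as np

rng = np.random.default_rng(0)

vocab = ["life", "gene", "data", "DNA", "RNA"]  # words shown on the slide
num_topics = 5                                   # k1 .. k5 on the slide

# Hypothetical topic-word matrix: column j is the word distribution of topic j.
# Drawn at random here, since the slide gives no numerical values.
A = rng.dirichlet(np.ones(len(vocab)), size=num_topics).T


def sample_document(num_words, alpha=0.5):
    """Draw topic proportions h, then draw words from the mixture A @ h."""
    # Assumption: h follows a symmetric Dirichlet prior, as in LDA.
    h = rng.dirichlet(alpha * np.ones(num_topics))
    word_probs = A @ h  # convex combination of the topic columns
    word_ids = rng.choice(len(vocab), size=num_words, p=word_probs)
    return h, [vocab[i] for i in word_ids]


h, doc = sample_document(8)
print("topic proportions:", np.round(h, 3))
print("document:", doc)
```

Learning runs in the opposite direction: given only documents like `doc`, recover A and the distribution of h.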
[Figure: decomposition of a matrix (indices i1, i2) and of a third-order tensor (indices i1, i2, i3)]
$e_1 e_1^\top + e_2 e_2^\top = u_1 u_1^\top + u_2 u_2^\top$
$u_1 = \left[\tfrac{\sqrt{2}}{2},\; -\tfrac{\sqrt{2}}{2}\right]$
$u_2 = \left[\tfrac{\sqrt{2}}{2},\; \tfrac{\sqrt{2}}{2}\right]$
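The identity on this slide can be checked numerically. The sketch below assumes e1, e2 are the standard basis vectors of R^2 and that the two bracketed vectors are u1 and u2 in the order listed. The third-order comparison at the end is an addition for contrast and is not taken from the slide text.

```python
import numpy as np

e1, e2 = np.eye(2)                 # standard basis vectors (assumed)
s = np.sqrt(2) / 2
u1 = np.array([s, -s])
u2 = np.array([s, s])

# Second order: both pairs of rank-one terms sum to the identity matrix,
# so the matrix alone cannot tell {e1, e2} apart from {u1, u2}.
M_e = np.outer(e1, e1) + np.outer(e2, e2)
M_u = np.outer(u1, u1) + np.outer(u2, u2)
same_matrix = np.allclose(M_e, M_u)


def cube(v):
    """Rank-one third-order tensor v (x) v (x) v."""
    return np.einsum("i,j,k->ijk", v, v, v)


# Third order: the same two pairs give different tensors.
T_e = cube(e1) + cube(e2)
T_u = cube(u1) + cube(u2)
same_tensor = np.allclose(T_e, T_u)

print("matrices equal:", same_matrix)
print("tensors equal:", same_tensor)
```

The rotated pair reproduces the matrix but not the third-order tensor, which is one way to see why third-order moments carry information that second-order moments lack.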
[Figure: Topics (Crime, Sports, Education) and Topic Proportion, for documents containing the words campus, police, witness]
[Figure: words campus, police, witness along each axis; topics Crime, Sports, Education; matrix W shown three times (W, W, W)]
[Figure: community membership]
Communities: Mathematicians, Vegetarians, Musicians
People: Alice, Bob, Charlie, David, Ellen, Frank, Grace, Jack, Kathy
Saddle Point
Saddle point has zero gradient
Non-degenerate saddle: Hessian has both positive and negative eigenvalues
Negative eigenvalue: direction of escape
[Figure: trajectories labeled escape and stuck]
“Escaping From Saddle Points — Online Stochastic Gradient for Tensor Decomposition”, by R. Ge, F. Huang, C. Jin, Y. Yuan, COLT 2015.
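The saddle-point statements can be illustrated on a toy function. This is a sketch under stated assumptions, not the algorithm or objective from the cited paper: the function f(x, y) = x^2 + y^4/4 - y^2/2, the step size, the noise level, and the helper names `grad`, `hessian`, and `descend` are all chosen here for illustration.

```python
import numpy as np


def grad(p):
    """Gradient of the toy function f(x, y) = x**2 + y**4 / 4 - y**2 / 2."""
    x, y = p
    return np.array([2.0 * x, y**3 - y])


def hessian(p):
    """Hessian of the same toy function."""
    x, y = p
    return np.array([[2.0, 0.0], [0.0, 3.0 * y**2 - 1.0]])


def descend(p0, steps=500, lr=0.1, noise=0.0, seed=0):
    """Gradient descent; noise > 0 adds Gaussian noise to every gradient."""
    rng = np.random.default_rng(seed)
    p = np.array(p0, dtype=float)
    for _ in range(steps):
        p = p - lr * (grad(p) + noise * rng.standard_normal(2))
    return p


saddle = np.zeros(2)

# Zero gradient, and a Hessian with one negative and one positive eigenvalue.
eigvals = np.linalg.eigvalsh(hessian(saddle))

stuck = descend(saddle)                # exact gradient: never leaves the saddle
escaped = descend(saddle, noise=0.01)  # noisy gradient: slides off along y

print("Hessian eigenvalues at saddle:", eigvals)
print("without noise:", stuck)
print("with noise:   ", np.round(escaped, 3))
```

The noise supplies a small component along the negative-eigenvalue direction (the y axis here), which gradient descent then amplifies until the iterate settles near one of the minima at y = ±1.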
[Figure: M3, with labels escape and stuck]
Anima Anandkumar, UC Irvine
Rong Ge, Duke University
Srini Turaga, Janelia Research
Chi Jin, UC Berkeley
Jennifer Chayes, MSR
Christian Borgs, MSR
Ernest Fraenkel, MIT
Yang Yuan, Cornell U
UN Niranjan, UC Irvine