Selecting Representatives and Being Represented in Unsupervised Learning
- AP & LLE
Xiangliang Zhang (张响亮)
King Abdullah University of Science and Technology
CNCC, Oct 25, 2018 Hangzhou, China
Outline
- Affinity Propagation (AP) [Frey and Dueck, Science, 2007]
- Locally Linear Embedding (LLE) [Roweis and Saul, Science, 2000]
[Frey and Dueck, Science, 2007]
[Frey and Dueck, NIPS 2005]
We describe a new method that, for the first time to our knowledge, combines the advantages of model-based clustering and affinity-based clustering.
Mixture model: $p(x) = \sum_{m=1}^{k} \pi_m \, p(x \mid \theta_m)$, where $\pi_m$ is the mixing coefficient and $p(x \mid \theta_m)$ is the m-th component.

K-means: minimize $\sum_{m=1}^{k} \sum_{x_i \in C_m} (x_i - \mu_m)^2$ where $\mu_m = \frac{1}{|C_m|} \sum_{x_i \in C_m} x_i$ (the center is the cluster mean, which need not be a data point).

K-medoids / K-medians: minimize $\sum_{m=1}^{k} \sum_{x_i \in C_m} (x_i - \mu_m)^2$ where $\mu_m \in \{x_i \mid x_i \in C_m\}$ (the center must be an actual data point; K-medians uses absolute rather than squared distances).
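To make the contrast concrete, here is a minimal NumPy sketch of the two center-update rules (function names and the toy data are illustrative, not from the slides):

```python
import numpy as np

def kmeans_center(cluster):
    """K-means: the center is the mean, which need not be a data point."""
    return cluster.mean(axis=0)

def kmedoids_center(cluster):
    """K-medoids: the center must be a data point; pick the one with the
    smallest total squared distance to the rest of the cluster."""
    d2 = ((cluster[:, None, :] - cluster[None, :, :]) ** 2).sum(-1)
    return cluster[d2.sum(axis=1).argmin()]

cluster = np.array([[0., 0.], [1., 0.], [0., 1.], [1., 1.], [5., 5.]])
print(kmeans_center(cluster))    # [1.4 1.4] -- pulled toward the outlier
print(kmedoids_center(cluster))  # [1. 1.]  -- an actual data point
```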
Data to cluster: x_1, …, x_N. In the mixture view, the likelihood of x_i belonging to the cluster with center x_k plays the role of the component density, and the Bayesian prior probability that x_k is a cluster center plays the role of the mixing coefficient.
The responsibility of the k-th component for generating x_i gives a soft assignment; the algorithm alternates between assigning x_i to a center s_i and choosing new centers.
Message ("responsibility") sent from x_i to each candidate center/exemplar: x_i's preference to be with that exemplar. On its own, this makes a hard decision about cluster centers/exemplars.
Introduce "availabilities": messages sent from exemplars back to the other points, providing soft evidence of the preference for each exemplar to be available as a center for each point.
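These two messages have simple closed-form updates. Below is a minimal NumPy sketch of the responsibility and availability updates of Frey and Dueck (2007), with damping; convergence checks and other practical details are omitted, and the function name is mine:

```python
import numpy as np

def affinity_propagation(S, damping=0.5, iters=200):
    """Minimal AP message passing on similarity matrix S (diagonal = preferences)."""
    N = S.shape[0]
    R = np.zeros((N, N))  # responsibilities r(i,k): sent from points to candidates
    A = np.zeros((N, N))  # availabilities a(i,k): sent from candidates to points
    for _ in range(iters):
        # r(i,k) = s(i,k) - max_{k' != k} [a(i,k') + s(i,k')]
        AS = A + S
        idx = AS.argmax(axis=1)
        first = AS[np.arange(N), idx]
        AS[np.arange(N), idx] = -np.inf
        second = AS.max(axis=1)
        Rnew = S - first[:, None]
        Rnew[np.arange(N), idx] = S[np.arange(N), idx] - second
        R = damping * R + (1 - damping) * Rnew
        # a(i,k) = min(0, r(k,k) + sum_{i' not in {i,k}} max(0, r(i',k)))
        # a(k,k) =         sum_{i' != k}                max(0, r(i',k))
        Rp = np.maximum(R, 0)
        np.fill_diagonal(Rp, R.diagonal())
        Anew = Rp.sum(axis=0)[None, :] - Rp
        diag = Anew.diagonal().copy()
        Anew = np.minimum(0, Anew)
        np.fill_diagonal(Anew, diag)
        A = damping * A + (1 - damping) * Anew
    return (A + R).argmax(axis=1)  # exemplar chosen by each point
```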
Affinities
c_i is the index of the exemplar for point x_i.
The objective function is to maximize the net similarity $S(c) = \sum_{i=1}^{N} s(i, c_i)$.
Constraints: a cluster should not be empty and has a single exemplar; an exemplar must select itself as its own exemplar (if c_j = k for some j, then c_k = k).
Preference (prior): the self-similarity s(k, k), expressing a priori how suitable point k is to be chosen as an exemplar; shared preferences control how many clusters emerge.
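The effect of the preference is easy to see with scikit-learn's AffinityPropagation (the data and parameter values below are illustrative only):

```python
import numpy as np
from sklearn.cluster import AffinityPropagation

# three well-separated 2-D blobs
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(c, 0.3, size=(30, 2)) for c in (0.0, 3.0, 6.0)])

# preference = s(k,k): larger (less negative) values allow more points to
# become exemplars, hence more clusters; scikit-learn defaults to the
# median similarity when preference is None
ap = AffinityPropagation(preference=-50, damping=0.9, random_state=0).fit(X)
print(ap.cluster_centers_indices_)  # indices of the chosen exemplars
print(len(set(ap.labels_)))         # number of clusters found
```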
[Roweis and Saul, Science, 2000]
Saul and Roweis. Think globally, fit locally: unsupervised learning of low dimensional manifolds. Journal of Machine Learning Research, 2003.
Given pairwise similarities between data points, find a low-dimensional embedding that preserves them.
Can we eliminate the need to estimate pairwise distances between widely separated data points?
- Locally, on a fine enough scale, everything looks linear.
- Represent each object as a linear combination of its neighbors.
- Assumption: the same linear representation will hold in the low-dimensional space.
- Find a low-dimensional embedding which minimizes the reconstruction loss (a usage sketch follows below).
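As a usage sketch (parameter values are illustrative), scikit-learn's LocallyLinearEmbedding implements this recipe, e.g. on a swiss-roll manifold:

```python
from sklearn.datasets import make_swiss_roll
from sklearn.manifold import LocallyLinearEmbedding

# a 3-D manifold that is intrinsically 2-D
X, color = make_swiss_roll(n_samples=1000, random_state=0)

# each point is reconstructed from its 12 nearest neighbors; the same
# weights then place the points in the 2-D embedding
lle = LocallyLinearEmbedding(n_neighbors=12, n_components=2)
Y = lle.fit_transform(X)
print(Y.shape)                    # (1000, 2)
print(lle.reconstruction_error_)  # final value of the embedding cost
```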
Reconstruction error: $\varepsilon(W) = \sum_i \big\| x_i - \sum_j W_{ij} \, x_j \big\|^2$
$\Phi(Y) = \sum_i \big\| y_i - \sum_j W_{ij} \, y_j \big\|^2 = \sum_i \sum_j M_{ij} \, (y_i^T y_j)$, where $M = (I - W)^T (I - W)$,
s.t. $\sum_i y_i = 0$ (Y centered) and $\frac{1}{N} \sum_i y_i y_i^T = I$ (unit covariance).
Create a sparse matrix M = (I − W)^T (I − W). Set Y to the eigenvectors corresponding to the bottom d non-zero eigenvalues of M (the bottom eigenvector, with eigenvalue 0, is the constant vector and is discarded).
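A minimal NumPy sketch of this eigenvector step, assuming the weight matrix W has already been computed (dense here for clarity; real implementations keep M sparse):

```python
import numpy as np

def lle_embed(W, d):
    """Embedding step of LLE: bottom eigenvectors of M = (I - W)^T (I - W)."""
    N = W.shape[0]
    IW = np.eye(N) - W
    M = IW.T @ IW
    eigvals, eigvecs = np.linalg.eigh(M)  # eigenvalues in ascending order
    # the bottom eigenvector (eigenvalue ~ 0) is the constant vector, a free
    # translation of the embedding; discard it and keep the next d
    return eigvecs[:, 1:d + 1]
```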
$w_j = \dfrac{\sum_k C_{jk}^{-1}}{\sum_{l,m} C_{lm}^{-1}}$, where $C_{jk} = (x - \eta_j)^T (x - \eta_k)$ and $\eta_j$ is one of x's K nearest neighbors.
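In code, the explicit inverse can be avoided by solving the linear system C w = 1 and rescaling so the weights sum to one, which yields the same w_j. A sketch for a single point (names are illustrative; the small ridge term is a common regularization, since C is singular when K exceeds the input dimension):

```python
import numpy as np

def lle_weights(x, neighbors, reg=1e-3):
    """Reconstruction weights of one point x from its K nearest neighbors."""
    Z = neighbors - x                        # shift x to the origin, shape (K, D)
    C = Z @ Z.T                              # local covariance C_jk, shape (K, K)
    C += reg * np.trace(C) * np.eye(len(C))  # regularize when K > D
    w = np.linalg.solve(C, np.ones(len(C)))  # solve C w = 1
    return w / w.sum()                       # enforce the sum-to-one constraint
```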
SNE allows "many-to-one" mappings in which a single ambiguous object really belongs in several disparate locations in the low-dimensional space, while LLE makes a one-to-one mapping. p_{j|i} is the asymmetric probability that point i would pick point j as its neighbor (Gaussian neighborhood in the high-dimensional space).
q_{j|i} is the induced probability that point i picks point j as its neighbor (Gaussian neighborhood in the low-dimensional space).
t-SNE uses a Student-t distribution (heavier tails) rather than a Gaussian to compute the similarity between two points in the low-dimensional space, and a symmetrized version of the SNE cost function with simpler gradients.
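As a sketch (data and parameter values are illustrative), the Student-t similarities are simple to write down next to scikit-learn's TSNE:

```python
import numpy as np
from sklearn.manifold import TSNE

X = np.random.default_rng(0).normal(size=(500, 50))

# perplexity sets the effective size of the Gaussian neighborhood
# used for p_{j|i} in the high-dimensional space
Y = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)

# low-dimensional similarities use a Student-t kernel with one degree of
# freedom: q_ij is proportional to (1 + ||y_i - y_j||^2)^(-1)
d2 = ((Y[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
Q = 1.0 / (1.0 + d2)
np.fill_diagonal(Q, 0.0)  # q_ii is defined to be 0
Q /= Q.sum()              # normalize over all pairs
```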
Jing Chen and Yang Liu. Locally linear embedding: a survey. Artificial Intelligence Review (2011)
[Chen and Liu, 2011]
[Ting and Jordan, 2018]
Lab of Machine Intelligence and kNowledge Engineering (MINE): http://mine.kaust.edu.sa/