slide-1
SLIDE 1

Introduction, Difficulties and Perspectives

Tutorial on Manifold Learning with Medical Images

Diana Mateus

CAMP (Computer Aided Medical Procedures)
TUM (Technische Universität München) & Helmholtz Zentrum

September 22, 2011

1

slide-2
SLIDE 2

Outline

  • Manifold Learning
  • Three seminal algorithms
  • Common Practical Problems
  • Breaking the implicit assumptions
  • Determining the parameters
  • Mapping new points
  • Large data-sets
  • Conclusions

2

slide-3
SLIDE 3

Manifold learning

GOAL: dimensionality reduction

  • For data lying on (or close to) a manifold ...
  • find a new representation ...
  • that is low dimensional ...
  • allowing more efficient processing of the data.

3

slide-4
SLIDE 4

Three seminal algorithms

  • Isomap

[⊲ Joshua Tenenbaum, Vin de Silva, John Langford. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science, 2000.]

  • Locally Linear Embedding (LLE)

[⊲ Sam Roweis, Lawrence Saul. Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science, 2000.]

  • Laplacian Eigenmaps (LapEigs)

[⊲ M. Belkin, P. Niyogi. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation. Neural Computation, 15(6):1373-1396, June 2003.]

4

slide-5
SLIDE 5

Three seminal algorithms

The data is assumed to lie on or close to a Manifold

5

slide-6
SLIDE 6

Three seminal algorithms

Access is given only to a number of samples of the manifold (data points).
6

slide-7
SLIDE 7

Three seminal algorithms

Build a neighborhood graph using the samples; the graph approximates the manifold.

7

slide-8
SLIDE 8

Three seminal algorithms

Complete the graph by determining weights on the edges (between every pair of neighboring nodes). The graph can then be expressed in matrix form as a sparse, symmetric weight matrix

W = (wij),   with wij ≠ 0 only if nodes i and j are neighbors in the graph

(in the example, a 14 x 14 matrix whose nonzero entries w1,2, w1,3, w2,3, ..., w13,14 correspond to the edges of the graph).

8
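As a rough companion to slides 7-8, a neighborhood graph and its Gaussian weight matrix can be built in a few lines of NumPy. This is only a sketch, not code from the tutorial; the function name knn_weight_matrix and the choices of k and the bandwidth sigma are illustrative assumptions.

import numpy as np

def knn_weight_matrix(X, k=10, sigma=1.0):
    # Pairwise squared Euclidean distances between the N samples in X (N, D).
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    np.fill_diagonal(d2, np.inf)                 # no self-edges

    # Keep only the k nearest neighbors of each point.
    nn = np.argsort(d2, axis=1)[:, :k]

    N = X.shape[0]
    W = np.zeros((N, N))
    rows = np.repeat(np.arange(N), k)
    cols = nn.ravel()
    W[rows, cols] = np.exp(-d2[rows, cols] / sigma ** 2)   # Gaussian weights

    return np.maximum(W, W.T)                    # symmetrize: wij = wji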

slide-9
SLIDE 9

Three seminal algorithms

Find Y⋆ by optimizing some cost function J:

Y⋆ = min_Y J(T(W), Y)

9

slide-10
SLIDE 10

Three seminal algorithms

Y⋆ = min_Y J(T(W), Y)

In spectral methods the optimization of J can be expressed in the form:

min_{vl}  (vl⊤ T(W) vl) / (vl⊤ vl),   ∀ l ∈ {1, . . . , d}   (Rayleigh quotient)

10

slide-11
SLIDE 11

Three seminal algorithms

min_{vl}  (vl⊤ T(W) vl) / (vl⊤ vl)   (Rayleigh quotient)

+ constraints (orthonormality, centering of Y, ...)

Solved by a spectral decomposition of T(W).
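For concreteness, a minimal NumPy sketch of this last step, assuming T is the symmetric p.s.d. matrix T(W) already built from the graph; the function name spectral_embedding is an illustrative choice, not from the tutorial.

import numpy as np

def spectral_embedding(T, d):
    # Eigenvalues returned in ascending order; T is symmetric p.s.d.
    eigvals, eigvecs = np.linalg.eigh(T)
    # Skip the first (trivial, near-zero) eigenvector and keep the next d
    # as the columns of the low-dimensional representation Y.
    Y = eigvecs[:, 1:d + 1]
    return Y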

11

slide-12
SLIDE 12

Three seminal algorithms

  • Isomap

[⊲ Joshua Tenenbaum, Vin de Silva, John Langford. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science, 2000.]

  • Locally Linear Embedding (LLE)

[⊲ Sam Roweis, Lawrence Saul. Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science, 2000.]

  • Laplacian Eigenmaps (LapEigs)

[⊲ M. Belkin, P. Niyogi. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation. Neural Computation, 15(6):1373-1396, June 2003.]

12

slide-13
SLIDE 13

Algorithm: Locally Linear Embedding (LLE)

[⊲ Sam Roweis & Lawrence Saul. Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, 2000. ]

xi ≈ Σ_{xj ∈ N(xi)} wij xj

Find the reconstructing weights:

E(W) = Σ_i || xi − Σ_{xj ∈ N(xi)} wij xj ||²
13

slide-14
SLIDE 14

Algorithm: Locally Linear Embedding (LLE)

High-dim space R^D:   xi ≈ Σ_{xj ∈ N(xi)} wij xj

Preserve the weights.

Low-dim space R^d:   yi ≈ Σ_{yj ∈ N(yi)} wij yj

Find the new coordinates that preserve the reconstructing weights:

J(W, Y) = Σ_i || yi − Σ_{yj ∈ N(yi)} wij yj ||²

Using the transformation T(W) = (I − W)⊤(I − W), the solution is given by the d + 1 eigenvectors of T(W) corresponding to the smallest eigenvalues.
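A compact NumPy sketch of the two LLE steps above (reconstruction weights, then embedding). It is illustrative only: the names lle and reg (a small regularization added to the local Gram matrix for numerical stability) are assumptions, not part of the original algorithm description.

import numpy as np

def lle(X, d=2, k=10, reg=1e-3):
    N = X.shape[0]
    # k nearest neighbors of every point (self excluded); O(N^2 D) memory.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)
    np.fill_diagonal(d2, np.inf)
    nn = np.argsort(d2, axis=1)[:, :k]

    # Step 1: reconstruction weights, one small linear system per point,
    # minimizing ||x_i - sum_j w_ij x_j||^2 subject to sum_j w_ij = 1.
    W = np.zeros((N, N))
    for i in range(N):
        Z = X[nn[i]] - X[i]                     # centered neighbors, (k, D)
        G = Z @ Z.T                             # local Gram matrix, (k, k)
        G += reg * np.trace(G) * np.eye(k)      # regularization for stability
        w = np.linalg.solve(G, np.ones(k))
        W[i, nn[i]] = w / w.sum()

    # Step 2: embedding from the d + 1 bottom eigenvectors of
    # T(W) = (I - W)^T (I - W), discarding the constant one.
    I = np.eye(N)
    M = (I - W).T @ (I - W)
    eigvals, eigvecs = np.linalg.eigh(M)
    return eigvecs[:, 1:d + 1]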

14

slide-15
SLIDE 15

Three seminal algorithms

  • Isomap

[⊲ Joshua Tenenbaum, Vin de Silva, John Langford. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science, 2000.]

  • Locally Linear Embedding (LLE)

[⊲ Sam Roweis, Lawrence Saul. Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science, 2000.]

  • Laplacian Eigenmaps (LapEigs)

[⊲ M. Belkin, P. Niyogi. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation. Neural Computation, 15(6):1373-1396, June 2003.]

15

slide-16
SLIDE 16

Algorithm: Laplacian Eigenmaps (LapEig)

  • 1. Build a neighborhood graph with similarity values wij (binary, or Gaussian: wij = exp(−||xi − xj||² / σ²)).
  • 2. Define a cost that preserves neighborhood relations, i.e. if xj ∈ N(xi) → yj ∈ N(yi):

       J(W, Y) = Σ_i Σ_j wij (yi − yj)²

  • 3. Define D = diag(d1, . . . , dN), where di = Σ_j wij, and the Laplacian L = T(W) = D − W.
  • 4. The minimum of J(Y) is given by the eigenvectors of L corresponding to the smallest eigenvalues.
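A minimal NumPy/SciPy sketch of these four steps. One hedge: step 4 is implemented here as the generalized eigenproblem L v = λ D v, a common variant of the plain eigendecomposition of L stated above; all names and parameter values are illustrative.

import numpy as np
from scipy.linalg import eigh

def laplacian_eigenmaps(X, d=2, k=10, sigma=1.0):
    N = X.shape[0]
    # Step 1: neighborhood graph with Gaussian similarities wij.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)   # O(N^2 D) memory
    np.fill_diagonal(d2, np.inf)
    nn = np.argsort(d2, axis=1)[:, :k]
    W = np.zeros((N, N))
    for i in range(N):
        W[i, nn[i]] = np.exp(-d2[i, nn[i]] / sigma ** 2)
    W = np.maximum(W, W.T)                      # symmetrize so wij = wji

    # Step 3: degree matrix D and graph Laplacian L = D - W.
    D = np.diag(W.sum(axis=1))
    L = D - W

    # Step 4: smallest eigenvectors, here of L v = lambda D v,
    # dropping the trivial constant eigenvector.
    eigvals, eigvecs = eigh(L, D)
    return eigvecs[:, 1:d + 1]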

16

slide-17
SLIDE 17

Three seminal algorithms

  • Isomap
  • Locally Linear Embedding (LLE)
  • Laplacian Eigenmaps (LapEigs)

17

slide-18
SLIDE 18

Common Points and Differences

  • Non-linear: build a non-linear mapping R^D → R^d.
  • Graph-based: use a neighborhood graph to approximate the manifold.
  • Impose a preservation criterion:
    Isomap: geodesic distances (the metric structure of the manifold).
    LLE: the local reconstruction weights.
    LapEig: the neighborhood relations.
  • Closed-form solution obtained through the spectral decomposition of a p.s.d. matrix (spectral methods).
  • Global vs. local:
    Isomap: global, eigendecomposition of a full matrix.
    LLE, LapEig: local, eigendecomposition of a sparse matrix.

18

slide-19
SLIDE 19

Other popular algorithms

  • Kernel PCA

[⊲ B. Schölkopf, A. Smola, K.R. Müller. Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10(5):1299-1319, July 1998.]

  • LTSA (Local Tangent Space Alignment)

[⊲ Z. Zhang, H. Zha. Principal Manifolds and Nonlinear Dimensionality Reduction via Tangent Space Alignment. SIAM J. Sci. Comput., 26(1):313-338, January 2005.]

  • MVU (Maximum Variance Unfolding or Semidefinite Embedding)

[⊲ K.Q. Weinberger, L.K. Saul. An Introduction to Nonlinear Dimensionality Reduction by Maximum Variance Unfolding. National Conf. on Artificial Intelligence (AAAI), 2006.]

19

slide-20
SLIDE 20

Other popular probabilistic algorithms

  • GTM (Generative Topographic Mapping),

[⊲ C. M. Bishop, M. Svensén and C. K. I. Williams, GTM: The Generative Topographic Mapping, Neural Computation. 1998, 10:1, 215-234.]

  • GPLVM (Gaussian Process Latent Variable Models)

[⊲ N. Lawrence, Probabilistic Non-linear Principal Component Analysis with Gaussian Process Latent Variable Models, Journal of Machine Learning Research 6(Nov):1783–1816, 2005.]

  • Diffusion maps

[⊲ R.R. Coifman and S. Lafon. Diffusion maps. Applied and Computational Harmonic Analysis, 2006 ]

  • t-SNE (t-Distributed Stochastic Neighbor Embedding)

[⊲ L.J.P. van der Maaten, G.E. Hinton. Visualizing High-Dimensional Data Using t-SNE. Journal of Machine Learning Research, 9(Nov):2579-2605, 2008.]

20

slide-21
SLIDE 21

Code

Laurens van der Maaten released a Matlab toolbox featuring 25 dimensionality reduction techniques: http://ticc.uvt.nl/~lvdrmaaten/Laurens_van_der_Maaten/Home.html
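For Python users, comparable implementations of several of the methods above ship with scikit-learn's manifold module; a minimal usage example with placeholder data follows (not part of the tutorial).

import numpy as np
from sklearn.manifold import Isomap, LocallyLinearEmbedding, SpectralEmbedding

X = np.random.rand(500, 64)                     # placeholder data, (N, D)

Y_iso = Isomap(n_neighbors=10, n_components=2).fit_transform(X)
Y_lle = LocallyLinearEmbedding(n_neighbors=10, n_components=2).fit_transform(X)
Y_lap = SpectralEmbedding(n_neighbors=10, n_components=2).fit_transform(X)   # LapEig-style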

21

slide-22
SLIDE 22

Outline

  • Manifold Learning
  • Three seminal algorithms
  • Common Practical Problems
  • Breaking the implicit assumptions
  • Determining the parameters
  • Mapping new points
  • Large data-sets
  • Conclusions

22

slide-23
SLIDE 23

Breaking implicit assumptions

  • The manifold assumption
  • The sampling assumption

23

slide-24
SLIDE 24

The manifold assumption

The high-dimensional input data lies on or close to a lower-dimensional manifold.

Manifold definition

A manifold is a topological space that is locally Euclidean.

[⊲ Wolfram MathWorld]

  • Does a low-dimensional manifold really exist?
  • Does the data form clusters in low dimensional subspaces?
  • How much noise can be tolerated?

Problem: in many cases we do not know this in advance.

24

slide-25
SLIDE 25

The sampling assumption

It is possible to acquire enough data samples, with uniform density, from the manifold. But how many samples are really enough?

25

slide-26
SLIDE 26

Breaking the assumptions

  • Manifold assumption: clusters!
  • Sampling assumption: low number of samples w.r.t. the complexity of the manifold.
  • Images of objects with high-order deformations.

→ Disconnected components (good for clustering).
→ Unexpected results.
→ Not better than PCA or other linear methods.

26

slide-27
SLIDE 27

Good data examples

  • Rhythmic motions: motion-capture walk, breathing/cardiac motions, musical pieces.
  • Images/motions with a reduced number of deformation “modes”: MNIST dataset, population studies of a rigid organ.
  • Images with smooth changes in viewpoint/lighting.

(walk.mp4)

27

slide-28
SLIDE 28

Outline

  • Manifold Learning
  • Three seminal algorithms
  • Common Practical Problems
  • Breaking the implicit assumptions
  • Determining the parameters
  • Mapping new points
  • Large data-sets
  • Conclusions

28

slide-29
SLIDE 29

Determining the parameters: Neighborhood Size

So far there is no standard, principled way to automatically estimate the size of the neighborhood:

  • Usually k-NN or ε-ball neighborhoods are used (a sketch follows below).
  • There are attempts to make it adaptive to the local density of the points.

[⊲ Jing Wang, Zhenyue Zhang. Adaptive Manifold Learning. NIPS, 2004.]
[⊲ Z. Zhang, J. Wang, H. Zha. PAMI, 2011.]

29
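A small sketch contrasting the two usual neighborhood choices with scikit-learn's graph helpers; the data, k and radius values are placeholders, not recommendations from the tutorial.

import numpy as np
from sklearn.neighbors import kneighbors_graph, radius_neighbors_graph

X = np.random.rand(300, 10)                     # placeholder data

A_knn = kneighbors_graph(X, n_neighbors=8, mode='connectivity')
A_eps = radius_neighbors_graph(X, radius=0.5, mode='connectivity')

# Both are sparse adjacency matrices. With an eps-ball, points in sparse
# regions may end up isolated; with k-NN every point gets k edges, but
# edges may jump across low-density gaps.
print(A_knn.nnz, A_eps.nnz)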

slide-30
SLIDE 30

Determining the parameters: Dimensionality

What is the dimension of the manifold that best captures the structure of the data set? Intuitively, the intrinsic dimensionality of the manifold is the number of independent parameters needed to pick out a unique point on the manifold.

Example: d = 2 (manifold-walk.mp4)

30

slide-31
SLIDE 31

Determining the parameters: Dimensionality

  • In PCA and Isomap, the dimensionality is chosen by imposing a threshold over the residual variance (e.g. 95%); a sketch follows below.
  • In general, the error of the minimized cost function J does not decrease with an increasing number of dimensions d, so a similar residual-error criterion cannot be applied.

[Figure: decrease in error as the dimensionality d of Y is increased, for PCA and Isomap.]

[⊲ Tenenbaum et al., Science, 2000.]

31
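As a concrete illustration of the eigenvalue-based criterion in the first bullet, a PCA sketch with a 95% explained-variance threshold; the placeholder data and the name d_hat are illustrative assumptions.

import numpy as np
from sklearn.decomposition import PCA

X = np.random.rand(1000, 50)                    # placeholder data, (N, D)

pca = PCA().fit(X)
cum_var = np.cumsum(pca.explained_variance_ratio_)
d_hat = int(np.searchsorted(cum_var, 0.95) + 1) # smallest d reaching 95% variance
print("estimated dimensionality:", d_hat)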

slide-32
SLIDE 32

Determining the parameters: Dimensionality

Techniques for intrinsic dimensionality estimation.

  • Eigenvalue-based estimation (like in PCA)
  • Maximum Likelihood Estimator (MLE)

[⊲ E. Levina and P. J. Bickel, Maximum Likelihood Estimation of Intrinsic Dimension, Advances in Neural Information Processing Systems (NIPS), 777-784, 2005.]

  • Correlation dimension (CorrDim)

[⊲ P. Grassberger and I. Procaccia. Measuring the strangeness of strange attractors. Physica D: Nonlinear Phenomena, 9:189-208, 1983.]

  • Nearest neighbor evaluation (NearNb)

[⊲ Costa, J.A.; Girotra, A.; Hero, A.O.; Estimating Local Intrinsic Dimension with k-Nearest Neighbor Graphs. IEEE Workshop on Statistical Signal Processing, 2005]

  • Packing numbers (PackingNumbers)

[⊲ B. Kégl. Intrinsic dimension estimation using packing numbers. Advances in Neural Information Processing Systems (NIPS). 2002]

  • Geodesic minimum spanning tree (GMST)

[⊲ J. Costa, A. Hero. Manifold Learning with Geodesic Minimal Spanning Trees. Computing Research Repository (CoRR), 2003.]

See also:
[⊲ J. Theiler. Statistical precision of dimension estimators. Physical Review A, 41(6):3038-3051, 1990.]
[⊲ F. Camastra. Data dimensionality estimation methods: a survey. Pattern Recognition, 36:2945-2954, 2003.]
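A sketch of the MLE estimator listed above, assuming the usual Levina-Bickel form m̂_k(x) = [ (1/(k−1)) Σ_{j=1..k−1} log(T_k(x)/T_j(x)) ]⁻¹ averaged over all points, with T_j the distance from x to its j-th nearest neighbor; treat it as illustrative rather than a reference implementation.

import numpy as np

def mle_intrinsic_dim(X, k=10):
    # Distances to the k nearest neighbors of every point (self excluded).
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)
    np.fill_diagonal(d2, np.inf)
    T = np.sqrt(np.sort(d2, axis=1)[:, :k])     # T_1 <= ... <= T_k per point
    # Per-point estimate [1/(k-1) * sum_j log(T_k / T_j)]^-1, then average.
    logs = np.log(T[:, -1:] / T[:, :-1])
    m_hat = (k - 1) / logs.sum(axis=1)
    return m_hat.mean()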

32

slide-33
SLIDE 33

Outline

  • Manifold Learning
  • Three seminal algorithms
  • Common Practical Problems
  • Breaking the implicit assumptions
  • Determining the parameters
  • Mapping new points
  • Large data-sets
  • Conclusions

33

slide-34
SLIDE 34

Mapping new points

I have a new point, how do I project it onto the low-dim representation?

  • There is no linear projection like in PCA, ynew = Pxnew.
  • Most methods work with batch data and directly find (without a projection step) the coordinates of each data point in the low-dimensional space.
34

slide-35
SLIDE 35

Mapping new points: Kernel Regression

35

slide-36
SLIDE 36

Mapping new points: Kernel Regression

Find the neighborhood points of xnew: xj ∈ N(xnew), xj ∈ X, using a kernel function k(xnew, xj).

36

slide-37
SLIDE 37

Mapping new points: Kernel Regression

xj ∈ N(xnew) → yj

37

slide-38
SLIDE 38

Mapping new points: Kernel Regression

ynew = ( Σ_{j | xj ∈ N(xnew)} k(xnew, xj) yj ) / ( Σ_{j | xj ∈ N(xnew)} k(xnew, xj) )
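A direct transcription of this formula into NumPy: X and Y stand for the training points and their learned low-dimensional coordinates, and the Gaussian kernel with bandwidth sigma is one possible choice of k(·,·); names are illustrative, not from the tutorial.

import numpy as np

def map_new_point(x_new, X, Y, k=10, sigma=1.0):
    # Neighbors of x_new among the training points X.
    d2 = np.sum((X - x_new) ** 2, axis=1)
    nn = np.argsort(d2)[:k]
    # Kernel values k(x_new, x_j) for the neighbors (Gaussian kernel here).
    w = np.exp(-d2[nn] / sigma ** 2)
    # Weighted average of the neighbors' low-dimensional coordinates.
    return (w[:, None] * Y[nn]).sum(axis=0) / w.sum()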

38

slide-39
SLIDE 39

Backprojection

Back-projection (from R^d back to R^D) is only implemented for some techniques, e.g. GPLVM. Similarly, kernel regression can be used to find this map.

39

slide-40
SLIDE 40

Mapping new points: learn a mapping along with the embedding

  • GPLVM: iterative, slow convergence.
  • DRLIM
    [⊲ R. Hadsell, S. Chopra, Y. LeCun. Dimensionality Reduction by Learning an Invariant Mapping. CVPR, 2006.]
    • Uses contrastive learning.
    • Defines a parametric function to map from R^D → R^d.
    • Convolutional neural networks.
  • P-DRUR
    [⊲ Parametric Dimensionality Reduction by Unsupervised Regression. CVPR, 2010.]
    • Learns both mappings R^D → R^d and R^d → R^D.
    • Mappings are radial basis function expansions.
    • Variational problem.
  • Kernel Map
    [⊲ S. Gerber, T. Tasdizen, R. Whitaker. Dimensionality Reduction and Principal Surfaces via Kernel Map Manifolds. ICCV, 2009.]

[Figure: DRLIM]

40

slide-41
SLIDE 41

Outline

  • Manifold Learning
  • Three seminal algorithms
  • Common Practical Problems
  • Breaking the implicit assumptions
  • Determining the parameters
  • Mapping new points
  • Large data-sets
  • Conclusions

41

slide-42
SLIDE 42

Large data-sets: the problem

For N samples:

  • Find nearest neighbors: O(N²).
  • Spectral decomposition of T(W) (a symmetric positive semi-definite matrix): O(N³), solving T(W) v = λ v.

Iterative eigensolvers (Jacobi, Arnoldi, Hebbian) help, but:

  • They need matrix-vector products and several passes over the data.
  • They are not suitable for large dense matrices (Isomap); they work better for sparse matrices (LapEig). A sketch follows below.
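A sketch of the sparse, iterative route (Lanczos/Arnoldi via SciPy's eigsh) for a Laplacian-like T(W). The random matrix is only a placeholder for a real neighborhood-graph Laplacian, and the tiny diagonal shift is an implementation convenience that keeps the shift-invert factorization of the singular Laplacian well posed.

import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import eigsh

N, d = 10000, 2
# Placeholder sparse symmetric weight matrix; in practice W comes from the
# neighborhood graph of the data.
W = sp.random(N, N, density=1e-3, format='csr')
W = W + W.T
L = sp.diags(np.asarray(W.sum(axis=1)).ravel()) - W       # Laplacian T(W) = D - W

# d + 1 smallest eigenpairs via shift-invert Lanczos; the tiny shift keeps
# the factorization of the (singular) Laplacian non-singular.
vals, vecs = eigsh(L + 1e-9 * sp.identity(N), k=d + 1, sigma=0, which='LM')
Y = vecs[:, 1:]                                            # drop the first (trivial) eigenvector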

42

slide-43
SLIDE 43

Large data-sets: some solutions

  • 1. Hashing-based nearest neighbors

[⊲ W. Liu, J. Wang, S. Kumar, S.F. Chang. Hashing with Graphs. ICML, 2011.]

  • 2. Use landmarks
    • Randomly chosen, with linear reconstruction for the remaining points.

[⊲ V. de Silva, J.B. Tenenbaum. Global versus local methods in nonlinear dimensionality reduction. Advances in Neural Information Processing Systems (NIPS), 2003.]

    • Setting up a sparse regression problem based on preserving the principal angles.

[⊲ J. Silva, J.S. Marques, J. Miranda Lemos. Selecting Landmark Points for Sparse Manifold Learning. Neural Information Processing Systems (NIPS), 2005.]

43

slide-44
SLIDE 44

Large data-sets: some solutions

  • 3. Sampling-based approximation methods for the spectral decomposition.

[⊲ S. Kumar, M. Mohri, A. Talwalkar. On sampling-based approximate spectral decomposition. ICML, 2009.]
[⊲ K. Zhang, I. Tsang, J. Kwok. Improved Nyström Low Rank Approximation and Error Analysis. ICML, 2008.]

  • Nyström:   Ã_Nys = C B⁻¹ C⊤   →   O(l^3 + nld)
  • Column-sampling:   Ã_col = C ( √(l/n) (C⊤C)^(1/2) )⁻¹ C⊤   →   O(nl^2)

Different methods to sample the columns: uniform, adaptive, ensemble, ...
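A small NumPy sketch of the Nyström route with uniform column sampling, following Ã_Nys = C B⁻¹ C⊤; the eigenpair scaling follows the standard Nyström extension, and the function name nystrom_eig is an illustrative assumption.

import numpy as np

def nystrom_eig(A, l, seed=0):
    # A: (n, n) symmetric p.s.d. matrix, l: number of sampled columns.
    n = A.shape[0]
    rng = np.random.default_rng(seed)
    idx = rng.choice(n, size=l, replace=False)     # uniform column sampling
    C = A[:, idx]                                  # (n, l) sampled columns
    B = A[np.ix_(idx, idx)]                        # (l, l) intersection block

    lam, U = np.linalg.eigh(B)                     # small eigenproblem: O(l^3)
    keep = lam > 1e-10                             # drop (near-)null directions
    lam, U = lam[keep], U[:, keep]

    # Nystrom extension of the eigenpairs to the full matrix A.
    lam_full = (n / l) * lam
    V_full = np.sqrt(l / n) * (C @ U) / lam
    return lam_full, V_full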

44

slide-45
SLIDE 45

Large data-sets: some solutions

Experiments on large databases (18M samples).

[⊲ Talwalkar, Kumar & Rowley. Large-Scale Face Manifold Learning. CVPR, 2008.]

45

slide-46
SLIDE 46

Large data-sets: some solutions

  • 4. Random projections: a variant of the k-d tree which automatically adapts to the intrinsic low-dimensional structure of the data.

[⊲ Y. Freund, S. Dasgupta, M. Kabra, N. Verma. Learning the structure of manifolds using random projections. Neural Information Processing Systems (NIPS), 2007.]
[⊲ C. Hegde, R. Baraniuk. Random Projections for Manifold Learning. Neural Information Processing Systems (NIPS), 2007.]
[⊲ Goldberg. Online semi-supervised learning. ECML, 2008.]

46

slide-47
SLIDE 47

Conclusions

Checklist to verify before using a manifold learning method:

  • Do I need a non-linear mapping?
  • How likely is it that the data actually lies close to a manifold?
  • Can I acquire a reasonable number of samples, in accordance with the manifold complexity?
  • Do I have any a priori information on the intrinsic dimensionality of the manifold?
  • Is all the data available at once, or do new points need to be mapped?
  • Is the optimization of the map (the cost J) coherent with my task?

47

slide-48
SLIDE 48

Conclusions

Recall that some solutions exist in case of:

  • Need to map new points.
  • Large data-sets.

48

slide-49
SLIDE 49

Thanks for your attention!

49