Nonlinear Dimensionality Reduction Donovan Parks Overview Direct - - PowerPoint PPT Presentation

nonlinear dimensionality reduction
SMART_READER_LITE
LIVE PREVIEW

Nonlinear Dimensionality Reduction Donovan Parks Overview Direct - - PowerPoint PPT Presentation

Nonlinear Dimensionality Reduction Donovan Parks Overview Direct visualization vs. dimensionality reduction Nonlinear dimensionality reduction techniques: ISOMAP, LLE, Charting A fun example that uses non- metric, replicated


slide-1
SLIDE 1

Nonlinear Dimensionality Reduction

Donovan Parks

slide-2
SLIDE 2

Overview

 Direct visualization vs.

dimensionality reduction

 Nonlinear dimensionality reduction

techniques:

 ISOMAP, LLE, Charting

 A fun example that uses non-

metric, replicated MDS

slide-3
SLIDE 3

Direct visualization

 Visualize all dimensions

Sources: Chuah (1998), Wegman (1990)

slide-4
SLIDE 4

Dimensionality reduction

 Visualize the intrinsic low-dimensional structure

within a high-dimensional data space

 Ideally 2 or 3 dimensions so data can be

displayed with a single scatterplot

Dimensionality Reduction

slide-5
SLIDE 5

When to use:

 Direct visualization:

 Interested in relationships between

attributes (dimensions) of the data

 Dimensionality reduction:

 Interested in geometric relationships

between data points

slide-6
SLIDE 6

Nonlinear dimensionality reduction

 Isometric mapping (ISOMAP)

 Mapping a Manifold of Perceptual Observations.

Joshua B. Tenenbaum. Neural Information Processing Systems, 1998.

 Locally Linear Embedding (LLE)

 Think Globally, Fit Locally: Unsupervised

Learning of Nonlinear Manifolds. Lawrence K. Saul & Sam T. Roweis. University of Pennsylvania Technical Report MS-CIS-02-18, 2002.

 Charting

 Charting a Manifold. Matthew Brand, NIPS

2003.

slide-7
SLIDE 7

Why do we need nonlinear dimensionality reduction?

X Y

Linear DR (PCA, Classic MDS, ...) Nonlinear DR (Metric MDS , ISOMAP, LLE, ...)

slide-8
SLIDE 8

ISOMAP

 Extension of multidimensional

scaling (MDS)

 Considers geodesic instead of

Euclidean distances

slide-9
SLIDE 9

Geodesic vs. Euclidean distance

Source: Tenenbaum, 1998

slide-10
SLIDE 10

Calculating geodesic distances

 Q: How do we calculate geodesic

distance?

slide-11
SLIDE 11

ISOMAP Algorithm

  Construct neighborhood graph  Compute geodesic distance matrix  Apply favorite MDS algorithm

Geodesic Distance Matrix 1 2 3 Observations in High -D space Neighborhood Graph ISOMAP Embedding

Modified from: Tenenbaum, 1998

slide-12
SLIDE 12

Example: ISOMAP vs. MDS

slide-13
SLIDE 13

Example: Punctured sphere

 ISOMAP generally fails for manifolds

with holes

slide-14
SLIDE 14

+/-’s of ISOMAP

 Advantages:

 Easy to understand and implement

extension of MDS

 Preserves “true” relationship between

data points

 Disadvantages:

 Computationally expensive  Known to have difficulties with “holes”

slide-15
SLIDE 15

Locally Linear Embedding (LLE)

 Forget about global constraints, just

fit locally

 Why? Removes the need to

estimate distances between widely separated points

 ISOMAP approximates such distances

with an expensive shortest path search

slide-16
SLIDE 16

Are local constraints sufficient? A Geometric Interpretation

 Maintains approximate global structure

since local patches overlap

slide-17
SLIDE 17

Are local constraints sufficient? A Geometric Interpretation

 Maintains approximate global structure

since local patches overlap

slide-18
SLIDE 18

LLE Algorithm

2

( )

i ij j i j

W X W X

  • =
  • 2

( )

i ij j i j

Y Y W Y

  • =
  • Source: Saul, 2002
slide-19
SLIDE 19

Example: Synthetic manifolds

Modified from: Saul, 2002

slide-20
SLIDE 20

Example: Real face images

Source: Roweis, 2000

slide-21
SLIDE 21

+/-’s of LLE

 Advantages:

 More accurate in preserving local

structure than ISOMAP

 Less computationally expensive than

ISOMAP

 Disadvantages:

 Less accurate in preserving global

structure than ISOMAP

 Known to have difficulty on non-convex

manifolds (not true of ISOMAP)

slide-22
SLIDE 22

Charting

 Similar to LLE in that it considers

  • verlapping “locally linear patches”

(called charts in this paper)

 Based on a statistical framework

instead of geometric arguments

slide-23
SLIDE 23

Charting the data

 Place Gaussian at each point and estimate

covariance over local neighborhood

 Brand derives method for

determining optimal covariances in the MAP sense

 Enforces certain constraints to ensure

nearby Gaussians (charts) have similar covariance matrices

slide-24
SLIDE 24

Find local coordinate systems

 Use PCA in each chart to determine local

coordinate system

Local Coordinate Systems

slide-25
SLIDE 25

Connecting the charts

 Exploit overlap of each

neighborhood to determine how to connect the charts

 Brand suggest a

weighted least squares problem to minimize error in the projection of common points

Embedded Charts

slide-26
SLIDE 26

Example: Noisy synthetic data

Source: Brand, 2003

slide-27
SLIDE 27

+/-’s of Charting

 Advantage:

 More robust to noise than LLE or

ISOMAP

 Disadvantage:

 More testing needed to demonstrate

robustness to noise

 Unclear computational complexity

 Final step is quadratic in the number of

charts

slide-28
SLIDE 28

Conclusion: +/-’s of dimensionality reduction

 Advantages:

 Excellent visualization of relationship

between data points

 Limitations:

 Computationally expensive  Need many observations  Do not work on all manifolds

slide-29
SLIDE 29

Action Synopsis: A fun example

 Action Synopsis: Pose Selection and Illustration.

Jackie Assa, Yaron Caspi, Daniel Cohen-Or. ACM Transactions on Graphics, 2005.

Source: Assa, 2005

slide-30
SLIDE 30

Aspects of motion

 Input: pose of person at each frame  Aspects of motion:

 Joint position  Joint angle  Joint velocity  Joint angular velocity

Source: Assa, 2005

slide-31
SLIDE 31

Dimensionality reduction

 Problem: How can these aspects of motion

be combined?

 Solution: non-metric, replicated MDS

 distance matrix for each aspect of motion  best preserves rank order of distances across

several distance matrices

 Essentially NM-RMDS implicitly weights

each distance matrix

Source: Assa, 2005

slide-32
SLIDE 32

Pose selection

 Problem: how do you select

interesting poses from the “motion curve”?

 Typically 5-9 dimensions

 Assa et al. argue that

interesting poses occur at “locally extreme points”

Source: Assa, 2005

slide-33
SLIDE 33

Finding locally extreme points

Source: Assa, 2005

slide-34
SLIDE 34

Do you need dimensionality reduction?

Source: Assa, 2005

slide-35
SLIDE 35

Example: Monkey bars

Source: Assa, 2005

slide-36
SLIDE 36

Example: Potential application

Source: Assa, 2005

slide-37
SLIDE 37

Critique of Action Synopsis

Pros:

+ Results are convincing + Justified algorithm with user study

Cons:

  • Little justification for selected aspects of

motion

  • Requiring pose information as input is

restrictive

  • Unclear that having RMDS implicitly

weight aspects of motion is a good idea

slide-38
SLIDE 38

Literature

Papers covered:

Mapping a Manifold of Perceptual Observations. Joshua B. Tenenbaum. Neural Information Processing Systems, 1998.

Think Globally, Fit Locally: Unsupervised Learning of Nonlinear

  • Manifolds. Lawrence Saul & Sam Roweis. University of Pennsylvania

Technical Report MS-CIS-02-18, 2002.

Charting a Manifold. Matthew Brand, NIPS 2003.

Action Synopsis: Pose Selection and Illustration. Jackie Assa, Yaron Caspi, Daniel Cohen-Or. ACM Transactions on Graphics, 2005.

Additional reading:

Multidimensional scaling. Forrest W. Young. Forrest.psych.unc.edu/teaching/p208a/mds/mds.html

A Global Geometric Framework for Nonlinear Dimensionality

  • Reduction. Joshua B. Tenenbaum, Vin de Silva, John C. Langford,

Science, v. 290 no.5500, 2000.

Nonlinear dimensionality reduction by locally linear embedding. Sam Roweis & Lawrence Saul. Science v.290 no.5500, 2000.

Further citations:

Information Rich Glyphs for Software Management. M.C. Chuah and S.G. Eick, IEEE CG&A 18:4 1998.

Hyperdimensional Data Analysis Using Parallel Coordinates. Edward J.

  • Wegman. Journal of the American Statistical Association, Vol. 85, No.
  • 411. (Sep., 1990), pp. 664-675.