CS 171: Visualization
High-Dimensional Data
Hanspeter Pfister pfister@seas.harvard.edu
CS 171: Visualization High-Dimensional Data Hanspeter Pfister - - PowerPoint PPT Presentation
CS 171: Visualization High-Dimensional Data Hanspeter Pfister pfister@seas.harvard.edu This Week PII due Monday, April 8 Friday lab (10-11:30 am, MD G115): Hands-On D3 with Azalea, Sofia, and Billy (our last lab!) Th: Alberto Cairo
Hanspeter Pfister pfister@seas.harvard.edu
with Azalea, Sofia, and Billy (our last lab!)
A Functional Art: Storytelling with Data, Graphs, Maps, and Diagrams
Item
Attribute
ggplot2
R
ggplot2
R, lattice
R, lattice
Tableau
Protovis
19
Becker 1996
D3
“Hyperdimensional Data Analysis Using Parallel Coordinates”, Wegman, 1990 Based on slide from Munzner
“Hyperdimensional Data Analysis Using Parallel Coordinates”, Wegman, 1990 Based on slide from Munzner
D3
D3
Velocity (magnitude & direction) Vorticity (scalar, CW/CCW) Strain Tensor (second order) Turbulent Charge (vector & scalar)
42
Based on slide from P . Liang
Project the high-dimensional data onto a lower- dimensional subspace using linear or non-linear transformations
Based on slide from P . Liang
Based on slide from F. Sha
h"p://www.youtube.com/watch?v=4pnQd6jnCWk
Project data to a subspace such as to maximize the variance of the projected data
Based on slide from J. Leskovec
PC vectors are orthogonal
x1 x2 x1 1 x2 1
x2 x1
x1 x2 x1 1 0.7 x2 0.7 1
x2 x1
x1 x2 x1 1
x2
1
x2 x1
x2 x1
x2 x1
x2 x1
PC ¡1 PC ¡2
Enough PC vectors to cover 80-90% of the variance
Based on slide from J. Leskovec
Screeplot
HasGe ¡et ¡al.,”The ¡Elements ¡of ¡StaGsGcal ¡Learning: ¡Data ¡Mining, ¡Inference, ¡and ¡PredicGon”, ¡Springer ¡(2009)
Gunnar ¡Grimnes: h"p:// www.flickr.com/ photos/gromgull/ 3329844591/in/ photostream/
h"p://www.youtube.com/watch?v=7jLXDyQxck
First two PC directions First three PC directions
>45 features, projected onto two PC dimensions
–Find ¡a ¡set ¡of ¡points ¡whose ¡pairwise ¡distances ¡match ¡a ¡ given ¡distance ¡matrix
p1 p2 p3 p4 p5 p1 1 2 3 1 p2 1 2 4 1 p3 2 2 1 3 p4 3 4 1 1 p5 1 1 3 1
p1 p4 p2 p3 p5
1 2 2 3 4 1 1 1
Based on slide from T. Yang
! !
– Distance ¡= ¡1 ¡for ¡friends – Distance ¡= ¡2 ¡for ¡friends ¡of ¡friends ¡; ¡etc.
PCA
Based on slide from F. Sha
the global structure
Based on slide from F. Sha
Young[38]