Information Visualization 10-1 Ronald Peikert SciVis 2007 - - - PowerPoint PPT Presentation

information visualization
SMART_READER_LITE
LIVE PREVIEW

Information Visualization 10-1 Ronald Peikert SciVis 2007 - - - PowerPoint PPT Presentation

Information Visualization 10-1 Ronald Peikert SciVis 2007 - Information Visualization Overview Techniques for high-dimensional data scatter plots, PCA parallel coordinates link + brush pixel-oriented techniques


slide-1
SLIDE 1

Information Visualization

Ronald Peikert SciVis 2007 - Information Visualization 10-1

slide-2
SLIDE 2

Overview

Techniques for high-dimensional data

  • scatter plots, PCA
  • parallel coordinates
  • link + brush
  • pixel-oriented techniques
  • icon-based techniques

Techniques for hierarchical data and networks

  • trees: tree maps
  • graph clustering
  • distortion, focus+context

Ronald Peikert SciVis 2007 - Information Visualization 10-2

slide-3
SLIDE 3

High-dimensional data

"Dimension" refers often to data channels (attributes), not to true spatial dimension (coordinates) spatial dimension (coordinates). Roles of data and coordinates can be swapped: In scatter plots (multi-dimensional histograms) data become coordinates and vice versa Often no spatial coordinates exist, e.g. in visualization of (relational) data bases.

Ronald Peikert SciVis 2007 - Information Visualization 10-3

slide-4
SLIDE 4

Scatter plots

(2D) scatter plots are projections to 2D subspaces spanned by pairs of coordinate axes. n-dimensional data lead to a nxn matrix of scatter plots. n dimensional data lead to a nxn matrix of scatter plots. For small n, the matrix can directly serve as a visualization. Example (n=4):

Ronald Peikert SciVis 2007 - Information Visualization 10-4

Image credit: M. Ward

slide-5
SLIDE 5

Dimension reduction

Often the n attributes are not independent and the scatter plot lies almost in a k-dimensional linear subspace. Principal component analysis (PCA):

  • For each pair (X,Y) of attributes compute the covariance

( )( )

1

cov( , ) 1

n i i i

X X Y Y X Y n

=

− − = ∑ resulting in the (symmetric, positive semidefinite) covariance t i 1 n − matrix cov( , ) cov( , ) cov( , ) cov( , ) cov( , ) cov( , ) X X X Y X Z C Y X Y Y Y Z ⎛ ⎞ ⎜ ⎟ = ⎜ ⎟ ⎜ ⎟

Ronald Peikert SciVis 2007 - Information Visualization 10-5

cov( , ) cov( , ) cov( , ) Z X Z Y Z Z ⎜ ⎟ ⎜ ⎟ ⎝ ⎠

slide-6
SLIDE 6

Dimension reduction

  • Compute the eigenvalues (nonnegative) and eigenvectors

(orthogonal)

  • Sort eigenvectors by descending eigenvalues
  • Sort eigenvectors by descending eigenvalues
  • Project data to subspace defined by the mean and

the first k eigenvectors (directions of largest data variation)

( , , , ) X Y Z

Y Y

Ronald Peikert SciVis 2007 - Information Visualization 10-6

X X

slide-7
SLIDE 7

Parallel coordinates

Visualization method of parallel coordinates (Inselberg1985):

  • n parallel and equidistant axes (one per attribute)
  • axes scaled to [min max] range of corresponding attribute
  • axes scaled to [min, max] range of corresponding attribute
  • every data item is represented by a polyline which intersects

each of the axes at the point corresponding to its attribute value

Attr.1 Attr.2 Attr.3 Attr.n

Ronald Peikert SciVis 2007 - Information Visualization 10-7

Attr.1 Attr.2 Attr.3 Attr.n

slide-8
SLIDE 8

Parallel coordinates

What can be done with parallel coordinates? line in 10-space 5-dim sphere

  • Linear or spherical arrangement can be "seen" (according to

p g ( g Inselberg)

  • Algorithm for testing if a point is in the convex hull of a set of

points: check if the polyline is within the two envelopes of the set

Ronald Peikert SciVis 2007 - Information Visualization 10-8

points: check if the polyline is within the two envelopes of the set

  • f polylines
slide-9
SLIDE 9

Parallel coordinates

Queries in parallel coordinates:

  • Brushing technique (example: "ParallAX" tool by A. Inselberg)

Ronald Peikert SciVis 2007 - Information Visualization 10-9

slide-10
SLIDE 10

Parallel coordinates

  • Linked views (link & brush technique)

Ronald Peikert SciVis 2007 - Information Visualization 10-10

slide-11
SLIDE 11

Pixel-oriented techniques

Space-filling curves for query-independent visualization of database Idea: represent each record by a single pixel

  • map one attribute to color map sorting key to space-filling curve
  • map one attribute to color, map sorting key to space-filling curve

Peano-Hilbert Morton (Z-Curve)

Ronald Peikert SciVis 2007 - Information Visualization 10-11

slide-12
SLIDE 12

Pixel-oriented techniques

Spiral technique (Keim) for query dependent visualization.

  • sort records (near a query point) by distance to query
  • map sorted list to spiral
  • map sorted list to spiral

Spiral technique: Example: result of a complex query

C l di O ll di t d

Ronald Peikert SciVis 2007 - Information Visualization 10-12

Color coding: Overall distance and distance of attributes 1..5

slide-13
SLIDE 13

Pixel-oriented techniques

Axes technique (Keim) for query dependent visualization.

  • for two selected attributes, separate space into lower/higher

attribute values attribute values

  • draw spirals per quadrant

r2

  • s. Δattr
  • neg. p
  • neg. pos. Δattr1

C l di O ll

Ronald Peikert SciVis 2007 - Information Visualization 10-13

Color coding: Overall distance and distance of attributes 1..8

slide-14
SLIDE 14

Icon-based techniques

Chernoff Faces:

  • two attributes are mapped to the display axes
  • remaining attributes are mapped to shape and size of hair
  • remaining attributes are mapped to shape and size of hair,

eyebrows, eyes, nose, mouth, etc. Idea: Use the human ability to recognize and memorize faces. Example:

Ronald Peikert SciVis 2007 - Information Visualization 10-14

slide-15
SLIDE 15

Icon-based techniques

Stick figures (Grinstein)

  • two attributes are mapped to the display axes
  • remaining attributes are mapped to lengths of limbs or angles
  • remaining attributes are mapped to lengths of limbs or angles

between them Idea: Texture pattern in visualization shows certain characteristics. Example: Stick figure icon a family of stick figures Stick figure icon a family of stick figures

Ronald Peikert SciVis 2007 - Information Visualization 10-15

Image credit: G. Grinstein

slide-16
SLIDE 16

Icon-based techniques

Example: census data (age, income, sex, education, etc.) It can be observed that the structure is more homogenous for higher incomes than for lower ones

Ronald Peikert SciVis 2007 - Information Visualization 10-16

higher incomes than for lower ones.

slide-17
SLIDE 17

Hierarchical and network data

Mathematical description of hierarchies and networks: graphs. Some important special types of graphs: Some important special types of graphs:

  • undirected graphs
  • directed graphs
  • directed acyclic graphs (DAGs)
  • rooted trees
  • unrooted trees
  • unrooted trees

(In an unrooted tree, every node can be chosen as the root.)

  • forests, etc.

Ronald Peikert SciVis 2007 - Information Visualization 10-17

slide-18
SLIDE 18

Cone trees

Cone trees (Robertson) are 3D embeddings of trees.

  • Children arranged on circular cones
  • Navigation by interactive rotation at all hierarchy levels
  • Navigation by interactive rotation at all hierarchy levels

Useful for trees with high branching (no binary trees!) Example: file system visualization

Ronald Peikert SciVis 2007 - Information Visualization 10-18

Image credit: S. Card

slide-19
SLIDE 19

Tree maps

Trees with weight attribute at nodes can be visualized using tree maps (Johnson and Shneiderman). Tree maps are special Venn diagrams where Tree maps are special Venn diagrams where

  • subtrees are represented by rectangles
  • rectangle area is proportional to total weight of the subtree
  • split direction is vertical/horizontal for odd/even hierarchy level
  • nodes can have colors, labels, tool-tip info, etc.

Standard Venn diagram Tree map

Ronald Peikert SciVis 2007 - Information Visualization 10-19

Image credit: B. Shneiderman

slide-20
SLIDE 20

Tree maps

Example: file system with 1000 files

Image credit: B Shneiderman

Application: Combining tree maps and node-link diagrams (Zhao)

Image credit: B. Shneiderman

Ronald Peikert SciVis 2007 - Information Visualization 10-20

slide-21
SLIDE 21

P bl f t

Tree maps

Problem of tree maps: In large trees, hierarchical levels can be hard to see Examples "file system" and "organization"

Ronald Peikert SciVis 2007 - Information Visualization 10-21

slide-22
SLIDE 22

Tree maps

Solution: Cushion tree maps (van Wijk, van de Wetering 99) Idea: give rectangles a height profile, with height depending on the hierarchy level Example (1D): height profile for a binary tree. p ( ) g p y

Height profile = Sum of bump functions Bump functions for 1st, 2nd and 3rd level

Ronald Peikert SciVis 2007 - Information Visualization 10-22

Shaded height profile

slide-23
SLIDE 23

Tree maps

[ ]

Height function for an interval at the ith leveI

( ) ( )( )

1 2

4

i

h z x f x x x x Δ = − −

[ ]

1 2

, x x

It defines a bump with a peak height of

( ) ( )( )

1 2 2 1

z x f x x x x x x Δ −

(with user defined parameters h and f)

( )

2 1 i

f h x x −

(with user-defined parameters h and f) Modification for 2D: Alternate between vertical and horizontal "ridges", i.e. for even numbers i use the function:

( ) ( )( )

1 2

4 ,

i

h z x y f y y y y Δ = − −

Ronald Peikert SciVis 2007 - Information Visualization 10-23

( ) ( )( )

1 2 2 1

y y y y y y y −

slide-24
SLIDE 24

St d d hi t i " i ti " l

Tree maps

Standard vs. cushion tree maps in "organization" example

h = 0.5, f = 1 h = 0.5, f = 0.75 h = 0.5, f = 0.5

Ronald Peikert SciVis 2007 - Information Visualization 10-24

slide-25
SLIDE 25

St d d hi t i "fil t " l

Tree maps

Standard vs. cushion tree maps in "file system" example

h 0 5 f 1 h = 0.5, f = 1

Ronald Peikert SciVis 2007 - Information Visualization 10-25

slide-26
SLIDE 26

Tree maps

2nd problem of tree maps: bad aspect ratios Example: 7 children with weights 6 6 4 3 2 2 1: Example: 7 children with weights 6,6,4,3,2,2,1: Worst aspect ratios: 16:1 and 36:1 Worst aspect ratios: 16:1 and 36:1 Solution: Squarified tree maps (Bruls et al.) Idea: allow both vertical and horizontal splits within the same level

  • f the tree

Ronald Peikert SciVis 2007 - Information Visualization 10-26

  • f the tree.
slide-27
SLIDE 27

Tree maps

"Squarification" algorithm: Sort children by descending weight weight While list of children non empty

  • Insert the first child, splitting

th l d the larger edge

  • Repeat

– "Squeeze" the next child q into the same "row" (along the shorter edge) If aspect ratio is worse – If aspect ratio is worse than that of previous step, undo the step (steps 3 6 8 in example) and

Ronald Peikert SciVis 2007 - Information Visualization 10-27

3,6,8 in example) and break out of inner loop

Worst accepted aspect ratio: 25:9

slide-28
SLIDE 28

Tree maps

Squarified tree maps need visual cues for hierarchy levels. Squarified tree maps of "File system" and "organization": … and with cushions added:

Ronald Peikert SciVis 2007 - Information Visualization 10-28

slide-29
SLIDE 29

Tree maps

Alternative hierarchy enhancement techniques:

  • Nesting
  • Nesting
  • Frames

with profiles: applied to examples:

Ronald Peikert SciVis 2007 - Information Visualization 10-29

slide-30
SLIDE 30

Tree maps

Application: SequoiaView tool for file system visualization

http://www.win.tue.nl/sequoiaview/

Ronald Peikert SciVis 2007 - Information Visualization 10-30

(http://www.win.tue.nl/sequoiaview)

slide-31
SLIDE 31

Clustering techniques

Motivation for clustering in visualization of graphs (networks): Multiple levels of detail are obtained by identifying "highly Multiple levels-of-detail are obtained by identifying highly connected" subsets and representing them by glyphs Clustering techniques are often based on force models. Assume an undirected graph G=(V E) with set of nodes V and set Assume an undirected graph G=(V,E) with set of nodes V and set

  • f edges E.

Notation: edge connecting nodes i and j

ij

e

position of node i

j i

p

ij i j

p = − p p

Ronald Peikert SciVis 2007 - Information Visualization 10-31

slide-32
SLIDE 32

The attractive force is usually Hooke's spring law

Clustering techniques

y p g where is the zero energy length of the spring.

x

( ) ( )

f x A x x = ⋅ −

The repulsive force generally follows an inverse square law inspired by electrostatic fields: by electrostatic fields:

( )

2

B g x x =

The total potential energy is then:

x

( )

2

1 2

ij

ij e E i j ij

A P p x B p

∈ ≠

= − −

∑ ∑

Ronald Peikert SciVis 2007 - Information Visualization 10-32

slide-33
SLIDE 33

Difficult to visualize: small world graphs (Watts and Strogatz)

Clustering techniques

Difficult to visualize: small world graphs (Watts and Strogatz). Small world graphs are connected graphs having

  • a small average path length (between pair of nodes), and
  • a high clustering index,

both compared to a random graph with the same number of nodes and edges. The clustering index of a node v is the ratio between

  • number of existing edges in the 1-neighborhood N(v) of v
  • number of possible edges, which is k(k-1)/2 if

The clustering index of the graph is the average of the clustering

( )

k N v =

Ronald Peikert SciVis 2007 - Information Visualization 10-33

The clustering index of the graph is the average of the clustering indices of its nodes.

slide-34
SLIDE 34

Energy models suited for small-world problems:

Clustering techniques

gy p r-PolyLog energy models (Noack): Potential energy:

( ) ( )

ln

ij

r ij ij e E i j

P p x p

∈ ≠

= − −

∑ ∑

Attractive and repulsive forces are obtained by taking derivative. For 1-PolyLog:

( )

1 f x =

( )

1 g x =

Minimum energy configuration of 1-PolyLog has the property: Distance between two clusters C1 and C2 is inversely

( )

1 f x =

( )

g x x =

Distance between two clusters C1 and C2 is inversely proportional to their coupling:

{ }

1 2

: ,

ij

e i C j C ∈ ∈

Ronald Peikert SciVis 2007 - Information Visualization 10-34

{ }

1 2 1 2

,

ij

j C C

slide-35
SLIDE 35

Clustering techniques

Example small world graph:

  • 500 painters/sculptors
  • 2486 connections
  • average path length 4
  • clustering index 0.18

(random graph: 0 0093) (random graph: 0.0093)

Video credit: F van Ham TU Eindhoven Ronald Peikert SciVis 2007 - Information Visualization 10-35 Video credit: F. van Ham, TU Eindhoven

slide-36
SLIDE 36

Clustering techniques

Example: citations between Vis papers Color coding:

  • blue: volume vis
  • red: flow vis
  • green: terrain surfaces
  • green: terrain, surfaces
  • yellow: info vis

Ronald Peikert SciVis 2007 - Information Visualization 10-36

slide-37
SLIDE 37

Distortion techniques

Perspective wall (Robertson) Example: documents arranged on a perspective wall Example: documents arranged on a perspective wall

Ronald Peikert SciVis 2007 - Information Visualization 10-37

Image credit: S. Card

slide-38
SLIDE 38

Distortion techniques

Table lens (Rao and Card)

Inxight software Image credit: R. Rao

Ronald Peikert SciVis 2007 - Information Visualization 10-38

slide-39
SLIDE 39

Distortion techniques

Hyperbolic trees are based on the Poincaré Disk model (projection)

  • f the hyperbolic space H2.

In the Poincaré Disk, the role of straight lines is taken by

  • circles which intersect the bounding circle

circles which intersect the bounding circle

  • rthogonally, and
  • diameters of the bounding circle.

2 2

1 x y + =

M.C. Escher's "Circle Limit III", 1958, illustrates lines (white circles).

Ronald Peikert SciVis 2007 - Information Visualization 10-39

slide-40
SLIDE 40

Distortion techniques

Property of Poincaré Disk:

  • Triangles have sum of angles < 180°

g g

  • It has the metric

2 2 2 2

1 dx dy ds + =

  • The bounding circle is at infinity

Circle perimeter grows exponentially with its radius

2 2

1 x y − −

  • Circle perimeter grows exponentially with its radius.

As a consequence, trees can be drawn undistorted in hyperbolic space:

  • all edges having about the same length and

Ronald Peikert SciVis 2007 - Information Visualization 10-40

  • all nodes having the same angle available for their children
slide-41
SLIDE 41

Distortion techniques

Rigid transformations of Poincaré Disk: Möbius transformations of complex numbers: These are

( )

, 1 , 1 1

θ

θ θ θ + ′ = = = < +

c

z c z T z c c z

  • for c = 0: rotations around 0
  • for θ = 1: translations (mapping 0 to c and c to 0)
  • for θ = 1: translations (mapping 0 to c and -c to 0)
  • combinations:

( )

( )

( )

θ θ θ

=

c c c

T T z T z

with

( )

( )

( )

2 2 1 1

θ θ θ c c c 2 1 2 1 2 1 1 2

, 1 1 θ θ θ θ θ θ θ + + = = c c c c c

Ronald Peikert SciVis 2007 - Information Visualization 10-41

2 1 2 2 1 2

, 1 1 θ θ + + c c c c

slide-42
SLIDE 42

Distortion techniques

Hyperbolic tree technique (Lamping et al.). Change of focus, i.e. moving a different node towards the center, is achieved by performing a translation in hyperbolic space is achieved by performing a translation in hyperbolic space.

Example: Visualization of a large organizational hierarchy in hyperbolic space with different foci

Ronald Peikert SciVis 2007 - Information Visualization 10-42

Image credit: R. Rao