CS-5630 / CS-6630 Visualization Tables Alexander Lex - - PowerPoint PPT Presentation

cs 5630 cs 6630 visualization tables
SMART_READER_LITE
LIVE PREVIEW

CS-5630 / CS-6630 Visualization Tables Alexander Lex - - PowerPoint PPT Presentation

CS-5630 / CS-6630 Visualization Tables Alexander Lex alex@sci.utah.edu [xkcd] dataset types spatial channels are the most effective for all attribute types recall: attribute semantics when we arrange tabular data, attributes are chosen to


slide-1
SLIDE 1

CS-5630 / CS-6630 Visualization Tables

Alexander Lex alex@sci.utah.edu

[xkcd]

slide-2
SLIDE 2

dataset types

slide-3
SLIDE 3
slide-4
SLIDE 4

spatial channels are the most effective for all attribute types

slide-5
SLIDE 5

recall: attribute semantics

when we arrange tabular data, attributes are chosen to be keys and values

multidimensional

slide-6
SLIDE 6

Scale of Tables

Need different approaches for “normal” and “high- dimensional” tables.

Homogeneity

Same data type? Same scales?

Age Gender Height Bob 25 M 181 Alice 22 F 185 Chris 19 M 175 BPM 1 BPM 2 BPM 3 Bob 65 120 145 Alice 80 135 185 Chris 45 115 135

How many dimensions?

~50 – tractable with “just” vis ~1000 – need analytical methods

How many records?

~ 1000 – “just” vis is fine >> 10,000 – need analytical methods

slide-7
SLIDE 7

Analytic Component

no / little analytics strong analytics 
 component

Scatterplot Matrices


[Bostock]

Parallel Coordinates


[Bostock]

Pixel-based visualizations /
 heat maps Multidimensional Scaling

[Doerk 2011] [Chuang 2012]

slide-8
SLIDE 8

Express Values

No Keys

slide-9
SLIDE 9

encode using zero keys: scatterplots

slide-10
SLIDE 10

Encode one Key Attribute

slide-11
SLIDE 11

encode one key attribute:
 bar, dot, & line charts

slide-12
SLIDE 12

Encode Multiple Key Attributes

slide-13
SLIDE 13
slide-14
SLIDE 14

Stacked Bar Chart

slide-15
SLIDE 15

Comparison of bar chart types

Small 
 Multiples Stacked bar chart Pie Chart Layered
 Bar
 Chart Grouped
 Bar 
 Chart

Streit & Gehlenborg, PoV, Nature Methods, 2014

slide-16
SLIDE 16

Stacked Area Chart

http://stackoverflow.com/questions/2225995/how-can-i-create-stacked-line-graph-with-matplotlib

slide-17
SLIDE 17

100% Stacked Area Chart

http://stackoverflow.com/questions/16875546/create-a-100-stacked-area-chart-with-matplotlib

slide-18
SLIDE 18

Stacked Area vs. Line Graphs

leancrew.com & Practically Efficient

slide-19
SLIDE 19

VizWiz, A. Kriebel

slide-20
SLIDE 20

Table Lens

Rao & Card 1994

slide-21
SLIDE 21

Bertifier

Matrix/Table representation Authoring Interface

http://www.aviz.fr/bertifier Charles Perin, Pierre Dragicevic and Jean-Daniel Fekete

slide-22
SLIDE 22

LineUp

Video at http://lineup.caleydo.org

slide-23
SLIDE 23

Rankings are popular

23

slide-24
SLIDE 24

University Harvard, ¡USA Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. Score 84.2 44.0 64.3 73.8 89.4 Score

slide-25
SLIDE 25

25

Support Multiple Attributes

slide-26
SLIDE 26

University Harvard, ¡USA Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. Score A B C

Score ¡= ¡f(A, ¡B, ¡C)

slide-27
SLIDE 27

Combiner functions: f(A,B,C)

(Weighted) sum
 Score = wa A + wb B + wc C Maximum
 Score = max(A, B, C) Product Nesting …

àSerial ¡ àParallel ¡ àComplex
 ¡Combiners ¡

slide-28
SLIDE 28

Serial Combiner

University Harvard, ¡USA Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. A B C

¡ ¡ ¡ ¡ ¡ ¡wa ¡A ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡+ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡wb ¡B ¡ ¡ ¡ ¡ ¡ ¡ ¡+ ¡ ¡ ¡ ¡ ¡ ¡ ¡wc ¡C

(as Stacked Bar)

slide-29
SLIDE 29

Serial Combiner

University Harvard, ¡USA Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. A B C (as Stacked Bar)

wa ¡A + + wb ¡B wc ¡C

slide-30
SLIDE 30

Serial Combiner

University Harvard, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA A B C (as Stacked Bar)

wa ¡A + + wb ¡B wc ¡C

slide-31
SLIDE 31
slide-32
SLIDE 32

Flexible Mapping of
 Attributes to Scores

slide-33
SLIDE 33

Min Max 100

1

slide-34
SLIDE 34

100

1

slide-35
SLIDE 35

100

1

slide-36
SLIDE 36

36

slide-37
SLIDE 37

37

Compare Rankings

slide-38
SLIDE 38

Bump Charts

Rank

2. 5. 4. 3. 1. Score University Harvard, ¡USA Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 2. 5. 4. 3. 1. Score Score

(+1) (-­‑2) (+1)

slide-39
SLIDE 39

Bump Charts

Rank

2. 5. 3. 1. Score University Oxford, ¡UK Cambridge, ¡UK Princeton, ¡USA MIT, ¡USA Rank 5. 4. 3. 1. Score Score

(+1)

4. Harvard, ¡USA 2.

(-­‑2) (+1)

4. Harvard, ¡USA 2.

(-­‑2)

slide-40
SLIDE 40

Video showing:

  • Creating snapshot for comparison
  • Play with weights
  • Show delta
  • Select by clicking on slopegraph
slide-41
SLIDE 41

http:/ /lineup.caleydo.org

41

slide-42
SLIDE 42

Pixel Based Displays

Each cell is a “pixel”, value 
 encoded in color / value Ordering critical for interpretation If no ordering inherent, 
 clustering is used Scalable – 1 px per item Good for homogeneous data

same scale & type

[Gehlenborg & Wong 2012]

slide-43
SLIDE 43

3D Pitfall: Occlusion & Perspective

[Gehlenborg and Wong, Nature Methods, 2012]

slide-44
SLIDE 44

3D Pitfall: Occlusion & Perspective

[Gehlenborg and Wong, Nature Methods, 2012]

slide-45
SLIDE 45

Heterogeneous Data?

[Verhaak 2012]

slide-46
SLIDE 46

Bad Color Mapping

slide-47
SLIDE 47

Good Color Mapping

slide-48
SLIDE 48

Color is relative!

slide-49
SLIDE 49

Clustered Heat Map

slide-50
SLIDE 50

Multiple Line Charts

http://square.github.io/cubism/

slide-51
SLIDE 51

Combining Various Charts

slide-52
SLIDE 52

Design Critique

slide-53
SLIDE 53

Document: https://goo.gl/W6w0iI Website: http://goo.gl/D3mIsy

slide-54
SLIDE 54

Spatial Axis Orientation

slide-55
SLIDE 55

spatial axis orientation

slide-56
SLIDE 56
slide-57
SLIDE 57

Spatial Axis Orientation

Scatterplot Matrix

slide-58
SLIDE 58

Scatterplot Matrices (SPLOM)

Matrix of size d*d Each row/column is one dimension Each cell plots a scatterplot of two dimensions

slide-59
SLIDE 59

Scatterplot Matrices

Limited scalability (~20 dimensions, ~500-1k records) Brushing is important Often combined with “Focus Scatterplot” as F+C technique Algorithmic approaches: Clustering & aggregating records Choosing dimensions Choosing order

slide-60
SLIDE 60

SPLOM Aggregation - Heat Map

Datavore: http://vis.stanford.edu/projects/datavore/splom/

slide-61
SLIDE 61

SPLOM F+C, Navigation

[Elmqvist]

slide-62
SLIDE 62

Spatial Axis Orientation

Parallel Coordinates

slide-63
SLIDE 63

Parallel Coordinates (PC)

Axes represent attributes Lines connecting axes represent items

Inselberg 1985

A B X Y X Y A B A B

slide-64
SLIDE 64

Parallel Coordinates

Each axis represents dimension Lines connecting axis represent records Suitable for

all tabular data types heterogeneous data

slide-65
SLIDE 65

PC Limitation: 
 Scalability to Many Dimensions

500 axes

slide-66
SLIDE 66

PC Limitation: Scalability to Many Items

Solutions:

Transparency Bundling, Clustering Sampling

slide-67
SLIDE 67

PC Limitations 


Correlations only between adjacent axes

Solution: Interaction

Brushing Let user change order

slide-68
SLIDE 68

PC Limitation: 
 Ambiguity

Solutions:

Brushing Curves

Graham and Kennedy 2003

slide-69
SLIDE 69

Parallel Coordinates

Shows primarily relationships between adjacent axis Limited scalability (~50 dimensions, ~1-5k records)

Transparency of lines

Interaction is crucial

Axis reordering Brushing Filtering

Algorithmic support: Choosing dimensions Choosing order Clustering & aggregating records

http://bl.ocks.org/jasondavies/1341281

slide-70
SLIDE 70

HIERARCHICAL PARALLEL COORDINATES

goal: scale up parallel coordinates to large datasets

challenge: overplotting/occlusion

Fua 1999

slide-71
SLIDE 71

HPC: ENCODING DERIVED DATA

visual representation: variable- width opacity bands

show whole cluster, not just single item min / max: spatial position cluster density: transparency mean: opaque

Fua 1999

slide-72
SLIDE 72

HPC: INTERACTING WITH DERIVED DATA

interactively change level of detail to navigate cluster hierarchy

Fua 1999

slide-73
SLIDE 73

Star Plot

Similar to parallel coordinates Radiate from a common origin

[Coekin1969]

http://www.itl.nist.gov/div898/handbook/eda/section3/starplot.htm http://start1.jpl.nasa.gov/caseStudies/autoTool.cfm

http://bl.ocks.org/kevinschaul/raw/8833989/

slide-74
SLIDE 74

Data Reduction

Sampling

Don’t show every element, show a (random) subset Efficient for large dataset Apply only for display purposes Outlier-preserving approaches

Filtering

Define criteria to remove data, e.g.,

minimum variability > / < / = specific value for one dimension consistency in replicates, …

Can be interactive, combined with 
 sampling

[Ellis & Dix, 2006]

slide-75
SLIDE 75

Spatial Axis Orientation

Hybrids

slide-76
SLIDE 76

Flexible Linked Axes (FLINA)

Claessen & van Wijk 2011

slide-77
SLIDE 77

Web-based implementation of 
 FLINA concept

http://vis.pku.edu.cn/mddv/val/ ¡

slide-78
SLIDE 78

Connected Charts

Viau ¡& ¡McGuffin ¡2012 ¡

slide-79
SLIDE 79
  • rigin

ARTISTS Australia Europe North America studio albums WcountH continent first album WyearH number one hits

5 Countries 5 Artists

start of career WyearH career status in business at first album inactive gender gender ∩ inactive sold albums WabsoluteH COUNTRIES population WmillionH Barbados Ireland Sweden UK US

Rihanna U2 ABBA Elton John The Beatles Whitney Houston The Black Eyed Peas Britney Spears Eminem Michael Jackson Madonna Elvis Presley Australia France Italy Sweden Span Austria Germany Netherlands Ireland UK US Canada

inactive active male group female

Artists Countries 12 12 1

Domino

Gratzl ¡et ¡al. ¡2014 ¡

slide-80
SLIDE 80

Spatial Axis Orientation

Parallel Sets

slide-81
SLIDE 81

Parallel Sets

builds on PC to better handle categorical data

discrete small number of values no implied ordering between attributes

task: find relationship between attributes interaction driven technique

slide-82
SLIDE 82

Visual Encoding

boxes scaled by frequency color coded by values for current active dimension

Bendix, Kosara, Hauser, 2005

slide-83
SLIDE 83

Bendix, Kosara, Hauser, 2005

Visual Encoding

  • boxes expand to show histogram
slide-84
SLIDE 84

Bendix, Kosara, Hauser, 2005

Interaction: Reorder

slide-85
SLIDE 85

Bendix, Kosara, Hauser, 2005

Interaction: Aggregate

slide-86
SLIDE 86

Bendix, Kosara, Hauser, 2005

Interaction: Filter

slide-87
SLIDE 87

Bendix, Kosara, Hauser, 2005

Interaction: Highlight

slide-88
SLIDE 88
slide-89
SLIDE 89

Filling Space

slide-90
SLIDE 90

filling space

slide-91
SLIDE 91

Dense pixel display: VisDB

represent each data item, or each attribute in an item as a single pixel can fit as many items on the screen as there are pixels,

  • n the order of millions

relies heavily on color coding challenge: what’s the layout?

slide-92
SLIDE 92

The data…

large database where each item has multiple attributes (on the order of 10) goal: visualize the relevance of set of items which satisfy a query plot out data items in a spiral pattern,

  • rdered by relevance

Keim, Kreigel, 1994

slide-93
SLIDE 93

relevance

  • dim. 1
  • dim. 2
  • dim. 3
  • dim. 4
  • dim. 5

factor

Keim, Kreigel, 1994

slide-94
SLIDE 94
  • c. Grouping Arrangement
  • a. Basic Visualization Technique

Keim, Kreigel, 1994