Information Visualization Crash Course (AKA Information - - PowerPoint PPT Presentation

information visualization crash course
SMART_READER_LITE
LIVE PREVIEW

Information Visualization Crash Course (AKA Information - - PowerPoint PPT Presentation

Information Visualization Crash Course (AKA Information Visualization 101) Chad Stolper Assistant Professor Southwestern University (graduated from Georgia Tech CS PhD) 1 What is Infovis? Why is it Important? Human Perception Chart Basics


slide-1
SLIDE 1

Information Visualization Crash Course

Chad Stolper Assistant Professor Southwestern University

(graduated from Georgia Tech CS PhD) 1

(AKA Information Visualization 101)

slide-2
SLIDE 2

What is Infovis? Why is it Important? Human Perception Chart Basics

(If Time, Some Color Theory)

The Shneiderman Mantra Where to Learn More

2
slide-3
SLIDE 3

What is Information Visualization?

4
slide-4
SLIDE 4

Information Visualization

“The use of computer-supported, interactive, visual representations of abstract data to amplify cognition.” Card, Mackinlay, and Shneiderman 1999

5
slide-5
SLIDE 5

Communication Exploratory Data Analysis (EDA)

6
slide-6
SLIDE 6

Communication

7

(gone wrong)

slide-7
SLIDE 7 8
slide-8
SLIDE 8 9 X

Edward Tufte

An American statistician and professor emeritus of political science, statistics, and computer science at Yale University. He is noted for his writings

  • n information design and

as a pioneer in the field of data visualization.

  • Wikipedia
slide-9
SLIDE 9

Space Shuttle Challenger

January 28, 1986

10

What happened?

Morning Temperature: 31°F

slide-10
SLIDE 10 11
slide-11
SLIDE 11 12
slide-12
SLIDE 12 13
slide-13
SLIDE 13 14

https://www.youtube.com/watch?v=6Rwcbsn19c0

slide-14
SLIDE 14

How did this happen?

15
slide-15
SLIDE 15

Morton Thiokol’s Presentation

16
slide-16
SLIDE 16 17
slide-17
SLIDE 17 18
slide-18
SLIDE 18 20
slide-19
SLIDE 19 21
slide-20
SLIDE 20 26
slide-21
SLIDE 21 27
slide-22
SLIDE 22 28
slide-23
SLIDE 23 29
slide-24
SLIDE 24 30
slide-25
SLIDE 25

So, communication is extremely important.

Visualization can help with that – communicate ideas and insights.

31
slide-26
SLIDE 26 32 http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html
slide-27
SLIDE 27

Visualization can also help with Exploratory Data Analysis (EDA)

But why do you need to explore data at all???

33
slide-28
SLIDE 28

“There are three kinds of lies: lies, damned lies, and statistics.”

35
slide-29
SLIDE 29

Mystery Data Set

36
slide-30
SLIDE 30

Mystery Data Set

Property Value mean( x ) 9 variance ( x ) 11 mean( y ) 7.5 variance ( y ) 4.122 correlation ( x,y ) 0.816 Linear Regression Line y = 3 + 0.5x

37
slide-31
SLIDE 31 38
slide-32
SLIDE 32 39
slide-33
SLIDE 33 40
slide-34
SLIDE 34 41
slide-35
SLIDE 35 42
slide-36
SLIDE 36

Anscombe’s Quartet

43

https://en.wikipedia.org/wiki/Anscombe%27s_quartet

slide-37
SLIDE 37

Anscombe’s Quartet Sanity Checking Models Outlier Detection

44
slide-38
SLIDE 38

Anscombe’s Quartet Sanity Checking Models Outlier Detection

45
slide-39
SLIDE 39

Anscombe’s Quartet Sanity Checking Models Outlier Detection

46
slide-40
SLIDE 40

Anscombe’s Quartet Sanity Checking Models Outlier Detection

47
slide-41
SLIDE 41

Data visualization leverages human perception

49
slide-42
SLIDE 42

Name the five senses.

51
slide-43
SLIDE 43 52

Sense Bandwidth (bits/sec) Sight 10,000,000 Touch 1,000,000 Hearing 100,000 Smell 100,000 Taste 1,000

http://www.britannica.com/EBchecked/topic/287907/information-theory/214958/Physiology
slide-44
SLIDE 44

A (Simple) Model

  • f Human Visual Perception
53
slide-45
SLIDE 45

A (Simple) Model of Human Perception

Parallel detection of basic features into an iconic store Serial processing

  • f object

identification and spatial layout

Stage 1 Stage 2

54
slide-46
SLIDE 46

Stage 1: Pre-Attentive Processing

Rapid Parallel Automatic

(Fleeting = lasting for a short time)

55
slide-47
SLIDE 47

Stage 2: Serial Processing Relatively Slow (Incorporates Memory) Manual

56
slide-48
SLIDE 48

Stage 1: Pre-Attentive Processing

The eye moves every 200ms

57
slide-49
SLIDE 49

Stage 1: Pre-Attentive Processing

The eye moves every 200ms (so this processing occurs every 200ms-250ms)

58
slide-50
SLIDE 50

Example

1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686

59
slide-51
SLIDE 51

Example

1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686

60
slide-52
SLIDE 52

A few more examples from

  • Prof. Chris Healy at NC State
61
slide-53
SLIDE 53 62

Left Side Right Side

slide-54
SLIDE 54 63

Left Side Right Side

slide-55
SLIDE 55

Raise your hand if a RED DOT is present…

64
slide-56
SLIDE 56 65
slide-57
SLIDE 57 66
slide-58
SLIDE 58

Color (hue) is pre-attentively processed.

67
slide-59
SLIDE 59

Raise your hand if a RED DOT is present…

68
slide-60
SLIDE 60 69
slide-61
SLIDE 61 70
slide-62
SLIDE 62

Shape is pre-attentively processed.

71
slide-63
SLIDE 63

Determine if a RED DOT is present…

72
slide-64
SLIDE 64 73
slide-65
SLIDE 65 74
slide-66
SLIDE 66

Hue and shape together are NOT pre-attentively processed.

75
slide-67
SLIDE 67

Pre-Attentive Processing

  • length
  • width
  • size
  • curvature
  • number
  • terminators
  • intersection
  • closure
  • hue
  • lightness
  • flicker
  • direction of motion
  • binocular lustre
  • stereoscopic depth
  • 3-D depth cues
  • lighting direction
76
slide-68
SLIDE 68

Stephen Few “Now You See It”

  • pg. 39
77
slide-69
SLIDE 69

Pre-Attentive à Cognitive

78
slide-70
SLIDE 70

Gestalt Psychology

Berlin, Early 1900s

79
slide-71
SLIDE 71

Gestalt Psychology

Goal was to understand pattern perception

Gestalt (German) = “seeing the whole picture all at once”

Identified 8 “Laws of Grouping”

80

http://study.com/academy/lesson/gestalt-psychology-definition-principles-quiz.html

slide-72
SLIDE 72

Gestalt Psychology

  • 1. Proximity
  • 2. Similarity
  • 3. Closure
  • 4. Symmetry
  • 5. Common Fate
  • 6. Continuity
  • 7. Good Gestalt
  • 8. Past Experience
81
slide-73
SLIDE 73

How many groups are there?

82
slide-74
SLIDE 74 83
slide-75
SLIDE 75

Proximity

84
slide-76
SLIDE 76

How many groups are there?

85
slide-77
SLIDE 77 86
slide-78
SLIDE 78

Similarity

87
slide-79
SLIDE 79

How many shapes are there?

88
slide-80
SLIDE 80 89
slide-81
SLIDE 81

Closure

90
slide-82
SLIDE 82

How many items are there?

91
slide-83
SLIDE 83

( ) { } [ ]

92
slide-84
SLIDE 84

( ) { } [ ]

Symmetry

93
slide-85
SLIDE 85

How many sets are there?

94
slide-86
SLIDE 86 95
slide-87
SLIDE 87 96

Common Fate

slide-88
SLIDE 88

How many objects are there?

97
slide-89
SLIDE 89 98
slide-90
SLIDE 90

Continuity

99
slide-91
SLIDE 91

How many objects are there?

100
slide-92
SLIDE 92 101
slide-93
SLIDE 93

Good Gestalt

102
slide-94
SLIDE 94

What is this word? (Please Shout)

103
slide-95
SLIDE 95

FLICK

104
slide-96
SLIDE 96

Past Experience

FLICK

105
slide-97
SLIDE 97

Past Experience

FLICK

106
slide-98
SLIDE 98

Pre-Attentive Processing Gestalt Laws

107
slide-99
SLIDE 99

Detect Quickly

108
slide-100
SLIDE 100

Detect quickly does NOT mean

detect accurately

Ideally you want both.

109
slide-101
SLIDE 101 110 Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.

Positions Rectangular areas

(aligned or in a treemap)

Angles Circular areas

slide-102
SLIDE 102 111 Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.

Crowdsourced Results

1.0 1.5 2.0 2.5 3.0 T1 T2 T3 T4 T5 T6 T7 T8 T9

Log Error

Positions Rectangular areas (aligned or in a treemap) Angles Circular areas
slide-103
SLIDE 103 Automating the Design of Graphical Presentations l 125

More accurate Less accurate

I I

Position

IMll

1 I

Length

F-l

Iha I

I0.I

I I

Volume

rl

l¶kJ Color

cl

mot Shown)
  • Fig. 14. Accuracy ranking of quantitative
perceptual tasks. Higher tasks are accom- plished more accurately than lower tasks. Cleveland and McGill empirically verified the basic properties of this ranking. Quantitative Ordinal Nominal Position Position Color Saturation Position Color Hue Texture Connection Containment Density Color Saturation Color Saturation Shape Length Angle Slope Area Volume
  • Fig. 15. Ranking of perceptual tasks. The tasks shown in the gray boxes are not relevant to these
types of data.

An example analysis for area perception is shown in Figure 16. The top line shows that a series of decreasing areas can be used to encode a tenfold quantitative

  • range. Of course, in a real diagram such as Figure 13, the areas would be laid out

randomly, making it more difficult to judge the relative sizes of different areas accurately (hence, area is ranked fifth in Figure 14). Nevertheless, small mis- judgments about the size of an area only leads to small misperceptions about the corresponding quantitative value that is encoded. The middle line shows that area can encode three ordinal values. However, one must be careful to make sure

ACM Transactions
  • n Graphics, Vol. 5, No. 2, April
1986.

Mackinlay, 1986

112
slide-104
SLIDE 104

Stephen Few “Now You See It”

  • pg. 41
113
slide-105
SLIDE 105

What does this tell us?

114
slide-106
SLIDE 106

Barcharts, scatterplots, and line charts are really effective for quantitative data

20 40 20 40 20 40 20 40 20 40

115
slide-107
SLIDE 107

(and for statistical distributions) Tukey Box Plots

119
slide-108
SLIDE 108

Median Outliers Largest < Q3 + 1.5 IQR Smallest > Q1 - 1.5 IQR Largest < Q3 Smallest > Q1

120
slide-109
SLIDE 109 121
slide-110
SLIDE 110

Tufte’s Chart Principles

122

Edward Tufte

slide-111
SLIDE 111

Tufte’s Chart Principles

DO NOT LIE!

Maximize Data-Ink Ratio Minimize Chart Junk

125
slide-112
SLIDE 112

Tufte’s Chart Principles

DO NOT LIE!

Maximize Data-Ink Ratio Minimize Chart Junk

126
slide-113
SLIDE 113 127
slide-114
SLIDE 114 128
slide-115
SLIDE 115

http://www.perceptualedge.com/blog/?p=790

129
slide-116
SLIDE 116 130 http://xkcd.com/1138/
slide-117
SLIDE 117

10 20 30 10 20 30 10 20 30 10 20 30 10 20 30

131
slide-118
SLIDE 118

Tufte’s Chart Principles

DO NOT LIE!

Maximize Data-Ink Ratio Minimize Chart Junk

132
slide-119
SLIDE 119 http://skilfulminds.com/2011/04/05/exploring-the-usefulness-of-chartjunk-at-stl-ux-2011/ 133
slide-120
SLIDE 120

Please…

135
slide-121
SLIDE 121

No pie charts. No 2.5D charts.

136
slide-122
SLIDE 122 137
slide-123
SLIDE 123

37 36 24 2 1

138
slide-124
SLIDE 124

5 10 15 20 25 30 35 40

139
slide-125
SLIDE 125 140
slide-126
SLIDE 126

PLEASE DON’T EVER DO THIS!

141
slide-127
SLIDE 127

10 20 30 40

142
slide-128
SLIDE 128

Two times to use a pie chart…

143
slide-129
SLIDE 129

50-50

144
slide-130
SLIDE 130

75-25

145
slide-131
SLIDE 131

But otherwise…

146
slide-132
SLIDE 132

Barcharts, scatterplots, and line charts are really effective for quantitative data

20 40 20 40 20 40 20 40 20 40

147
slide-133
SLIDE 133

Anyone else bored by my color choices?

148
slide-134
SLIDE 134

In fact, grayscale can be risky…

149
slide-135
SLIDE 135

In fact, grayscale can be risky…

150
slide-136
SLIDE 136

Color is Powerful

151
slide-137
SLIDE 137

Call attention to information Increase appeal Increase memorability Another dimension to work with

Color

152
slide-138
SLIDE 138

How many of you have heard of RGB?

153
slide-139
SLIDE 139 154
slide-140
SLIDE 140

We see in RGB, but we don’t interpret in RGB…

155
slide-141
SLIDE 141

How many have heard of HSV?

156
slide-142
SLIDE 142

HSV Color Model

Hue/“Color” Saturation/Chroma Value/Lightness

157
slide-143
SLIDE 143 158
slide-144
SLIDE 144

Hue

Post & Greene, 1986

159
slide-145
SLIDE 145

Hue

http://blog.xkcd.com/2010/05/03/color-survey-results/ 160
slide-146
SLIDE 146

Hue and Colorblindness

10% of males and 1% of females are Red-Green Colorblind

162
slide-147
SLIDE 147 163
slide-148
SLIDE 148 168 http://viz.wtf/post/98981561686/ht-matthewbgilmore-noaas-new-weather-modelling
slide-149
SLIDE 149

Color and Quantitative Data

Gray scale Single sequence part spectral scale Full spectral scale Single sequence single hue scale Double-ended multiple hue scale 169
slide-150
SLIDE 150

Color and Quantitative Data

Can you order these (lowàhi)?

170
slide-151
SLIDE 151 171

us

Binary Diverging Categorical Sequential Categorical Categorical

http://www.personal.psu.edu/faculty/c/a/cab38/ColorSch/Schemes.html via Munzner
slide-152
SLIDE 152

Color Scales

Color Brewer

http://colorbrewer2.org/

172
slide-153
SLIDE 153

Overview Zoom+Filter Details on Demand

Shneiderman Mantra (Information-Seeking Mantra)

173
slide-154
SLIDE 154 174 http://www.babynamewizard.com/voyager
slide-155
SLIDE 155 175
slide-156
SLIDE 156

http://visual.ly/every-single-death-game-thrones-series

176
slide-157
SLIDE 157

Where to learn more?

185
slide-158
SLIDE 158

CS 7450 Information Visualization Every Fall

186
slide-159
SLIDE 159

Visualization @GeorgiaTech

vis.gatech.edu

187
slide-160
SLIDE 160

How to Make Good Charts

  • Edward Tufte’s One-Day Workshop

– http://www.edwardtufte.com/tufte/courses

  • Edward Tufte, Visual Display of Quantitative

Information

– http://www.edwardtufte.com/tufte/books_vdqi

  • Stephen Few, Show Me the Numbers:

Designing Tables and Graphs to Enlighten

– http://www.amazon.com/Show-Me-Numbers- Designing- Enlighten/dp/0970601972/ref=la_B001H6IQ5M_1_ 2?s=books&ie=UTF8&qid=1385050724&sr=1-2

188
slide-161
SLIDE 161

Visualization Theory “Books”

  • Tamara Munzner VIS Tutorial and Book

– http://www.cs.ubc.ca/~tmm/talks.html – http://www.cs.ubc.ca/~tmm/vadbook/

  • Colin Ware, Information Visualization: Perception for Design

– http://www.amazon.com/Information-Visualization-Perception-Interactive- Technologies/dp/1558605118

  • Stephen Few, Now You See It

– http://www.amazon.com/Now-You-See-Visualization- Quantitative/dp/0970601980/ref=pd_bxgy_b_img_z

  • Edward Tufte, Envisioning Information

– http://www.edwardtufte.com/tufte/books_ei

  • Edward Tufte, Visual Explanations

– http://www.edwardtufte.com/tufte/books_visex

  • Edward Tufte, Beautiful Evidence

– http://www.edwardtufte.com/tufte/books_be

  • Tamara Munzner, Visualization Analysis & Design

– http://www.amazon.com/Visualization-Analysis-Design-AK- Peters/dp/1466508914

189
slide-162
SLIDE 162

Perception and Color Websites

  • Chris Healy, NC State

– http://www.csc.ncsu.edu/faculty/healey/PP/index.h tml

  • Color Brewer

– http://colorbrewer2.org/

  • Maureen C. Stone (Color Links, Blog,

Workshops)

– http://www.stonesc.com/color/index.htm

  • Subtleties of Color by Robert Simmon of

NASA

– http://blog.visual.ly/subtleties-of-color/

190
slide-163
SLIDE 163

Visualization Blogs

  • Flowing Data by Nathan Yau

– http://flowingdata.com/

  • Information Aesthetics by Andrew Vande Moere

– http://infosthetics.com/

  • Information is Beautiful by David McCandless

– http://www.informationisbeautiful.net/

  • Visual.ly Blog

– http://blog.visual.ly/

  • Indexed Comic by Jessica Hagy

– http://thisisindexed.com/

191
slide-164
SLIDE 164

Infographics

Visual.ly/view

(wtfviz.net)

192
slide-165
SLIDE 165

Thanks!

Chad Stolper

chadstolper@gatech.edu

193
slide-166
SLIDE 166

Questions?

Chad Stolper

chadstolper@gatech.edu

194 thisisindexed.com Jessica Hagy