Information Visualization Crash Course
Chad Stolper Assistant Professor Southwestern University
(graduated from Georgia Tech CS PhD)
1
(AKA Information Visualization 101)
Information Visualization Crash Course (AKA Information - - PowerPoint PPT Presentation
Information Visualization Crash Course (AKA Information Visualization 101) Chad Stolper Assistant Professor Southwestern University (graduated from Georgia Tech CS PhD) 1 What is Infovis? Why is it Important? Human Perception Chart Basics
Information Visualization Crash Course
Chad Stolper Assistant Professor Southwestern University
(graduated from Georgia Tech CS PhD)
1
(AKA Information Visualization 101)
What is Infovis? Why is it Important? Human Perception Chart Basics
(If Time, Some Color Theory)
The Shneiderman Mantra Where to Learn More
2
What is Information Visualization?
3
Information Visualization
“The use of co compu puter er-supported, in interact eractiv ive, vi visual representations of abstract data to am amplif plify co cognit itio ion.” Card, Mackinlay, and Shneiderman 1999
4
Co Communication Ex Expl ploratory Data Ana nalysis (EDA)
5
6
(gone wrong)
7
8 X
Edward Tufte
An American statistician and professor emeritus of political science, statistics, and computer science at Yale University. He is noted for his writings
as a pioneer in the field of data visualization.
Sp Space Sh Shuttle le C Challe llenger
January 28, 1986
9
Morning Temperature: 31°F
10
11
Tufte, E. R. (2012). Visual explanations: images and quantities, evidence and narrative. Cheshire, CT: Graphics Press.13
Video originally from: http://www.FeynmanPhysicsLectures.com
Most Watched Science Experiment
Richard Feynman, Physics Nobel laureate explained how rubber became rigid in cold temperate YouTube video: https://youtu.be/6Rwcbsn19c0
How did this happen?
14
15
Tufte, E. R. (2012). Visual explanations: images and quantities, evidence and narrative. Cheshire, CT: Graphics Press.Engineers at Morton Thiokol, the rocket maker, presented on the day before and recommended not to launch.
19
24
25
26
27
28
Visualization can help with that – com communicat cate e ideas eas an and insigh ghts.
29
30
http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html
Visualization can also help with Ex Expl ploratory Data Ana nalysis (EDA)
But But why hy do do you u ne need d to ex explor
e dat ata a at at al all???
31
“There are three kinds of lies: lies, damned lies, and statistics.”
33
Mystery Data Set
34
Mystery Data Set
Pr Proper erty Va Value mean( x ) 9 variance ( x ) 11 mean( y ) 7.5 variance ( y ) 4.122 correlation ( x,y ) 0.816 Linear Regression Line y = 3 + 0.5x
35
36
37
38
39
Anscombe’s Quartet
40
https://en.wikipedia.org/wiki/Anscombe%27s_quartet
Anscombe’s Quartet Sanity Checking Models Outlier Detection
41
Data visualization leverages hum human n pe percept ption
43
Name the five senses.
44
45
Sense Sense Ba Bandwidth (bi bits/ s/sec) sec) Sight 10,000,000 Touch 1,000,000 Hearing 100,000 Smell 100,000 Taste 1,000
http://www.britannica.com/EBchecked/topic/287907/information-theory/214958/Physiology
A (Simple) Model
46
A (Simple) Model of Human Perception
47
Parallel detection of basic features into an iconic store Serial processing of
spatial layout
Stage 1 Stage 2
Stage 1: Pre-Attentive Processing
(Fleeting = lasting for a short time)
48
Stage 2: Serial Processing Relatively Slow (Incorporates Memory) Manual
49
Stage 1: Pre-Attentive Processing
The eye moves every 200ms (so this processing occurs every 200ms-250ms)
50
Example
1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686
51
Example
1281768756138976546984506985604982826762 9809858458224509856458945098450980943585 9091030209905959595772564675050678904567 8845789809821677654876364908560912949686
52
A few more examples from
53
54
Left Side Right Side
Raise your hand if a RE RED DOT is present…
(On the left or on the right?)
55
56
57
Color (hue) is pre-attentively processed.
58
Raise your hand if a RED DOT is present…
59
60
61
Shape is pre-attentively processed.
62
Determine if a RED DOT is present…
63
64
65
Hue and shape together are NOT pre-attentively processed.
66
Pre-Attentive Processing
67
Stephen Few “Now You See It”
68
Pre-Attentive à Cognitive
69
Gestalt Psychology
Berlin, Early 1900s
70
Gestalt Psychology
Goal was to understand pattern perception
Gestalt (German) = “seeing the whole picture all at once” instead of a collection of parts
Identified 8 “Laws of Grouping”
71
http://study.com/academy/lesson/gestalt-psychology-definition-principles-quiz.html
Gestalt Psychology
72
How many groups are there?
73
74
Proximity
75
How many groups are there?
76
77
Similarity
78
How many shapes are there?
79
80
Closure
81
How many items are there?
82
83
Symmetry
84
How many sets are there?
85
86
87
Common Fate
How many objects are there?
88
89
Continuity
90
How many objects are there?
91
92
Good Gestalt
93
What is this word?
94
95
Past Experience
96
Pre-Attentive Processing Gestalt Laws
101
Detect Quickly
102
Detect quickly does NOT mean
Id Ideally you want both.
103
104
Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.
Positions Rectangular areas
(aligned or in a treemap)
Angles Circular areas
105
Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design.Heer and Bostock. Proc ACM Conf. Human Factors in Computing Systems (CHI) 2010, p. 203–212.
Crowdsourced Results
1.0 1.5 2.0 2.5 3.0 T1 T2 T3 T4 T5 T6 T7 T8 T9
Log Error
Positions Rectangular areas (aligned or in a treemap) Angles Circular areasAutomating the Design of Graphical Presentations
l125
More accurate Less accurate
I I
Position
IMll
1 I
Length
F-l
Iha I
I0.I
I I
Volume
rl
l¶kJ
Color
mot
Shown)
perceptual tasks. Higher tasks are accom- plished more accurately than lower tasks. Cleveland and McGill empirically verified the basic properties of this ranking. Quantitative Ordinal Nominal Position Position Color Saturation Position Color Hue Texture Connection Containment Density Color Saturation Color Saturation Shape Length Angle Slope Area Volume
types of data.
An example analysis for area perception is shown in Figure 16. The top line shows that a series of decreasing areas can be used to encode a tenfold quantitative
randomly, making it more difficult to judge the relative sizes of different areas accurately (hence, area is ranked fifth in Figure 14). Nevertheless, small mis- judgments about the size of an area only leads to small misperceptions about the corresponding quantitative value that is encoded. The middle line shows that area can encode three ordinal values. However, one must be careful to make sure
ACM Transactions
1986.
Mackinlay, 1986
106
Stephen Few “Now You See It”
107
What does this tell us?
108
Barcharts, scatterplots, and line charts are really effective for quantitative data
20 40 20 40 20 40 20 40 20 40
109
(and for statistical distributions) Tukey Box Plots
113
Median Outliers Largest < Q3 + 1.5 IQR Smallest > Q1 - 1.5 IQR Largest < Q3 Smallest > Q1
114
115
Tufte’s Chart Principles
116
Edward Tufte
Tufte’s Chart Principles
Maximize Data-Ink Ratio Minimize Chart Junk
119
Tufte’s Chart Principles
Maximize Data-Ink Ratio Minimize Chart Junk
120
121
122
“Cumulative”
http://www.perceptualedge.com/blog/?p=790
123
124
http://xkcd.com/1138/
Tufte’s Chart Principles
Maximize Data-Ink Ratio Minimize Chart Junk
126
http://skilfulminds.com/2011/04/05/exploring-the-usefulness-of-chartjunk-at-stl-ux-2011/ 127
128
Please…
130
No pie charts. No 2.5D charts.
131
132
37 37 36 36 24 24
2 1
133
5 10 15 20 25 30 35 40
134
135
136
10 20 30 40
137
But otherwise…
141
Barcharts, scatterplots, and line charts are re really eff ffective for quantitative data
20 40 20 40 20 40 20 40 20 40
142
Anyone else bored by my color choices?
143
In fact, grayscale can be risky…
144
In fact, grayscale can be risky…
145
Color is Powerful
146
Call attention to information Increase appeal Increase memorability Another dimension to work with
Color
147
148
Have you heard of RGB?
RGB color model. (2017, November 20). Retrieved December 01, 2017, from https://en.wikipedia.org/wiki/RGB_color_modelAdditive color model: colors create by mixing red, green, blue light
We see in RGB, but we don’t interpret in RGB…
149
150
Hue Lightness Saturation
Source: color picker in Affinity Designer
HSV Color Model
Hue
Post & Greene, 1986
151
Hue
http://blog.xkcd.com/2010/05/03/color-survey-results/
152
Hue and Colorblindness
10% of males and 1% of females are Red-Green Colorblind
153
154
155
http://viz.wtf/post/98981561686/ht-matthewbgilmore-noaas-new-weather-modelling
Color and Quantitative Data
Can you order these (lowàhi)?
157
158
us
Binary Diverging Categorical Sequential Categorical Categorical
http://www.personal.psu.edu/faculty/c/a/cab38/ColorSch/Schemes.html via Munzner
159
Color Brewer for Picking Color Scales
COLORBREWER 2.0. (n.d.). Retrieved December 01, 2017, from http://colorbrewer2.org/Overview Zoom+Filter Details on Demand
Shneiderman Mantra (Information-Seeking Mantra)
160
https://www.mat.ucsb.edu/g.legrady/academic/courses/11w259/schneiderman.pdf
161
http://visual.ly/every-single-death-game-thrones-series
162
163
http://www.babynamewizard.com/voyager
Where to learn more?
172
CS 7450 Information Visualization Every Fall
173
Visualization @GeorgiaTech
174
How to Make Good Charts
– http://www.edwardtufte.com/tufte/courses
Information
– http://www.edwardtufte.com/tufte/books_vdqi
Designing Tables and Graphs to Enlighten
– http://www.amazon.com/Show-Me-Numbers- Designing- Enlighten/dp/0970601972/ref=la_B001H6IQ5M_1_ 2?s=books&ie=UTF8&qid=1385050724&sr=1-2
175
Visualization Theory “Books”
– http://www.cs.ubc.ca/~tmm/talks.html – http://www.cs.ubc.ca/~tmm/vadbook/
– http://www.amazon.com/Information-Visualization-Perception-Interactive- Technologies/dp/1558605118
– http://www.amazon.com/Now-You-See-Visualization- Quantitative/dp/0970601980/ref=pd_bxgy_b_img_z
– http://www.edwardtufte.com/tufte/books_ei
– http://www.edwardtufte.com/tufte/books_visex
– http://www.edwardtufte.com/tufte/books_be
– http://www.amazon.com/Visualization-Analysis-Design-AK- Peters/dp/1466508914
176
Perception and Color Websites
– http://www.csc.ncsu.edu/faculty/healey/PP/index.h tml
– http://colorbrewer2.org/
Workshops)
– http://www.stonesc.com/color/index.htm
NASA
– http://blog.visual.ly/subtleties-of-color/
177
Visualization Blogs
– http://flowingdata.com/
– http://infosthetics.com/
– http://www.informationisbeautiful.net/
– http://blog.visual.ly/
– http://thisisindexed.com/
178
Infographics
(wtfviz.net)
179
Thanks!
Chad Stolper
chadstolper@gatech.edu
180
Questions?
Chad Stolper
chadstolper@gatech.edu
181
thisisindexed.com Jessica Hagy