Using Space Effectively
Maneesh Agrawala
CS 448B: Visualization Winter 2020
Last Time: EDA
Data Wrangling
One often needs to manipulate data prior to analysis. Tasks include reformatting, cleaning, quality assessment, and ...
Quarter + Product Type
= {(Qtr1),(Qtr2),(Qtr3),(Qtr4)} + {(Coffee),(Espresso)}
= {(Qtr1),(Qtr2),(Qtr3),(Qtr4),(Coffee),(Espresso)}

Profit + Sales = {(Profit[-310,620]),(Sales[0,1000])}

Quarter x Product Type
= {(Qtr1,Coffee),(Qtr1,Tea),(Qtr2,Coffee),(Qtr2,Tea),(Qtr3,Coffee),(Qtr3,Tea),(Qtr4,Coffee),(Qtr4,Tea)}

Product Type x Profit =
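The two operators above can be sketched in a few lines of Python. This is only an illustration, assuming each field is represented as a list of one-element tuples; the helper names `concat` and `cross` are hypothetical and are not taken from any particular tool.

```python
from itertools import product

def concat(a, b):
    """Concatenation (+): ordered union of two sets of tuples."""
    return list(a) + [t for t in b if t not in a]

def cross(a, b):
    """Cross (x): every tuple of a joined with every tuple of b."""
    return [ta + tb for ta, tb in product(a, b)]

# Field values as they appear in the cross example on the slide.
quarter = [("Qtr1",), ("Qtr2",), ("Qtr3",), ("Qtr4",)]
product_type = [("Coffee",), ("Tea",)]

print(concat(quarter, product_type))  # 6 one-element tuples
print(cross(quarter, product_type))   # 8 (quarter, product type) pairs
```

Concatenation places the members side by side on one axis, while cross nests one field inside the other, which is why its result has 4 x 2 = 8 entries.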
Step 1: Pick domain & data
Step 2: Pose questions
Step 3: Profile data
Iterate as needed
Interact with data
Refine questions
Screenshots of most insightful views (10+)
Include titles and captions for each view
Government payrolls in 1937 [Huff 93]
Yearly CO2 concentrations [Cleveland 85]
Simplicity - numbers are multiples of 10, 5, 2
Coverage - ticks near the ends of the data
Density - not too many, nor too few
Legibility - whitespace, horizontal text, size
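One common way to get ticks with these properties is a "nice numbers" heuristic in the style of Heckbert. The sketch below is an assumption for illustration, not necessarily the method the slides have in mind: it picks a step that is 1, 2, 5, or 10 times a power of ten (simplicity), extends the ticks to enclose the data (coverage), and uses a hypothetical `target_ticks` parameter to control density. Legibility is not addressed.

```python
import math

def nice_step(span, target_ticks=5):
    """Round the raw step to 1, 2, 5, or 10 times a power of ten."""
    raw = span / max(target_ticks - 1, 1)
    exponent = math.floor(math.log10(raw))
    fraction = raw / 10 ** exponent  # normally in [1, 10)
    if fraction < 1.5:
        nice = 1
    elif fraction < 3:
        nice = 2
    elif fraction < 7:
        nice = 5
    else:
        nice = 10
    return nice * 10 ** exponent

def nice_ticks(lo, hi, target_ticks=5):
    """Return tick values at a nice step that enclose [lo, hi]."""
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    step = nice_step(hi - lo, target_ticks)
    first = math.floor(lo / step) * step
    last = math.ceil(hi / step) * step
    count = round((last - first) / step)
    return [first + i * step for i in range(count + 1)]

print(nice_ticks(3, 97))  # [0, 20, 40, 60, 80, 100]
```

For data in [3, 97] the raw step of 23.5 is rounded to 20, so the ticks are multiples of 20 and land just outside both ends of the data.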
Well marked scale break [Cleveland 85]
Poor scale break [Cleveland 85]
[Cleveland 85]
Log scale - easy comparisons of all data
Scale break - more difficult to compare across break
[Cleveland 85]
[Figure: two MSFT charts, axis ticks 10 to 60]
Absolute change
Small fluctuations
Percent change: d(10,20) = d(30,60)
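The identity d(10,20) = d(30,60) can be checked numerically. This is a minimal sketch, assuming d denotes the distance between two values after a base-10 log transform; the function names are illustrative only.

```python
import math

def linear_distance(a, b):
    """Distance on a linear axis: the absolute change."""
    return abs(b - a)

def log_distance(a, b):
    """Distance on a log axis: depends only on the ratio b / a."""
    return abs(math.log10(b) - math.log10(a))

print(linear_distance(10, 20), linear_distance(30, 60))          # 10 30
print(math.isclose(log_distance(10, 20), log_distance(30, 60)))  # True
```

Both pairs represent a doubling (a 100% change), so they span the same length on a log axis even though the absolute changes differ by a factor of three.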
Yearly CO2 concentrations [Cleveland 85]
William S. Cleveland, The Elements of Graphing Data
Median-Absolute-Slope
Average-Absolute-Orientation
  Unweighted
  Weighted
Average-Absolute-Slope
Max-Orientation-Resolution
  Global (over all i, j s.t. i ≠ j)
  Local (over adjacent segments)
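As an illustration of the first criterion, here is a hedged sketch of median-absolute-slope banking. It assumes the goal is a median displayed slope of 1 (45 degrees): a segment with data slope s is drawn with slope s * (Rx / Ry) * (h / w), where Rx and Ry are the data ranges and w / h is the plot's aspect ratio, so setting the median to 1 gives w / h = median|s| * Rx / Ry. The function name is hypothetical.

```python
from statistics import median

def bank_median_absolute_slope(xs, ys):
    """Return the width/height aspect ratio that makes the median
    absolute slope of the drawn line segments equal to 1 (45 degrees)."""
    slopes = [
        abs((y1 - y0) / (x1 - x0))
        for x0, x1, y0, y1 in zip(xs, xs[1:], ys, ys[1:])
        if x1 != x0  # skip vertical segments
    ]
    rx = max(xs) - min(xs)
    ry = max(ys) - min(ys)
    if not slopes or ry == 0:
        raise ValueError("need at least one non-vertical segment and a non-zero y range")
    return median(slopes) * rx / ry

# A zigzag with unit slopes over x in [0, 3] and y in [0, 1].
print(bank_median_absolute_slope([0, 1, 2, 3], [0, 1, 0, 1]))  # 3.0
```

In the zigzag example the plot should be three times as wide as it is tall, at which point every segment is drawn at 45 degrees.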
[Talbot et al, 2011]
[Talbot et al, 2011]
CO2 Measurements [William S. Cleveland, Visualizing Data]
Trends may occur at different scales! Apply banking to the original data or to fitted trend lines. [Heer & Agrawala ’06]
[The Elements of Graphing Data. Cleveland 94]
[Cleveland 85]
Plot vertical distance from best fit curve
Residual graph shows accuracy of fit
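A residual is the vertical distance between a data point and the fitted curve. The sketch below assumes, for simplicity, that the best fit curve is an ordinary least-squares straight line; the slides do not say which fit is used, and the function names are illustrative.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def residuals(xs, ys):
    """Vertical distance of each point from the fitted line."""
    slope, intercept = fit_line(xs, ys)
    return [y - (slope * x + intercept) for x, y in zip(xs, ys)]

# Quadratic data fitted with a line: the residuals are not random.
print(residuals([0, 1, 2, 3], [0, 1, 4, 9]))  # [1.0, -1.0, -1.0, 1.0]
```

The systematic positive, negative, negative, positive pattern in the residuals signals that a straight line is the wrong model for this data, which is hard to see from the fitted plot alone.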
[Cleveland 85]
Tehnolemn Timisoara Slide Rule Archive
Johannes Lambert used graphs to study the rate of water evaporation as a function of temperature [from Tufte 83]
[From Cartography, Dent]
% voted Democrat, % voted Republican [http://www-personal.umich.edu/~mejn/election/]
% voted Democrat, % voted Republican [http://www-personal.umich.edu/~mejn/election/]
http://www-personal.umich.edu/~mejn/election/
[Cleveland and McGill 84]
[Cleveland and McGill 84]
American population [van Kreveld and Speckmann 04]
Native American population [van Kreveld and Speckmann 04]
http://www.ncgia.ucsb.edu/projects/Cartogram_Central/types.html
[From Cartography, Dent]
http://www.thetube.com/content/history/map.asp
[Figure panels: Distorted, Undistorted]
Hand-drawn route map vs. LineDrive route map [Agrawala & Stolte 2001]