1
Using Space Effectively
Ma Maneesh Agrawala
CS 448B: Visualization Fall 2020
1 2
Using Space Effectively Ma Maneesh Agrawala CS 448B: Visualization - - PDF document
Using Space Effectively Ma Maneesh Agrawala CS 448B: Visualization Fall 2020 1 2 1 Last Time: EDA 3 Data Wrangling One often needs to manipulate data prior to analysis. Tasks include reformatting, cleaning, quality assessment, and
1 2
3
4
5
6
13
14
Quarter + Product Type = {(Qtr1),(Qtr2),(Qtr3),(Qtr4)} + {(Coffee), (Espresso)} = {(Qtr1),(Qtr2),(Qtr3),(Qtr4),(Coffee),(Espresso)} Profit + Sales = {(Profit[-310,620]),(Sales[0,1000])} 15
Quarter x Product Type = {(Qtr1,Coffee), (Qtr1, Tea), (Qtr2, Coffee), (Qtr2, Tea), (Qtr3, Coffee), (Qtr3, Tea), (Qtr4, Coffee), (Qtr4,Tea)} Product Type x Profit = 16
17
18
19
20
21
22
Step 1: Pick domain & data Step 2: Pose questions Step 3: Profile data Iterate as needed
Interact with data Refine questions
Screenshots of most insightful views (10+) Include titles and captions for each view
23
26
27
28
Government payrolls in 1937 [Huff 93]
29
Yearly CO2 concentrations [Cleveland 85]
30
31
Sim Simplicit licity - numbers are multiples of 10, 5, 2 Co Coverage - ticks near the ends of the data Den Density - not too many, nor too few Leg Legibi bility - whitespace, horizontal text, size 32
33
34
Well marked scale break [Cleveland 85] Poor scale break [Cleveland 85]
35
[Cleveland 85]
36
I
Log scale - easy comparisons of all data
I
Scale break – more difficult to compare across break [Cleveland 85]
37
MSFT MSFT
10 20 30 60 40 50 10 20 30 60 40 50
38
I
Absolute change
I
Small fluctuations
I
Percent change
d(10,20) = d(30,60)
MSFT MSFT
10 20 30 60 40 50 10 20 30 60 40 50
39
40
41
10 1 100 1 2
1 2 1 10 100
44
45
William S. Cleveland The Elements of Graphing Data
46
William S. Cleveland The Elements of Graphing Data
47
48
[Talbot et al, 2011]
54 55
CO2 Measurements William S. Cleveland Visualizing Data
Trends may occur at different scales! Apply banking to the original data or to fitted trend lines. [Heer & Agrawala ’06]
63
75
[The Elements of Graphing Data. Cleveland 94]
76
[The Elements of Graphing Data. Cleveland 94]
77
[The Elements of Graphing Data. Cleveland 94]
78
[The Elements of Graphing Data. Cleveland 94]
79
[Cleveland 85]
80
I Plot vertical distance from best fit curve I Residual graph shows accuracy of fit
[Cleveland 85]
81
89
[Becker, Cleveland, and Shyu 96]
90
Panel variables
type, yield
Condition variables
location, year
[Becker, Cleveland, and Shyu 96]
91
Alphabetical ordering Main-effects ordering
92
93
94
95
96
Tehnolemn Timisoara Slide Rule Archive
98 99
Johannes Lambert used graphs to study the rate of water evaporation as function of temperature [from Tufte 83]
101 102
126