CS 171: Visualization
Process & Visual Variables
Hanspeter Pfister pfister@seas.harvard.edu
CS 171: Visualization Process & Visual Variables Hanspeter - - PowerPoint PPT Presentation
CS 171: Visualization Process & Visual Variables Hanspeter Pfister pfister@seas.harvard.edu This Week Friday lab 10:30-11 am in MD G115 HW1 due today, group reflection due Monday Readings for next week Chapter 1 Group
Hanspeter Pfister pfister@seas.harvard.edu
Chapter 1
improvements
“Well-designed presentations of interesting data are a matter of substance, of statistics, and of design.”
Washington Post, 2012
at UBC, Canada
Center, Compaq Research
InfoVis
target translate design implement validate evaluate user-centered design usability engineering participatory design
target translate design implement validate evaluate user-centered design usability engineering participatory design
Fellow at Harvard
and molecular biology data
target translate design implement validate choose a specific domain define research question(s) find & clean the data
target translate design implement validate formulate data analysis tasks exploratory data analysis transform & summarize data
Same mean, variance, correlation coefficient, and linear regression line
http://upload.wikimedia.org/wikipedia/commons/b/b6/Anscombe.svg
gene expression
t1 g1 g2 g3 g4 g5 g6 g7 g8 0.2 1.0
1.0
t1 t2 g1 g2 g3 g4 g5 g6 g7 g8 0.2 0.4 1.0 0.0
0.8 1.0 0.0
0.8
0.5
0.0 t1 t2 t3 t4 g1 g2 g3 g4 g5 g6 g7 g8 0.2 0.4 1.0 1 1.0 0.0 0.0 0.0
0.8 1.0 1 1.0 0.0 0.2 0.5
0.8 0.5
0.5 0.8
0.4
0.0 0.0
t1 t2 t3 t4 t5 g1 g2 g3 g4 g5 g6 g7 g8 0.2 0.4 1.0 1.0 1.0 1.0 0.0 0.0 0.0 1.0
0.8 1.0 1.0 0.8 1.0 0.0 0.2 0.5 1.0
0.8 0.5
0.5 0.8
0.4
0.0 0.0
t1 t2 t3 t4 t5 t6 g1 g2 g3 g4 g5 g6 g7 g8 0.2 0.4 1.0 1.0 1.0 1.0 1.0 0.0 0.0 0.0 1.0 0.8
0.8 1.0 1.0 0.8 0.2 1.0 0.0 0.2 0.5 1.0 0.2
0.8 0.5
0.5 0.8
0.5
0.4
0.0 0.0
t1 t2 t3 t4 t5 t6 g1 g2 g3 g4 g5 g6 g7 g8 0.2 0.4 1.0 1.0 1.0 1.0 1.0 0.0 0.0 0.0 1.0 0.8
0.8 1.0 1.0 0.8 0.2 1.0 0.0 0.2 0.5 1.0 0.2
0.8 0.5
0.5 0.8
0.5
0.4
0.0 0.0
t1 t2 t3 t4 t5 t6 g1 m1 g2 m2 g3 m3 g7 g8 0.2 0.4 1.0 1.0 1.0 1.0 1.0 0.0 0.0 0.0 1.0 0.8
0.8 1.0 1.0 0.8 0.2 1.0 0.0 0.2 0.5 1.0 0.2
0.8 0.5
0.5 0.8
0.5
0.4
0.0 0.0
tca cycle glycolysis
similarity scores
s1 s2 s3 s4 s5 s6 t1 t2 t3 t4 t5 t6 g1 0.2 0.4 1.0 1.0 1.0 1.0 t1 t2 t3 t4 t5 t6 g1 0.2 0.4 1.0 1.0 1.0 1.0 t1 t2 t3 t4 t5 t6 g1 0.2 0.4 1.0 1.0 1.0 1.0
, , , ...
s1 s2 s3
= 0.83
for a gene/metabolite
across species
Spearman, others
aggregate
metabolic pathways
140 metabolites
called metabolites
phylogeny
relationship
tree
target translate design implement validate design visual encodings design interactions sketch many ideas!
Blake Walsh, Gabriel Trevino, Antony Bett
Bang Wong
target translate design implement validate use code “sketches” define data structures find efficient algorithms
target translate design implement validate what? how? 80% 20%
target translate design implement validate is the abstraction right? does it support the tasks? does it provide new insights?
Visualization Design and Validation
target translate design implement
Nominal
Categorical Qualitative
Ordinal Interval Ratio
On the theory of scales and measurements [S. Stevens, 46]
Are = or ≠ to other values Apples, Oranges, Bananas,...
Obey a < relationship Small, medium, large
Can do arithmetic on them 10 inches, 23 inches, etc.
Dates: Jan 19; Location: (Lat, Long) Only differences (i.e., intervals) can be compared
Measurements: Length, Mass, Temp, ... Origin is meaningful, can measure ratios & proportions
On the theory of scales and measurements [S. Stevens, 46]
Item
Attribute
1 = Quantitative 2 = Nominal 3 = Ordinal
1 = Quantitative 2 = Nominal 3 = Ordinal
Nominal /Ordinal = Dimensions
Describe the data, independent variables
Quantitative = Measures
Numbers to be analyzed, dependent variables
Set with operations, e.g., floats with +, -, /, *
Includes semantics, supports reasoning Data Conceptual 1D floats temperature 3D vector of floats space
32.5, 54.0, -17.3, … (floats)
Temperature
Continuous to 4 significant figures (Q) Hot, warm, cold (O) Burned vs. Not burned (N)
Based on slide from Munzner
[1918-2010]
[1967]
for visual encodings
Semiology of Graphics [J. Bertin, 67]
Points Lines Areas Marks
Position Size (Grey)Value Texture Color Orientation Shape
Channels
Nominal Ordinal Quantitative Position
✔ ✔ ✔
Size
✔ ✔ ~
(Grey)Value
✔ ✔ ~
Texture
✔ ~ ✖
Color
✔ ✖ ✖
Orientation
✔ ✖ ✖
Shape
✔ ✖ ✖ ✔ = Good ~ = OK ✖ = Bad
[Mackinlay, Automating the Design of Graphical Presentations of Relational Information, 1986]
Decreasing
[“Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases” Chris Stolte, Diane Tang, and Pat Hanrahan, 2002]
stimulus magnitude increases
Weber fraction (constant!) Just-Noticeable Difference Base intensity
∆I I = k ∆I = k
∆I I = k ∆I = k
∆I I = k ∆I = k
is logarithmic
Sensation Intensity
Based on slide from Mazur
From Wilkinson 99, based on Stevens 61
Underestimating Overestimating
0.5 0.6 0.7 1.3 1.5 3.5 1.0 Electric
Most Efficient Least Efficient
VisualizingEconomics.com
Quantitative Ordinal Nominal
VisualizingEconomics.com
VisualizingEconomics.com
Rogowitz and Treinish, Why should engineers and scientists be worried about color?
Rogowitz and Treinish, Why should engineers and scientists be worried about color?, 1996
Southeastern United States and Gulf of Mexico zero crossing not explicit
hard to order easy to order creates artifacts lower resolution Borland 2007
Sunday Star Times, 2012
Peter and Maria Hoey (Source: Tommy McCall/Environmental Law Institute)
woodgears.ca