http://www.cs.ubc.ca/~tmm/courses/436V-20
Information Visualization Data Abstraction
Tamara Munzner Department of Computer Science University of British Columbia
Lect 2, 9 Jan 2020
Information Visualization Data Abstraction Tamara Munzner - - PowerPoint PPT Presentation
Information Visualization Data Abstraction Tamara Munzner Department of Computer Science University of British Columbia Lect 2, 9 Jan 2020 http://www.cs.ubc.ca/~tmm/courses/436V-20 Nested Model 2 How to evaluate a visualization: So many
http://www.cs.ubc.ca/~tmm/courses/436V-20
Lect 2, 9 Jan 2020
2
– quant: system performance, memory
– quant: (human) time and error rates, preferences – qual: behavior/strategy observations
– quant: usage logs – qual: interviews with users, case studies, observations
– quant: metrics computed on result images – qual: consider what structure is visible in result images
– qual: perceptual principles, best practices
3
– who are the target users?
– translate from specifics of domain to vocabulary of visualization
– often must transform data, guided by task
– how is it shown?
– efficient computation
4
[A Nested Model of Visualization Design and Validation.
TVCG 15(6):921-928, 2009 (Proc. InfoVis 2009). ]
algorithm idiom abstraction domain
[A Multi-Level Typology of Abstract Visualization Tasks Brehmer and Munzner. IEEE TVCG 19(12):2376-2385, 2013 (Proc. InfoVis 2013). ]
5
Domain situation You misunderstood their needs You’re showing them the wrong thing Visual encoding/interaction idiom The way you show it doesn’t work Algorithm Your code is too slow Data/task abstraction
6
Domain situation Observe target users using existing tools Visual encoding/interaction idiom Justify design with respect to alternatives Algorithm Measure system time/memory Analyze computational complexity Observe target users after deployment ( ) Measure adoption Analyze results qualitatively Measure human time with lab experiment (lab study) Data/task abstraction
computer science design psychology anthropology/ ethnography anthropology/ ethnography problem-driven work technique-driven work
[A Nested Model of Visualization Design and
qual qual qual qual quant quant quant
7
Domain situation Observe target users using existing tools Visual encoding/interaction idiom Justify design with respect to alternatives Algorithm Measure system time/memory Analyze computational complexity Observe target users after deployment ( ) Measure adoption Analyze results qualitatively Measure human time with lab experiment (lab study) Data/task abstraction
[A Nested Model of Visualization Design and
lab studies can't confirm task abstraction benchmarks can't confirm design
8
– two points far from each other in 3D space? – two points close to each other in 2D space, with 15 links between them, and a weight of 100001 for the link? – something else??
– food shipment of produce (basil & pear) arrived in satisfactory condition on 7th day of month – Basil Point neighborhood of city had 7 inches of snow cleared by the Pear Creek Limited snow removal service – lab rat Basil made 7 attempts to find way through south section of maze, these trials used pear as reward food
9
10
11
–item, link, attribute, position, (grid) –different from data types in programming!
12
–eg patient, car, stock, city –"independent variable"
–eg height, blood pressure for patient –eg horsepower, make for car –"dependent variable"
13
item: person attributes: name, age, shirt size, fave fruit
–express relationship between two items –eg friendship on facebook, interaction between proteins
–spatial data: location in 2D or 3D –pixels in photo, voxels in MRI scan, latitude/longitude
–sampling strategy for continuous data
14
15
Tables
Attributes (columns) Items (rows) Cell containing value
Tables
Items Attributes
item: person attributes: name, age, shirt size, fave fruit
–one item per row –each column is attribute –cell holds value
–one item per row –each column is attribute –cell holds value for item-attribute pair –unique key (could be implicit)
16
Tables
Attributes (columns) Items (rows) Cell containing value
Tables
Items Attributes
item: person attributes: name, age, shirt size, fave fruit
17
18
–indexing based on multiple keys
19
Tables
Attributes (columns) Items (rows) Cell containing value
Tables
Items Attributes
Multidimensional Table
Value in cell
20
https://bl.ocks.org/jasondavies/1341281
–nodes (vertices) connected by links (edges) –tree is special case: no cycles
21
Tables
Attributes (columns) Items (rows) Cell containing value
Networks
Link Node (item)
Trees
Tables Networks & Trees
Items Attributes Items (nodes) Links Attributes
22
https://bost.ocks.org/mike/miserables/ https://observablehq.com/@d3/force-directed-graph http://atlas.cid.harvard.edu/explore/? tradeDirection=import&year=2012&product=726&country=undefined&red irected=true
23
Node em)
Fields (Continuous)
Attributes (columns) Value in cell
Cell Grid of positions
Spatial
Net Tables
Attributes (columns) Items (rows) Cell containing value
Networks
Link Node (item)
Trees
Tables Networks & Trees Fields
Items Attributes Items (nodes) Links Attributes Grids Positions Attributes
–eg temperature, pressure, wind velocity
24
Node em)
Fields (Continuous)
Attributes (columns) Value in cell
Cell Grid of positions
Spatial
Net
– eg temperature, pressure, wind velocity
– sampling where attributes are measured – interpolation how to model attributes elsewhere – grid types
25
26
scalar vector tensor
– eg temperature, pressure, wind velocity
– sampling where attributes are measured – interpolation how to model attributes elsewhere – grid types, tensors
27
Node em)
Fields (Continuous)
Attributes (columns) Value in cell
Cell Grid of positions
Geometry (Spatial)
Position
Spatial
Net Tables
Attributes (columns) Items (rows) Cell containing value
Networks
Link Node (item)
Trees
Tables Networks & Trees Fields Geometry
Items Attributes Items (nodes) Links Attributes Grids Positions Attributes Items Positions
–(volumes outside scope of class)
–graphics: geometry taken as given –vis: geometry is result of a design decision
28
29
Node em)
Fields (Continuous)
Attributes (columns) Value in cell
Cell Grid of positions
Geometry (Spatial)
Position
Spatial
Net Tables
Attributes (columns) Items (rows) Cell containing value
Networks
Link Node (item)
Trees
Tables Networks & Trees Fields Geometry Clusters, Sets, Lists
Items Attributes Items (nodes) Links Attributes Grids Positions Attributes Items Positions Items
–unique items, unordered
–ordered, duplicates possible
–groups of similar items
30
31
Data Types Items Attributes Links Positions Grids Data and Dataset Types Tables Networks & Trees Fields Geometry Clusters, Sets, Lists
Items Attributes Items (nodes) Links Attributes Grids Positions Attributes Items Positions Items
–compare equality –no implicit ordering
–ordinal
–quantitative
32
Attribute Types Categorical Ordered
Ordinal Quantitative
33
34
35
36
Attribute Types Ordering Direction Categorical Ordered
Ordinal Quantitative
Sequential Diverging Cyclic
Dataset Availability Static Dynamic
–space –time –others
37
–how many items in the dataset? –what is cardinality of each attribute?
–guided by understanding of task
38
–mathematical abstraction
–mental construction (semantics) –supports reasoning –typically based on understanding of tasks [stay tuned, next week]
–for transforming data if needed
39
–32.52, 54.06, -14.35, ...
–temperature
–continuous to 2 significant figures: quantitative
–hot, warm, cold: ordinal
–above freezing, below freezing: categorical
40
–simple change of type –acquire additional data –complex transformation
41
exports imports
Derived Data
trade balance = exports −imports trade balance
42
https:// data.cityofnewyork.us/ Environment/2018-Central- Park-Squirrel-Census- Squirrel-Data/vfnx-vebw https:// www.thesquirrelcensus.com/
Datasets
What?
Attributes Dataset Types Data Types Data and Dataset Types Tables
Attributes (columns) Items (rows) Cell containing value
Networks
Link Node (item)
Trees
Fields (Continuous) Geometry (Spatial)
Attributes (columns) Value in cell
Cell
Multidimensional Table
Value in cell
Items Attributes Links Positions Grids Attribute Types Ordering Direction Categorical Ordered
Ordinal Quantitative
Sequential Diverging Cyclic Tables Networks & Trees Fields Geometry Clusters, Sets, Lists
Items Attributes Items (nodes) Links Attributes Grids Positions Attributes Items Positions Items
Grid of positions Position
43
Dataset Availability Static Dynamic
–refresher only if you need it: JS/HTML [90 min] –Intro to HTML/CSS/SVG [35 min] –Intro to D3.js [45 min]
– due Wed Jan 15
44
45