CS171 Visualization Alexander Lex alex@seas.harvard.edu The - - PowerPoint PPT Presentation

cs171 visualization
SMART_READER_LITE
LIVE PREVIEW

CS171 Visualization Alexander Lex alex@seas.harvard.edu The - - PowerPoint PPT Presentation

CS171 Visualization Alexander Lex alex@seas.harvard.edu The Visualization Alphabet: Marks and Channels [xkcd] This Week Thursday: Task Abstraction, Validation Homework 1 due on Friday! Any more problems with private GitHub repositories?


slide-1
SLIDE 1

CS171 Visualization

Alexander Lex alex@seas.harvard.edu

[xkcd]

The Visualization Alphabet: Marks and Channels

slide-2
SLIDE 2

This Week

Thursday: Task Abstraction, Validation Homework 1 due on Friday! Any more problems with private GitHub repositories? Later today: Introduction to HW 2 Reading: D3, Chapter 12; VAD, Chapters 3&4

slide-3
SLIDE 3

Next Week

Lecture 7: Homework 2 Design Studio Lecture 8: Interaction
 Guest Lecture, Jean-Daniel Fekete (INRIA) Sections: D3 & JS: Data Structures, Layouts

slide-4
SLIDE 4

No Device Policy

No Computers, Tablets, Phones in lecture hall

except when used for exercises

Switch off, mute, flight mode Why?

It’s better to take notes by hand Notifications are designed to grab your attention

slide-5
SLIDE 5

Last Week

slide-6
SLIDE 6

Terms

Dataset Types

what can be visualized?

Data Types

fundamental units combinations make up Dataset Types

Tables

Attributes (columns) Items (rows) Cell containing value

Networks

Link Node (item)

Trees

Fields (Continuous)

Attributes (columns) Value in cell

Cell

Multidimensional Table

Value in cell

Grid of positions

Geometry (Spatial)

Position

Dataset Types

Data Types Items Attributes Links Positions Grids

slide-7
SLIDE 7

Tables

Flat Table

  • ne item per row

each column is attribute unique (implicit) key no duplicates

Multidimensional Table

indexing based on multiple keys

Item Values Keys Attributes

slide-8
SLIDE 8

Multidimensional Tables

Keys: Patients Keys: Genes

slide-9
SLIDE 9

Graphs/Networks

A graph G(V,E) consists of a set of vertices (nodes) V and a set of edges (links) E connecting these vertices. A tree is a graph with no ¡cycles

slide-10
SLIDE 10

Fields

Attribute values associated with cells Cell contains data from continuous domain

Temperature, pressure, wind velocity

Measured or simulated Sampling & Interpolation

Signal processing & stats

slide-11
SLIDE 11

Other Collections

Sets

Unique items, unordered

Lists

Ordered, duplicates allowed

Clusters

Groups of similar items

slide-12
SLIDE 12

Data Types

Categorical/Nominal (labels)

Operations: =, ≠

Ordinal (ordered)

Operations: =, ≠, >, <

Interval (location of zero arbitrary)

Operations: =, ≠, >, <, +, − (distance)

Ratio (zero fixed)

Operations: =, ≠, >, <, +, −,×, ÷ (proportions)

On the theory of scales and measurements [S. Stevens, 46]

slide-13
SLIDE 13

Item/Element/ (Independent) Variable

slide-14
SLIDE 14

Attribute/ Dimension/ (Dependent) Variable/ Feature

slide-15
SLIDE 15

Semantics

slide-16
SLIDE 16

Keys?

slide-17
SLIDE 17

Attribute Types?

slide-18
SLIDE 18

Categorical Ordinal Quantitative

slide-19
SLIDE 19

Design Critique

slide-20
SLIDE 20

Recalled Cars NY Times

http://goo.gl/82tE6b

slide-21
SLIDE 21

The Visualization Alphabet: Marks and Channels

slide-22
SLIDE 22

How can I visually represent two numbers, e.g.,

4 and 8

slide-23
SLIDE 23

Marks & Channels

Marks: represent items or links Channels: change appearance based on attribute Channel = Visual Variable

slide-24
SLIDE 24

Example: Homework 2

slide-25
SLIDE 25

Marks for Items

Basic geometric elements 3D mark: Volume, but rarely used

0D 2D 1D

slide-26
SLIDE 26

Marks for Links

Containment Connection

slide-27
SLIDE 27

Containment can be nested

[Riche & Dwyer, 2010]

slide-28
SLIDE 28

Channels (aka Visual Variables)

Control appearance proportional to or based on attributes

slide-29
SLIDE 29

Jacques Bertin

French cartographer [1918-2010] Semiology of Graphics [1967] Theoretical principles for visual encodings

slide-30
SLIDE 30

Bertin’s Visual Variables

Semiology of Graphics [J. Bertin, 67]

Points Lines Areas Marks:

Position Size (Grey)Value Texture Color Orientation Shape

slide-31
SLIDE 31

Using Marks and Channels

Mark: Line Channel: Length/Position 1 quantitative attribute 1 categorical attribute Adding Hue +1 categorical attr. Adding Size +1 quantitative attr. Mark: Point Channel: Position 2 quantitative attr.

slide-32
SLIDE 32

Redundant encoding

Length, Position and Value

slide-33
SLIDE 33

Good bar chart?

Rule: Use channel proportional to data!

slide-34
SLIDE 34

Types of Channels

Identity Channels What? Where? Shape Color (hue) Spatial region … Magnitude Channels How much? Position Length Saturation …

Categorical Data Ordinal & Quantitative Data

slide-35
SLIDE 35

Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes Spatial region Color hue Motion Shape Position on common scale Position on unaligned scale Length (1D size) Tilt/angle Area (2D size) Depth (3D position) Color luminance Color saturation Curvature Volume (3D size) Channels: Expressiveness Types and Efgectiveness Ranks

slide-36
SLIDE 36

What visual variables are used?

http://www.nytimes.com/interactive/2013/05/25/sunday-review/corporate-taxes.html

slide-37
SLIDE 37

What visual variables are used?

slide-38
SLIDE 38

Characteristics of Channels

Selective

Is a mark distinct from other marks? Can we make out the difference between two marks?

Associative

Does it support grouping?

Quantitative (Magnitude vs Identity Channels)

Can we quantify the difference between two marks?

slide-39
SLIDE 39

Characteristics of Channels

Order (Magnitude vs Identity)

Can we see a change in order?

Length

How many unique marks can we make?

slide-40
SLIDE 40

Position

Strongest visual variable Suitable for all data types Problems:

Sometimes not available (spatial data) Cluttering

Selective: yes Associative: yes Quantitative: yes Order: yes Length: fairly big

slide-41
SLIDE 41

Example: Scatterplot

slide-42
SLIDE 42

Position in 3D?

[Spotfire]

slide-43
SLIDE 43

Length & Size

Good for 1D, OK for 2D, Bad for 3D Easy to see whether one is bigger Aligned bars use position redundantly

For 1D length: Selective: yes Associative: yes Quantitative: yes Order: yes Length: high

slide-44
SLIDE 44

Example 2D Size: Bubbles

slide-45
SLIDE 45

Value/Luminance/Saturation

OK for quantitative data when length & size are used. Not very many shades recognizable

Selective: yes Associative: yes Quantitative: somewhat (with problems) Order: yes Length: limited

slide-46
SLIDE 46

Example: Diverging Value-Scale

slide-47
SLIDE 47

Color

Good for qualitative data (identity channel) Limited number of classes/length (~7-10!) Does not work for quantitative data! Lots of pitfalls! Be careful! My rule:

minimize color use for encoding data use for brushing

Selective: yes Associative: yes Quantitative: no Order: no Length: limited

< < ?????

slide-48
SLIDE 48

Cliff Mass

Color: Bad Example

slide-49
SLIDE 49

Color: Good Example

slide-50
SLIDE 50

Shape

Great to recognize many classes. No grouping, ordering.

Selective: yes Associative: limited Quantitative: no Order: no Length: vast

< < ?????

slide-51
SLIDE 51
slide-52
SLIDE 52

Chernoff Faces

Idea: use facial parameters to map quantitative data

Critique: https://eagereyes.org/criticism/chernoff-faces Does it work? Not really!

slide-53
SLIDE 53

More Channels

slide-54
SLIDE 54

Why are quantitative channels different?

S = sensation I = intensity

slide-55
SLIDE 55

Steven’s Power Law, 1961

From Wilkinson 99, based on Stevens 61

Electric

slide-56
SLIDE 56

How much longer?

A B

2x

slide-57
SLIDE 57

How much longer?

A B

4x

slide-58
SLIDE 58

How much steeper?

A B

~4x

slide-59
SLIDE 59

How much larger (area)?

A B

5x

slide-60
SLIDE 60

How much larger (area)?

A B

3x

slide-61
SLIDE 61

How much larger (diameter)?

A B

2x

slide-62
SLIDE 62

How much darker?

A B

2x

slide-63
SLIDE 63

Position, Length & Angle

slide-64
SLIDE 64

Other Factors Affecting Accuracy

Alignment Distractors Distance Common scale …

A B Unframed Aligned Framed Unaligned A B A B Unframed Unaligned

VS VS VS

slide-65
SLIDE 65

Cleveland / McGill, 1984

William S. Cleveland; Robert McGill , “Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods.” 1984

slide-66
SLIDE 66

Heer & Bostock, 2010

slide-67
SLIDE 67

Positions Rectangular areas

(aligned or in a treemap)

Angles Circular areas Cleveland & McGill’s Results Crowdsourced Results

1.0 3.0 1.5 2.5 2.0 Log Error 1.0 3.0 1.5 2.5 2.0 Log Error

slide-68
SLIDE 68

[Mackinlay, Automating the Design of Graphical Presentations of Relational Information, 1986]

Jock Mackinlay, 1986

Decreasing

slide-69
SLIDE 69

Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes Spatial region Color hue Motion Shape Position on common scale Position on unaligned scale Length (1D size) Tilt/angle Area (2D size) Depth (3D position) Color luminance Color saturation Curvature Volume (3D size) Channels: Expressiveness Types and Efgectiveness Ranks

slide-70
SLIDE 70

Separability of Attributes

Can we combine multiple visual variables?

  • T. Munzner,

Visualization Analysis and Design, 2014