Will Start at 2:35 pm Data Visualization The Good, the Bad, the - - PowerPoint PPT Presentation

will start at 2 35 pm data visualization the good the bad
SMART_READER_LITE
LIVE PREVIEW

Will Start at 2:35 pm Data Visualization The Good, the Bad, the - - PowerPoint PPT Presentation

Will Start at 2:35 pm Data Visualization The Good, the Bad, the Weird Nam Wook Kim Mini-Courses January @ GSAS 2018 Goal Rules of thumbs to critique visualization design 45 Ways to Communicate Two Quantities [Santiago Ortiz 2012]


slide-1
SLIDE 1

Will Start at 2:35 pm

slide-2
SLIDE 2

Data Visualization The Good, the Bad, the Weird

Nam Wook Kim Mini-Courses — January @ GSAS 2018

slide-3
SLIDE 3

Goal

Rules of thumbs
 to critique visualization design

slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11

45 Ways to Communicate Two Quantities

[Santiago Ortiz 2012]

slide-12
SLIDE 12

Which one is the best and why?

slide-13
SLIDE 13

There are numerous ways to draw even just two numbers…

slide-14
SLIDE 14

Is there an ideal way to visualize a data set?

slide-15
SLIDE 15

It depends on

Data types e.g., table, network, spatial, temporal Context of the data Tasks to perform e.g., identify trends, compare values Questions to answer Messages to deliver

slide-16
SLIDE 16

But, is there at least a guide for visualization design?

slide-17
SLIDE 17

Edward Tufte’s Design Principles

slide-18
SLIDE 18

Graphical Integrity

slide-19
SLIDE 19
slide-20
SLIDE 20

Flowing data

slide-21
SLIDE 21

Flowing data

Bar Chart should have a zero-baseline. Because you are comparing the lengths

slide-22
SLIDE 22

Flowing data

Or comparing positions from the baseline.

slide-23
SLIDE 23

Do we always need a zero-baseline?

slide-24
SLIDE 24
slide-25
SLIDE 25

Flowing data

Line Chart may not have a zero-baseline. Because you are comparing the angle

slide-26
SLIDE 26

Line chart’s aspect ratios can matter too.

https://eagereyes.org/basics/banking-45-degrees

slide-27
SLIDE 27

Line chart’s aspect ratios can matter too.

A rule of thumb is banking to 45 degrees to minimize errors in visual judgments of slope ratios.

slide-28
SLIDE 28

Comes down to the message you want to deliver

slide-29
SLIDE 29
slide-30
SLIDE 30

RANDOM QUARTER

slide-31
SLIDE 31

INCORRECT Y-SCALE 2 mil 172 pixels 138 pixels 4.5 mil

slide-32
SLIDE 32

15 MIL JOB LOSS? MISLEADING TITLE

slide-33
SLIDE 33

The truth is…

SOURCE: Worst chart I've seen all day, Jocelyn Fong (2010)

unemployed not job loss! stabilized after 2009

slide-34
SLIDE 34

what is depicted in the previous one.

slide-35
SLIDE 35
slide-36
SLIDE 36

Partial data

http://www.motherjones.com/kevin-drum/2012/01/lying-charts-global-warming-edition/

slide-37
SLIDE 37

Show data in context

slide-38
SLIDE 38

Average Annual Global Temperature (℉) 1880-2015

slide-39
SLIDE 39

Choose axis scales wisely.

slide-40
SLIDE 40

Tell the Truth!

The representation of numbers … should be directly proportional to the numerical quantities measured. — Edward Tufte 1983

slide-41
SLIDE 41

Lie Factor

slide-42
SLIDE 42

Lie Factor = Size of effect in graphic Size of effect in data

slide-43
SLIDE 43

Lie Factor = Size of effect in graphic Size of effect in data Size of effect = Percentage change | V1 — V2 | = | V1 |

slide-44
SLIDE 44

Year Standard MPG

slide-45
SLIDE 45

This line, representing 18 miles per gallon in 1978, is 0.6 inches long

slide-46
SLIDE 46

This line, representing 18 miles per gallon in 1978, is 0.6 inches long This line, representing 27.5 miles per gallon in 1985, is 5.3 inches long

slide-47
SLIDE 47

This line, representing 18 miles per gallon in 1978, is 0.6 inches long This line, representing 27.5 miles per gallon in 1985, is 5.3 inches long Lie Factor = (5.3-0.6) / 0.6 (27.5-18) / 18 = 14.8

slide-48
SLIDE 48
slide-49
SLIDE 49

This chart uses radius of the balloon to encode the data Doubling the radius (or data) increases the perceived area by four.
 Lie factor = (4-1)/(2-1) = 3.

slide-50
SLIDE 50

The size of the balloon should be something like this.

slide-51
SLIDE 51

Avoid Distortion!

slide-52
SLIDE 52

Maximize Data-Ink Ratio

slide-53
SLIDE 53

Data-Ink Ratio = Data-Ink Total Ink in Graphic

slide-54
SLIDE 54
slide-55
SLIDE 55
slide-56
SLIDE 56
slide-57
SLIDE 57

Stephen Few (2011)

slide-58
SLIDE 58

Avoid Chart Junks

slide-59
SLIDE 59

Chart Junks = Unnecessary visual elements in charts that distracts the viewer from the information

slide-60
SLIDE 60
slide-61
SLIDE 61
slide-62
SLIDE 62
slide-63
SLIDE 63
slide-64
SLIDE 64
slide-65
SLIDE 65
slide-66
SLIDE 66

Are these chart junks?

slide-67
SLIDE 67

Not all chart junks are the same.

slide-68
SLIDE 68

Useful chart junks?

Source: Useful Junk? The Effects of Visual Embellishment on 
 Comprehension and Memorability of Charts, CHI’10.

slide-69
SLIDE 69

Not harmful in comprehension 
 but more engaging & memorable

slide-70
SLIDE 70

“The same ink should often serve more than

  • ne graphical purpose. A graphical element

may carry data information and also perform a design function usually left to non-data-ink.” — [Edward Tufte 83]

Replace with an aesthetic function?
slide-71
SLIDE 71

Haroz et al. CHI’15

Contextual representation can be helpful

slide-72
SLIDE 72

Pie Charts

slide-73
SLIDE 73

[Schutz 2007]

Challenge: Find the biggest pie slice in each pie chart!

Ask the second choice
slide-74
SLIDE 74

[Schutz 2007]

slide-75
SLIDE 75

Share of Coverage by Topic on TechCrunch

[http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts]

slide-76
SLIDE 76

Share of Coverage by Topic on TechCrunch Too many slices! Too many colors! Bad color contract!

[http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts]

slide-77
SLIDE 77
slide-78
SLIDE 78
slide-79
SLIDE 79

?!

slide-80
SLIDE 80

Math fail?!

slide-81
SLIDE 81

World’s Most Accurate Pie Chart

slide-82
SLIDE 82

Some rules of thumb

  • You're comparing the parts to the whole.
  • There's a small number of slices.
  • Sort the values.
  • Start at 12 o’clock.
  • Use alternatives (e.g., bar chart)

Source: https://eagereyes.org/pie-charts

Pie charts are space efficient though

slide-83
SLIDE 83

Rainbow Colormap

slide-84
SLIDE 84

[http://factmyth.com/factoids/visible-light-is-electromagnetic-radiation/]

The rainbow color map is based

  • n the order of colors in the the

visible light spectrum.

slide-85
SLIDE 85
slide-86
SLIDE 86

Can you say which color represents a higher or lower value group?

slide-87
SLIDE 87

How The Rainbow Color Map Misleads (Kosara 2013)

Luminance Hue

Lack of perceptual ordering

slide-88
SLIDE 88

Example: How many low ESS regions found?

39% 71% 10.2 sec/region 5.6 sec/region

ESS: Endothelial shear stress [M. Borkin et al 2011]

slide-89
SLIDE 89

Some rules of thumb

Qualitative (rainbow) scheme — categorical data. Sequential scheme — ordered categories or numerical data Diverging scheme — numerical data with a meaningful mid-point.

[Color Brewer]

slide-90
SLIDE 90

3D charts

slide-91
SLIDE 91

Perspective distorts information

2D is more accurate and efficient in identifying problematic regions.

71% 91% 2.4 sec/region 5.6 sec/region

[M. Borkin et al 2011]

slide-92
SLIDE 92

Perspective distorts information

slide-93
SLIDE 93

Can rotate, but still no picture at once.

Occlusion hides information

slide-94
SLIDE 94

Exploring and Reducing the Effects of Orientation on Text Readability in Volumetric Displays (CHI 2007)

Tilted text isnʼt legible

slide-95
SLIDE 95

Depth judgment is bad

Human underestimate depth! Actual intensity change vs Perceived Sensation

slide-96
SLIDE 96

External cognition vs Internal cognition

slide-97
SLIDE 97

Eyes beats memory

http://www.randalolson.com/2015/08/23/small-multiples-vs-animated-gifs-for-showing-changes-in-fertility-rates-over-time/

Harder to compare visible item to memory of what you saw Animation

slide-98
SLIDE 98

Use external cognition over internal cognition

Eyes beats memory

Easy to compare by moving eyes between side-by-side views Small multiples

slide-99
SLIDE 99

Tufte Principles Other Considerations

  • Graphical integrity
  • The lie factor
  • Maximize data-ink ratio
  • Avoid harmful chart junk
  • Pie charts
  • Rainbow colormap
  • 3D charts
  • Eyes beats memory
slide-100
SLIDE 100

Subjective Dimensions

Aesthetics: Attractive things are perceived as more useful. Style: Communicates brand, process, who the designer is.
 Playfulness: Encourages experimentation and exploration. 
 Vividness: Can make a visualization more memorable.

slide-101
SLIDE 101

Subjective Dimensions

Aesthetics: Attractive things are perceived as more useful. Style: Communicates brand, process, who the designer is.
 Playfulness: Encourages experimentation and exploration. 
 Vividness: Can make a visualization more memorable.

Important if you want others to look at your chart!

slide-102
SLIDE 102
slide-103
SLIDE 103
slide-104
SLIDE 104

Where is a red circle?

Graphical Perception

Next

slide-105
SLIDE 105

10 min break