Data Visualization The Good, the Bad, the Weird Nam Wook Kim - - PowerPoint PPT Presentation

data visualization the good the bad the weird
SMART_READER_LITE
LIVE PREVIEW

Data Visualization The Good, the Bad, the Weird Nam Wook Kim - - PowerPoint PPT Presentation

Data Visualization The Good, the Bad, the Weird Nam Wook Kim Mini-Courses January @ GSAS 2018 Goal Rules of thumbs to critique visualization design 45 Ways to Communicate Two Quantities [Santiago Ortiz 2012] Which one is the best


slide-1
SLIDE 1

Data Visualization The Good, the Bad, the Weird

Nam Wook Kim Mini-Courses — January @ GSAS 2018

slide-2
SLIDE 2

Goal

Rules of thumbs
 to critique visualization design

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10

45 Ways to Communicate Two Quantities

[Santiago Ortiz 2012]

slide-11
SLIDE 11

Which one is the best and why?

slide-12
SLIDE 12

There are so many ways to draw just two numbers…

slide-13
SLIDE 13

Is there an ideal way to visualize a data set?

slide-14
SLIDE 14

It depends on

Data types e.g., table, network, spatial, temporal Context of the data Tasks to perform e.g., identify trends, compare values Questions to answer Messages to deliver

slide-15
SLIDE 15

But, is there at least a guide for visualization design?

slide-16
SLIDE 16

Edward Tufte’s Design Principles

slide-17
SLIDE 17

Graphical Integrity

slide-18
SLIDE 18
slide-19
SLIDE 19

Flowing data

slide-20
SLIDE 20

Flowing data

Bar Chart should have a zero-baseline. Because you are comparing the lengths

slide-21
SLIDE 21

Flowing data

Or comparing positions from the baseline.

slide-22
SLIDE 22

Do we always need a zero-baseline?

slide-23
SLIDE 23
slide-24
SLIDE 24

Flowing data

Line Chart may not have a zero-baseline. Because you are comparing the angle

slide-25
SLIDE 25

Line chart’s aspect ratios can matter too.

https://eagereyes.org/basics/banking-45-degrees

slide-26
SLIDE 26

Line chart’s aspect ratios can matter too.

A rule of thumb is banking to 45 degrees to minimize errors in visual judgments of slope ratios.

slide-27
SLIDE 27

Comes down to the message you want to deliver

slide-28
SLIDE 28
slide-29
SLIDE 29

RANDOM QUARTER

slide-30
SLIDE 30

INCORRECT Y-SCALE

2 mil 172 pixels 138 pixels 4.5 mil

slide-31
SLIDE 31

15 MIL JOB LOSS? MISLEADING

slide-32
SLIDE 32

The truth is…

SOURCE: Worst chart I've seen all day, Jocelyn Fong (2010)

unemployment rate not job loss! stabilized after 2009

slide-33
SLIDE 33

what is depicted in the previous one.

slide-34
SLIDE 34
slide-35
SLIDE 35

Partial data

http://www.motherjones.com/kevin-drum/2012/01/lying-charts-global-warming-edition/

slide-36
SLIDE 36

Show data in context

slide-37
SLIDE 37

Average Annual Global Temperature (℉) 1880-2015

slide-38
SLIDE 38

Choose axis scales wisely.

slide-39
SLIDE 39

Same here…

slide-40
SLIDE 40

Tell the Truth!

The representation of numbers, as physically measured on the surface of the graphic itself, should be directly proportional to the numerical quantities measured. — [Edward Tufte 83]

slide-41
SLIDE 41

Lie Factor

slide-42
SLIDE 42

Lie Factor = Size of effect shown in graphic Size of effect in data

slide-43
SLIDE 43

Lie Factor = Size of effect shown in graphic Size of effect in data

where size of effect = percentage change | first value — second value | | first value | =

slide-44
SLIDE 44
slide-45
SLIDE 45

This line, representing 18 miles per gallon in 1978, is 0.6 inches long

slide-46
SLIDE 46

This line, representing 18 miles per gallon in 1978, is 0.6 inches long This line, representing 27.5 miles per gallon in 1985, is 5.3 inches long

slide-47
SLIDE 47

This line, representing 18 miles per gallon in 1978, is 0.6 inches long This line, representing 27.5 miles per gallon in 1985, is 5.3 inches long Lie Factor = (5.3-0.6) / 0.6 (27.5-18) / 18 = 14.8

slide-48
SLIDE 48
slide-49
SLIDE 49

This chart uses radius of the balloon to encode the data Doubling the radius (or data) increases the perceived area by four.
 Lie factor = (4-1)/(2-1) = 3.

slide-50
SLIDE 50

The size of the balloon should be something like this.

slide-51
SLIDE 51

Avoid Distortion!

slide-52
SLIDE 52

Maximize Data-Ink Ratio

slide-53
SLIDE 53

Data-Ink Ratio = Data-Ink Total ink used in graphic

slide-54
SLIDE 54
slide-55
SLIDE 55
slide-56
SLIDE 56
slide-57
SLIDE 57

Stephen Few (2011)

slide-58
SLIDE 58

Avoid Chart Junks

slide-59
SLIDE 59

Chart Junks = Unnecessary visual elements in charts that distracts the viewer from the information

slide-60
SLIDE 60
slide-61
SLIDE 61
slide-62
SLIDE 62
slide-63
SLIDE 63
slide-64
SLIDE 64
slide-65
SLIDE 65
slide-66
SLIDE 66

Are these chart junks?

slide-67
SLIDE 67

Not all chart junks are the same.

slide-68
SLIDE 68

Useful chart junks?

Source: Useful Junk? The Effects of Visual Embellishment on Comprehension and Memorability of Charts (CHI 2010)

slide-69
SLIDE 69

Not harmful in comprehension 
 but more engaging & memorable

slide-70
SLIDE 70

“The same ink should often serve more than

  • ne graphical purpose. A graphical element

may carry data information and also perform a design function usually left to non-data-ink.” — [Edward Tufte 83]

slide-71
SLIDE 71

Contextual representation can be helpful

slide-72
SLIDE 72

Pie Charts

slide-73
SLIDE 73

[Schutz 2007]

Challenge: Find the biggest pie slice in each pie chart!

slide-74
SLIDE 74

[Schutz 2007]

slide-75
SLIDE 75

Share of Coverage by Topic on TechCrunch

[http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts]

slide-76
SLIDE 76

Share of Coverage by Topic on TechCrunch

[http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts]

Too many slices! Too many colors! Bad color contract!

slide-77
SLIDE 77
slide-78
SLIDE 78
slide-79
SLIDE 79

?!

slide-80
SLIDE 80

Math fail?!

slide-81
SLIDE 81

World’s Most Accurate Pie Chart

slide-82
SLIDE 82

Some rules of thumb

  • You're comparing the parts to the whole.
  • There's a small number of slices.
  • Sort the values.
  • Start at 12 o’clock.
  • Use alternatives (e.g., bar chart)

Source: https://eagereyes.org/pie-charts

slide-83
SLIDE 83

Rainbow Colormap

slide-84
SLIDE 84

The rainbow color map is based on the order

  • f colors in the the visible light spectrum.

[http://factmyth.com/factoids/visible-light-is-electromagnetic-radiation/]

slide-85
SLIDE 85
slide-86
SLIDE 86

Can you say which color represents a higher or lower value group?

slide-87
SLIDE 87 How The Rainbow Color Map Misleads (Kosara 2013)

Luminance Hue

Lack of perceptual ordering

slide-88
SLIDE 88

Example: How many low ESS regions found?

39% 71% 10.2 sec/region 5.6 sec/region

ESS: Endothelial shear stress [M. Borkin et al 2011]

World’s Most Accurate Pie Chart

slide-89
SLIDE 89

Some rules of thumb

Qualitative (rainbow) scheme — categorical data. Sequential scheme — ordered categories or numerical data Diverging scheme — numerical data with a meaningful mid-point.

[Color Brewer]

slide-90
SLIDE 90

3D charts

slide-91
SLIDE 91

Perspective distorts information

2D is more accurate and efficient in identifying problematic regions.

71% 91% 2.4 sec/region 5.6 sec/region

[M. Borkin et al 2011]

slide-92
SLIDE 92

Perspective distorts information

slide-93
SLIDE 93

Occlusion hides information

Can rotate, but still no picture at once.

slide-94
SLIDE 94

Exploring and Reducing the Effects of Orientation on Text Readability in Volumetric Displays (CHI 2007)

Tilted text isnʼt legible

slide-95
SLIDE 95

Depth judgment is bad

Human underestimate depth! Actual intensity change vs Perceived Sensation

slide-96
SLIDE 96

External cognition vs Internal cognition

slide-97
SLIDE 97

Eyes beats memory

http://www.randalolson.com/2015/08/23/small-multiples-vs-animated-gifs-for-showing-changes-in-fertility-rates-over-time/

Harder to compare visible item to memory of what you saw Animation— narration can help

slide-98
SLIDE 98

Small multiples— high data density Easy to compare by moving eyes between side-by-side views

Eyes beats memory

Use external cognition over internal cognition

slide-99
SLIDE 99

Tufte Principles Other considerations

  • Graphical integrity
  • The lie factor
  • Maximize data-ink ratio
  • Avoid harmful chart junk
  • Pie charts
  • Rainbow colormap
  • 3D charts
  • Eyes beats memory
slide-100
SLIDE 100

Subjective Dimensions

  • Aesthetics: Attractive things are perceived as more useful.
  • Style: Communicates brand, process, who the designer is.

  • Playfulness: Encourages experimentation and exploration. 

  • Vividness: Can make a visualization more memorable.
slide-101
SLIDE 101

Subjective Dimensions

  • Aesthetics: Attractive things are perceived as more useful.
  • Style: Communicates brand, process, who the designer is.

  • Playfulness: Encourages experimentation and exploration. 

  • Vividness: Can make a visualization more memorable.

Important if you want others to look at your chart!

slide-102
SLIDE 102
slide-103
SLIDE 103
slide-104
SLIDE 104

Where is a red circle?

Graphical Perception

Next

slide-105
SLIDE 105

10 min break