Common visualization Issues & how to fix them Duen Horng (Polo) - - PowerPoint PPT Presentation

common visualization issues how to fix them
SMART_READER_LITE
LIVE PREVIEW

Common visualization Issues & how to fix them Duen Horng (Polo) - - PowerPoint PPT Presentation

http://poloclub.gatech.edu/cse6242 CSE6242 / CX4242: Data & Visual Analytics Common visualization Issues & how to fix them Duen Horng (Polo) Chau Associate Professor Associate Director, MS Analytics Machine Learning


slide-1
SLIDE 1

http://poloclub.gatech.edu/cse6242


CSE6242 / CX4242: Data & Visual Analytics


Common visualization Issues &
 how to fix them

Duen Horng (Polo) Chau


Associate Professor
 Associate Director, MS Analytics
 Machine Learning Area Leader, College of Computing
 Georgia Tech

Partly based on materials by 
 Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos

slide-2
SLIDE 2

2

Student of 
 Edward Tufte

http://a.co/6BhlPfZ

slide-3
SLIDE 3

Also Highly Recommended:

slide-4
SLIDE 4

4

Bar Charts

slide-5
SLIDE 5

4

The color scheme reminds you of what?

Bar Charts

slide-6
SLIDE 6

5

Better than Christmas


(Use color brewer to find good color schemes)

slide-7
SLIDE 7

6

Company Profits

slide-8
SLIDE 8

6

Don’t show profits in red!!
 Think carefully about your color choices.

Company Profits

slide-9
SLIDE 9

7

slide-10
SLIDE 10

8

slide-11
SLIDE 11

8

Misleading Bar Charts

slide-12
SLIDE 12

9

Vertical axis of bar charts 
 should start at 0, almost always

slide-13
SLIDE 13

10

slide-14
SLIDE 14

10

Disorienting color bars

slide-15
SLIDE 15

11

Use gradation

slide-16
SLIDE 16

12

slide-17
SLIDE 17

12

Avoid Tilted or Rotated Labels

slide-18
SLIDE 18

13

Bars Can be Horizontal

slide-19
SLIDE 19

13

Bars Can be Horizontal

When labels are hard to read, try horizontal layout. Don’t settle for the default.

slide-20
SLIDE 20

14 http://www.apple.com/imac/performance/

slide-21
SLIDE 21

15

Line Charts (a.k.a. fever lines)

Can you improve the tick labels?

slide-22
SLIDE 22

16

Use ticks at common intervals (e.g., 2, 5, 10, etc.)

slide-23
SLIDE 23

17

Note y-axis does not need to start at 0. 
 Why not as bad as in the case of bar chart?

Too flat or too steep?

Too flat obscures the message Too exaggerated

  • verstates the trend
slide-24
SLIDE 24

18

Rule of Thumb

slide-25
SLIDE 25

19

Multiple Patterned Lines 
 in one chart

slide-26
SLIDE 26

20

Better? 


Note the “double encoding” of line width and brightness.


What if you have many lines you want to show?

slide-27
SLIDE 27

21

“Small Multiple” - Edward Tufte
 Better than overlapping (sometimes)

“a series or grid of small similar graphics or charts, allowing them to be easily compared”

slide-28
SLIDE 28

22

What can you improve?

Tables

Name Data Data Data Company A 0.0 0.0 0.0 Company B 0.0 0.0 0.0 Company C 0.0 0.0 0.0 Company D 0.0 0.0 0.0

slide-29
SLIDE 29

What’s the problem with making everything 
 bold or italic?

slide-30
SLIDE 30

https://youtu.be/1E9pKU_N15A

slide-31
SLIDE 31

“Everyone is special” ➞ “No one is”

https://youtu.be/1E9pKU_N15A

slide-32
SLIDE 32

When everyone is special, no one is!

25

Name Data Data Data Company A 0.0 0.0 0.0 Company B 0.0 0.0 0.0 Company C 0.0 0.0 0.0 Company D 0.0 0.0 0.0

slide-33
SLIDE 33

26

Name Data Data Data Data Data Data Company A 0.0 0.0 0.0 0.0 0.0 0.0 Company B 0.0 0.0 0.0 0.0 0.0 0.0 Company C 0.0 0.0 0.0 0.0 0.0 0.0 Company D 0.0 0.0 0.0 0.0 0.0 0.0 Company E 0.0 0.0 0.0 0.0 0.0 0.0 Company F 0.0 0.0 0.0 0.0 0.0 0.0 Company G 0.0 0.0 0.0 0.0 0.0 0.0 Company H 0.0 0.0 0.0 0.0 0.0 0.0

slide-34
SLIDE 34

26

A lot of “chart junk”. 
 Low “data to ink” ratio (Edward Tufte)

Name Data Data Data Data Data Data Company A 0.0 0.0 0.0 0.0 0.0 0.0 Company B 0.0 0.0 0.0 0.0 0.0 0.0 Company C 0.0 0.0 0.0 0.0 0.0 0.0 Company D 0.0 0.0 0.0 0.0 0.0 0.0 Company E 0.0 0.0 0.0 0.0 0.0 0.0 Company F 0.0 0.0 0.0 0.0 0.0 0.0 Company G 0.0 0.0 0.0 0.0 0.0 0.0 Company H 0.0 0.0 0.0 0.0 0.0 0.0

slide-35
SLIDE 35

27

Higher “data to ink” ratio

Name Data Data Data Data Data Data Company A 0.0 0.0 0.0 12.0 0.0 0.0 Company B 0.0 0.0 0.0 11.0 0.0 0.0 Company C 0.0 0.0 0.0 10.0 0.0 0.0 Company D 0.0 0.0 0.0 9.0 0.0 0.0 Company E 0.0 0.0 0.0 8.0 0.0 0.0 Company F 0.0 0.0 0.0 7.0 0.0 0.0 Company G 0.0 0.0 0.0 6.0 0.0 0.0 Company H 0.0 0.0 0.0 5.0 0.0 0.0 Company I 0.0 0.0 0.0 4.0 0.0 0.0 Company J 0.0 0.0 0.0 3.0 0.0 0.0 Company K 0.0 0.0 0.0 2.0 0.0 0.0 Company L 0.0 0.0 0.0 1.0 0.0 0.0

slide-36
SLIDE 36

28

Problems?

Name Data Company A 1000 Company B 900 Company C 80 Company D 7 Name Data Company A 10.82 Company B 9.49 Company C 8 Company D 7.4

slide-37
SLIDE 37

29

Name Data Company A 10.8 Company B 9.5 Company C 8.0 Company D 7.4 Name Data Company A 10.82 Company B 9.49 Company C 8 Company D 7.4

slide-38
SLIDE 38

Beautiful Publication-quality LaTeX Tables

https://tex.stackexchange.com/questions/112343/beautiful-table-samples
 
 Short guide: https://www.inf.ethz.ch/personal/markusp/teaching/guides/guide-tables.pdf Long guide: http://cpansearch.perl.org/src/LIMAONE/LaTeX-Table-v1.0.6/examples/examples.pdf

30

slide-39
SLIDE 39

31

The Dreaded Pie Charts

Why people like to use pie charts?

slide-40
SLIDE 40

32 http://www.wired.com/2008/02/macworlds-iphon/

slide-41
SLIDE 41

33 http://flowingdata.com/2012/06/15/what-3-d-pie-charts-are-good-for/

slide-42
SLIDE 42

34 http://wonkette.com/412361/all-193-of-republicans-support-palin-romney-and-huckabee

slide-43
SLIDE 43

35

slide-44
SLIDE 44

36 http://infosthetics.com/archives/2008/09/funniest_pie_chart_ever.html

slide-45
SLIDE 45

Log scale instead of linear scale

Include numbers from different orders of magnitude

37

slide-46
SLIDE 46

38

Example

log-log

slide-47
SLIDE 47

39

Example

“log” also works well for time

The yield curve of Treasury bills, notes and bonds

slide-48
SLIDE 48

In-class Exercise.

Applying what you have just learned.

40

slide-49
SLIDE 49

41

Can you improve its visual design?

A brief description that outlines what the data shows HEADLINE OF THE CHART

slide-50
SLIDE 50

42

Which is better?

A brief description that outlines what the data shows

Headline of the chart 8 6 4 2

Town A Town B Town C Town D

A brief description that outlines what the data shows HEADLINE OF THE CHART

slide-51
SLIDE 51

How to fix the defaults

43

http://www.darkhorseanalytics.com/blog/clear-off-the-table

slide-52
SLIDE 52

How to fix the defaults

43

http://www.darkhorseanalytics.com/blog/clear-off-the-table

slide-53
SLIDE 53

How to fix the defaults

44

http://www.darkhorseanalytics.com/blog/clear-off-the-table

slide-54
SLIDE 54

How to fix the defaults

44

http://www.darkhorseanalytics.com/blog/clear-off-the-table

slide-55
SLIDE 55

Practitioners’ Guide

Colors: start with black & white, then add colors, carefully

Forces you to focus on content and layout

Fonts: sans-serif generally easier to read

(On Mac: Helvetica is great start) Animation: start with no animation, then add meaningful ones

45

slide-56
SLIDE 56

Practitioners’ Guide: Use Pictures and Videos

“Pictures” include tables, diagrams, charts, etc.

  • Pictures often more succinct & memorable
  • People like pictures and love movies

46

And show them ASAP! 


Once people fall asleep, it’s hard to wake them up!
 If you have good stuff, show them now.

slide-57
SLIDE 57

47

Example

slide-58
SLIDE 58

48

Example

slide-59
SLIDE 59

49

Example

slide-60
SLIDE 60

Practitioners’ Guide: Additional Tips for Researchers

Crown-jewel pictures are important

  • Overview of what readers is going to get — cut to the

chase (don’t tease!)

  • People skim and look at “interesting” things first
  • Reviewers are busy and sleepy 😵 (read 5-10 papers per

conference) — it’s refreshing to read an interesting paper How to do it?

  • Use your most impressive figure
  • Can be similar to another shown later

50

slide-61
SLIDE 61

Figures should be self-contained

Why?

  • Don’t make people go back

and forth between text & figure

  • Bad figures means bad first

impression (reject!) How to fix?

  • Succinctly describe your main

(take-away) messages 


51

slide-62
SLIDE 62

52

http://www.cs.cmu.edu/~dchau/polonium_sdm2011.pdf

Example

slide-63
SLIDE 63

More generally, how to write “good” papers?

53

http://faculty.washington.edu/wobbrock/pubs/ Wobbrock-2015.pdf

http://approximatelycorrect.com/2018/01/29/heuristics- technical-scientific-writing-machine-learning-perspective/

slide-64
SLIDE 64

Use legible fonts.

54

slide-65
SLIDE 65

For printed materials, print them out and check! Rule of thumb: about 7 lines of text on a slide.

55

If people can’t see it, they won’t appreciate it.

slide-66
SLIDE 66

Redesign figures for presentation

Designing for print is different from designing for the screen

  • Resolution (which is higher?)
  • Levels of details (people mostly want a few

“take-away” messages from your talk)

56

slide-67
SLIDE 67

57

Example

slide-68
SLIDE 68

Higher is better. Apolo wins.

* Statistically significant, by two-tailed t test, p <0.05

Judges’ Scores

8 16

Model- based *Prototyping *Average

Apolo Scholar

Score

Example

slide-69
SLIDE 69

Great Work destroyed by
 Poor Presentation

Bad color schemes Bad, tiny fonts Too much animation Too much data

59

100 times faster!

http://www.youtube.com/watch?v=lpvgfmEU2Ck&feature=player_embedded

Don McMillan: Life After Death by PowerPoint

can you read this?