CS-5630 / CS-6630 Visualization Design Guidelines; Tasks Alexander - - PowerPoint PPT Presentation

cs 5630 cs 6630 visualization design guidelines tasks
SMART_READER_LITE
LIVE PREVIEW

CS-5630 / CS-6630 Visualization Design Guidelines; Tasks Alexander - - PowerPoint PPT Presentation

CS-5630 / CS-6630 Visualization Design Guidelines; Tasks Alexander Lex alex@sci.utah.edu [xkcd] Design Critique / Redesign https://goo.gl/lHWp4x Sunday Star Times, 2012 Quantity encoded by diameter, not area! Fixing that: R. Cunliffe,


slide-1
SLIDE 1

CS-5630 / CS-6630 Visualization Design Guidelines; Tasks

Alexander Lex alex@sci.utah.edu

[xkcd]

slide-2
SLIDE 2

Design Critique / Redesign

slide-3
SLIDE 3

Sunday Star Times, 2012

https://goo.gl/lHWp4x

slide-4
SLIDE 4
  • R. Cunliffe, Stats Chat

Quantity encoded by diameter, not area! Fixing that:

slide-5
SLIDE 5
  • R. Cunliffe, Stats Chat

But is this visual encoding appropriate in the first place?

slide-6
SLIDE 6

Design Guidelines

slide-7
SLIDE 7

Edward Tufte

slide-8
SLIDE 8

Design Excellence

“Well-designed presentations of interesting data are a matter of substance, of statistics, and of design.”

slide-9
SLIDE 9
slide-10
SLIDE 10

Tufte’s Lessons

Practice: graphical integrity and excellence Theory: design principles for data graphics

slide-11
SLIDE 11

Graphical Integrity

Flowing Data

slide-12
SLIDE 12

Scale Distortions

Flowing Data

slide-13
SLIDE 13

What’s wrong?

slide-14
SLIDE 14

Scale Distortions

slide-15
SLIDE 15

Scale Distortions

slide-16
SLIDE 16

Start Scales at 0?

  • A. Kriebel,

VizWiz

slide-17
SLIDE 17

Global Warming?

The Daily Mail, UK, Jan 2012

slide-18
SLIDE 18

Global Warming?

Mother Jones

slide-19
SLIDE 19

Global Warming - Frame the Data

Mother Jones

slide-20
SLIDE 20

The Lie Factor

Tufte, VDQI

Size of effect shown in graphic Size of effect in data

slide-21
SLIDE 21

The Lie Factor

(Size of effect in graphic)/(size of effect in data)

5.3 − 0.6 0.6 /27.5 − 18 18 = 14.8

Tufte, VDQI

slide-22
SLIDE 22

The Lie Factor

Tufte, VDQI

slide-23
SLIDE 23

Tufte’s Integrity Principles

Show data variation, not design variation Clear, detailed, and thorough labeling and appropriate scales Size of the graphic effect should be directly proportional to the numerical quantities (“lie factor”)

slide-24
SLIDE 24

Death to Pie Charts

Cole Nussbaumer www.storytellingwithdata.com/2011/07/death-to-pie-charts.html

“I hate pie charts. I mean, really hate them.”

Share of coverage

  • n TechCrunch
slide-25
SLIDE 25

Redesign

slide-26
SLIDE 26

Can you spot the differences?

slide-27
SLIDE 27

Can you spot the differences?

slide-28
SLIDE 28

My favorite pie chart

slide-29
SLIDE 29

My second favorite pie chart

slide-30
SLIDE 30

Visualization Design Principles

slide-31
SLIDE 31

Maximize Data-Ink Ratio

0-$24,999 $25,000+ 0-$24,999 $25,000+

slide-32
SLIDE 32

Maximize Data-Ink Ratio

175 350 525 700 Males Females

0-$24,999 $25,000+ 0-$24,999 $25,000+

slide-33
SLIDE 33

Avoid Chartjunk

  • ngoing, Tim Brey

Extraneous visual elements that distract from the message

slide-34
SLIDE 34

Avoid Chartjunk

  • ngoing, Tim Brey
slide-35
SLIDE 35

Avoid Chartjunk

  • ngoing, Tim Brey
slide-36
SLIDE 36

Avoid Chartjunk

  • ngoing, Tim Brey
slide-37
SLIDE 37

Avoid Chartjunk

  • ngoing, Tim Brey
slide-38
SLIDE 38

Avoid Chartjunk

  • ngoing, Tim Brey
slide-39
SLIDE 39

Which is better?

[Bateman et al. 2010]

slide-40
SLIDE 40

Which is better?

https://eagereyes.org/criticism/chart-junk-considered-useful-after-all

[Bateman et al. 2010]

slide-41
SLIDE 41
slide-42
SLIDE 42

EXPERIMENTAL RESULTS

  • 1. No significant difference between plain and image charts for interactive

interpretation accuracy

  • 2. No significant difference in recall accuracy after a five-minute gap
  • 3. Significantly better recall for Holmes charts of both the chart topic and

the details (categories and trend) after long-term gap (2-3 weeks).

  • 4. Participants saw value messages in the Holmes charts significantly

more often than in the plain charts.

  • 5. Participants found the Holmes charts more attractive, most enjoyed

them, and found that they were easiest and fastest to remember.

slide-43
SLIDE 43

PROS persuasion memorability engagement CONS unbiased analysis trustworthiness interpretability space efficiency

Use Chart Junk? It depends!

slide-44
SLIDE 44

Don’t

matplotlib gallery

Excel Charts Blog
slide-45
SLIDE 45

Tasks

Why are we using Visualization?

slide-46
SLIDE 46

Domain and Abstract Tasks

Infinite numbers of domain tasks Can be broken down into simpler abstract tasks We know how to address the abstract tasks! Identify task - data combination: solutions probably exist

slide-47
SLIDE 47

Tasks

Analyze

high-level choices consume vs produce

Search

find a known/unknown item

Query

find out about characteristics of item by itself or relative to others

slide-48
SLIDE 48

Example 1

Find good universities with a high faculty student ratio.

Identify high-ranked universities In this subset: compare universities & identify high faculty student ratio

OR

Derive a ranking with a high weight for faculty student ratio

slide-49
SLIDE 49

Example 2

Contrast Harvard’s reputation scores with MIT’s Match up Harvard with Yale

First, find Harvard and Yale, then compare their (two) reputation scores

slide-50
SLIDE 50

Example 3

Find a combination of weights and parameters where Harvard is better than MIT

Produce a new dataset by deriving from the input parameters

slide-51
SLIDE 51

Result

slide-52
SLIDE 52

High-level actions: Analyze

Consume discover vs present

classic split: explore vs explain

enjoy: casual, social Produce Annotate, record Derive: crucial design choice

Analyze Consume

Present Enjoy Discover

Produce

Annotate Record Derive

tag

slide-53
SLIDE 53

Example: Annotate

slide-54
SLIDE 54

Example: Derive

slide-55
SLIDE 55

Example: Derive

Country Club Club Continent Ronaldo Portugal Real Madrid Europe Lahm Germany Bayern München Europe Robben Netherlands Bayern München Europe Khedira Germany Real Madrid Europe Phogba Italy Juventus Europe Messi Argentina Barcelona Europe

slide-56
SLIDE 56
slide-57
SLIDE 57

Actions: Mid-level search, low- level query

what does user know?

target, location

how much of the data matters?

  • ne, some, all

Search Query Identify Compare Summarize

Target known Target unknown Location known Location unknown

Lookup Locate Browse Explore

slide-58
SLIDE 58

Example Compare (& Derive)

slide-59
SLIDE 59

Why: Targets

Trends ALL DATA Outliers Features ATTRIBUTES One Many

Distribution Dependency Correlation Similarity Extremes

NETWORK DATA SPATIAL DATA Shape Topology

Paths

slide-60
SLIDE 60

Examples

Trends: How did the job market develop since the recession overall? Outliers: Looking at real estate related jobs

slide-61
SLIDE 61

How? A Preview

Encode Manipulate Facet Reduce Arrange Map Change Select Navigate Express Separate Order Align Use Juxtapose Partition Superimpose Filter Aggregate Embed from categorical and ordered attributes