cs171 visualization
play

CS171 Visualization Alexander Lex alex@seas.harvard.edu Design - PowerPoint PPT Presentation

CS171 Visualization Alexander Lex alex@seas.harvard.edu Design Guidelines Tasks [xkcd] Next Week Lecture 7: Homework 2 Design Studio Lecture 8: Interaction Guest Lecture, Jean-Daniel Fekete (INRIA) Sections: D3 & JS: Data


  1. CS171 Visualization Alexander Lex alex@seas.harvard.edu Design Guidelines Tasks [xkcd]

  2. Next Week Lecture 7: Homework 2 Design Studio Lecture 8: Interaction 
 Guest Lecture, Jean-Daniel Fekete (INRIA) Sections: D3 & JS: Data Structures, Layouts

  3. Last Tuesday The Visualization Alphabet: Marks and Channels

  4. How can I visually represent two numbers, e.g., 4 and 8

  5. Marks & Channels Marks : represent items or links Channels : change appearance based on attribute Channel = Visual Variable

  6. Marks for Items Basic geometric elements 0D 1D 2D 3D mark: Volume, but rarely used

  7. Marks for Links Containment Connection

  8. Channels (aka Visual Variables) Control appearance proportional to or based on attributes

  9. Types of Channels Magnitude Channels Identity Channels How much? What? Where? Position Shape Length Color (hue) Saturation … Spatial region … Ordinal & Quantitative Data Categorical Data

  10. Channels: Expressiveness Types and E fg ectiveness Ranks Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes Position on common scale Spatial region Position on unaligned scale Color hue Length (1D size) Motion Tilt/angle Shape Area (2D size) Depth (3D position) Color luminance Color saturation Curvature Volume (3D size)

  11. Position Strongest visual variable Suitable for all data types Problems: Sometimes not available (spatial data) Cluttering

  12. Example: Scatterplot

  13. Length & Size Good for 1D, OK for 2D, Bad for 3D Easy to see whether one is bigger Aligned bars use position redundantly

  14. Example 2D Size: Bubbles

  15. Value/Luminance/Saturation OK for quantitative data when length & size are used. Not very many shades recognizable Selective: yes Associative: yes Quantitative: somewhat (with problems) Order: yes Length: limited

  16. Example: Diverging Value-Scale

  17. ????? Color < < Selective: yes Good for qualitative data (identity channel) Associative: yes Limited number of classes/length (~7-10!) Quantitative: no Does not work for quantitative data! Order: no Lots of pitfalls! Be careful! Length: limited My rule: minimize color use for encoding data use for brushing

  18. Color: Bad Example Cliff Mass

  19. Color: Good Example

  20. Shape ????? < < Great to recognize many classes. No grouping, ordering. Selective: yes Associative: limited Quantitative: no Order: no Length: vast

  21. Why are quantitative channels different? S = sensation I = intensity

  22. How much longer? A 2x B

  23. How much longer? A 4x B

  24. How much steeper? ~4x A B

  25. How much larger (area)? 5x A B

  26. How much larger (area)? 3x A B

  27. How much larger (diameter)? 2x A B

  28. How much darker? 2x A B

  29. How much darker? 3x A B

  30. Other Factors Affecting Accuracy Alignment Distractors Distance B B A B A A Common scale Unframed Framed Unframed Unaligned Aligned Unaligned … VS VS VS

  31. Channels: Expressiveness Types and E fg ectiveness Ranks Magnitude Channels: Ordered Attributes Identity Channels: Categorical Attributes Position on common scale Spatial region Position on unaligned scale Color hue Length (1D size) Motion Tilt/angle Shape Area (2D size) Depth (3D position) Color luminance Color saturation Curvature Volume (3D size)

  32. Separability of Attributes Can we combine multiple visual variables?

  33. Sins from the past… [Mueller 09, Mueller 14]

  34. Common Mistakes

  35. Death to Pie Charts Share of coverage on TechCrunch “I hate pie charts. I mean, really hate them.” Cole Nussbaumer www.storytellingwithdata.com/2011/07/death-to-pie-charts.html

  36. Redesign

  37. Can you spot the differences?

  38. Can you spot the differences?

  39. My favorite pie chart

  40. My second favorite pie chart

  41. Sunday Star Times, 2012

  42. Quantity encoded by diameter, not area! Fixing that: R. Cunliffe, Stats Chat

  43. But is this visual encoding appropriate in the first place? R. Cunliffe, Stats Chat

  44. Graphical Integrity Flowing Data

  45. Scale Distortions Flowing Data

  46. What’s wrong?

  47. Scale Distortions

  48. Scale Distortions

  49. Start Scales at 0? A. Kriebel, VizWiz

  50. Global Warming? The Daily Mail, UK, Jan 2012

  51. Global Warming? Mother Jones

  52. Global Warming - Frame the Data Mother Jones

  53. The Lie Factor Size of effect shown in graphic Size of effect in data Tufte, VDQI

  54. The Lie Factor 5 . 3 − 0 . 6 / 27 . 5 − 18 = 14 . 8 0 . 6 18 (Size of effect in graphic)/(size of effect in data) Tufte, VDQI

  55. The Lie Factor Tufte, VDQI

  56. Tufte’s Integrity Principles Show data variation , not design variation Clear, detailed, and thorough labeling and appropriate scales Size of the graphic effect should be directly proportional to the numerical quantities (“lie factor”)

  57. Visualization Design Principles

  58. Maximize Data-Ink Ratio 0-$24,999 $25,000+ 0-$24,999 $25,000+

  59. Maximize Data-Ink Ratio 700 525 350 175 0 Males Females 0-$24,999 $25,000+ 0-$24,999 $25,000+

  60. Avoid Chartjunk Extraneous visual elements that distract from the message ongoing, Tim Brey

  61. Avoid Chartjunk ongoing, Tim Brey

  62. Avoid Chartjunk ongoing, Tim Brey

  63. Avoid Chartjunk ongoing, Tim Brey

  64. Avoid Chartjunk ongoing, Tim Brey

  65. Avoid Chartjunk ongoing, Tim Brey

  66. Which is better? [Bateman et al. 2010]

  67. Which is better? [Bateman et al. 2010] https://eagereyes.org/criticism/chart-junk-considered-useful-after-all

  68. Don’t matplotlib gallery Excel Charts Blog

  69. Design Critique

  70. Design Critique http://goo.gl/DA67PG

  71. Tasks Why are we using Visualization?

  72. Domain and Abstract Tasks Infinite numbers of domain tasks Can be broken down into simpler abstract tasks We know how to address the abstract tasks! Identify task - data combination: solutions probably exist

  73. Tasks Analyze high-level choices consume vs produce Search find a known/unknown item Query find out about characteristics of item by itself or relative to others

  74. Example 1 Find good universities with a high faculty student ratio. Identify high-ranked universities In this subset: compare universities & identify high faculty student ratio OR Derive a ranking with a high weight for faculty student ratio

  75. Example 2 Contrast Harvard’s reputation scores with MIT’s Match up Harvard with Yale First, find Harvard and Yale, then compare their (two) reputation scores

  76. Example 3 Find a combination of weights and parameters where Harvard is better than MIT Produce a new dataset by deriving from the input parameters

  77. Result

  78. High-level actions: Analyze Analyze Consume Consume discover vs present Discover Present Enjoy classic split: explore vs explain enjoy: casual, social Produce Produce Annotate Record Derive Annotate, record tag Derive: crucial design choice

  79. Example: Annotate

  80. Example: Derive

  81. Example: Derive Country Club Club Continent Ronaldo Portugal Real Madrid Europe Lahm Germany Bayern München Europe Robben Netherlands Bayern München Europe Khedira Germany Real Madrid Europe Phogba Italy Juventus Europe Messi Argentina Barcelona Europe

  82. Actions: Mid-level search, low- level query Search what does user know? Target known Target unknown Location Lookup Browse target, location known Location Locate Explore unknown how much of the data Query matters? Identify Compare Summarize one, some, all

  83. Example Compare (& Derive)

  84. Why: Targets NETWORK DATA ALL DATA Topology Trends Outliers Features Paths ATTRIBUTES One Many SPATIAL DATA Dependency Correlation Similarity Distribution Shape Extremes

  85. Examples Trends: How did the job market develop since the recession overall? Outliers: Looking at real estate related jobs

  86. How? A Preview Encode Manipulate Facet Reduce Arrange Change Juxtapose Filter Express Separate Select Partition Aggregate Order Align Use Navigate Superimpose Embed Map from categorical and ordered attributes

  87. Next time: Evaluation

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend