Introduction to Data Visualization Morine Amutorine Benjamin - - PowerPoint PPT Presentation

introduction to data visualization
SMART_READER_LITE
LIVE PREVIEW

Introduction to Data Visualization Morine Amutorine Benjamin - - PowerPoint PPT Presentation

Introduction to Data Visualization Morine Amutorine Benjamin Akera Elaine Nsoesie Instructor introductions Contacts: Morine - morine.amutorine@one.un.org / T witter: @M_moryn Elaine - onelaine@bu.edu / T witter: @ensoesie Ben -


slide-1
SLIDE 1

Introduction to Data Visualization

Morine Amutorine Benjamin Akera Elaine Nsoesie

slide-2
SLIDE 2

Instructor introductions …

Contacts: Morine - morine.amutorine@one.un.org / T witter: @M_moryn Elaine - onelaine@bu.edu / T witter: @ensoesie Ben - akeraben@gmail.com

slide-3
SLIDE 3

Resources

Github Repo https://github.com/ensoesie/DSA_Visualization Google Trends https://trends.google.com Twitter https://developer.twitter.com

slide-4
SLIDE 4

Why visualize data?

A picture is worth a thousand words It is easier to remember pictures than text Useful for understanding data Can summarize large amounts of complex data

slide-5
SLIDE 5

Visualization in Data Science can be used to:

  • Explore data
  • Analyze data
  • Communicate fjndings
  • Quickly draw attention to key messages
slide-6
SLIDE 6

How to use visualizations to communicate efgectively?

slide-7
SLIDE 7

Decide on what your visualization should convey

The style and structure

  • f your visualization will

depend on its purpose

1

Tell a good story with a clear message Design for a specific audience FOCUS ON THE DATA

slide-8
SLIDE 8

Use color and size to highlight and suppress information

2

slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11

https://venngage.com/blog/how-to-pick-colors/

slide-12
SLIDE 12

Use length and position to express quantitative information. Use color for categorical information

3

Scatter plots and bar charts allow for more accurate comparison of information

  • ver time compared to pie

charts

slide-13
SLIDE 13

Think carefully about color selection and usage

4

https://africaindata.org/#/3

Use color to create groupings Add a single color to a black and white image Use black and white to add contrast to an image with a single color gradient

slide-14
SLIDE 14

Think carefully about color selection and usage

4

Some colors have pre- established meanings Consider those with color blindness Red

Stop Dangerous Hot

Green

Moving Money Plants

Blue

Water Cool Safe

slide-15
SLIDE 15

Use all available space and proper scales

5

Scale does not always have to include zero Optimize the ratio between plot objects to capture accurate relationships Transform data to a different scale e.g. use log scale to show percentage change over time

slide-16
SLIDE 16

Use text and labels to improve interpretation

6

Use meaningful titles Label axis, as needed Add texts directly to the image - do not always rely

  • n legends

Lines should not obstruct points Use colors (e.g. light grey) and weight that lessen focus on tick marks and grids

https://flowingdata.com/2016/05/17/the-changing-american-diet/

slide-17
SLIDE 17

Balance complexity and clarity

7

GapMinder (https://www.gapminder.org/tool)

slide-18
SLIDE 18

Balance complexity and clarity (infographics)

7

Templates and examples available online Can combine visualizations from python with manual editing

slide-19
SLIDE 19

Examples

slide-20
SLIDE 20

Bubbles

GapMinder (https://www.gapminder.org/tool)

When to use?

Visualize correlation/association

slide-21
SLIDE 21
  • Connected scatter

Correlogram Heatmap

Scatterplot

https://python-graph-gallery.com

slide-22
SLIDE 22

Maps

When to use?

Useful for spatial visualizations

slide-23
SLIDE 23
slide-24
SLIDE 24
  • Maps with bubbles
  • Maps with pins

healthmap.org

slide-25
SLIDE 25

Bar plots

How Africa Tweets. https://portland-communications.com/publications/how-africa-tweets-2018/

When to use?

Useful for rankings

slide-26
SLIDE 26
  • Box plot

Lollipop plot Word cloud

https://python-graph-gallery.com

slide-27
SLIDE 27

Area/density plots

When to use?

Useful for showing evolution

Jain et al. The Digital Phenotype. Nat Biotech

slide-28
SLIDE 28
  • Line plot

(Stacked) area plot Stream chart

slide-29
SLIDE 29

Networks

When to use?

Useful for information fmow

slide-30
SLIDE 30

https://vizhub.healthdata.org

  • Sankey diagram
slide-31
SLIDE 31

Code available from: https://guyabel.com/post/ animated-directional- chord-diagrams/

Chord diagram

slide-32
SLIDE 32

Bad visualizations

slide-33
SLIDE 33

Which of these images has issues?

slide-34
SLIDE 34

Which of these images has issues?

slide-35
SLIDE 35

What’s wrong with these images?

slide-36
SLIDE 36

Tools and Resources

slide-37
SLIDE 37

Python libraries

  • Matplotlib
  • ggplot
  • Seaborn
  • Bokeh
  • Pygal
  • Plotly
  • Geoplotlib
  • Gleam
  • Missingno
  • Leather
  • Pydot
slide-38
SLIDE 38

https://github.com/ft-interactive/chart-doctor/tree/master/visual-vocabulary

The Chart Doctor

slide-39
SLIDE 39

Other tools

  • T

ableau

  • R ggplot2 and others
  • D3
slide-40
SLIDE 40

Next ... ipython tutorial