Looking at the farmers market data IMP R OVIN G YOU R DATA VISU - - PowerPoint PPT Presentation

looking at the farmers market data
SMART_READER_LITE
LIVE PREVIEW

Looking at the farmers market data IMP R OVIN G YOU R DATA VISU - - PowerPoint PPT Presentation

Looking at the farmers market data IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON Nick Stra y er Instr u ctor First e x plorations of a dataset Take a broad v ie w Sho w as m u ch info as possible Don ' t f u ss o v er appearances


slide-1
SLIDE 1

Looking at the farmers market data

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

Nick Strayer

Instructor

slide-2
SLIDE 2

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

First explorations of a dataset

Take a broad view Show as much info as possible Don't fuss over appearances

slide-3
SLIDE 3

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Using your head()

pollution.head()

slide-4
SLIDE 4

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

# Just show median pollution.describe(percentiles=[0.5] # Describe all columns include='all')

slide-5
SLIDE 5

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON pd.plotting.scatter_matrix(pollution, alpha = 0.2);

slide-6
SLIDE 6

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-7
SLIDE 7

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-8
SLIDE 8

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON markets.head()

slide-9
SLIDE 9

Let's explore our data

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

slide-10
SLIDE 10

Exploring the patterns

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

Nick Strayer

Instructor

slide-11
SLIDE 11

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Digging in deeper

Investigating correlations Are correlations driven by confounding? Anything surprising?

slide-12
SLIDE 12

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Target audiences

Shared with peers Be smart about design decisions Remember they aren't as familiar with data

slide-13
SLIDE 13

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON sns.regplot('NO2', 'CO', ci=False, data=pollution, # Lower opacity of points scatter_kws={'alpha':0.2, 'color':'grey'} )

slide-14
SLIDE 14

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Profiling patterns

Found interesting paern in data How to quickly explore and explain the paern? Use text!

slide-15
SLIDE 15

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Using text scatters to id outliers

slide-16
SLIDE 16

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON g = sns.scatterplot("SO2","CO", data=long_beach_avgs) # Iterate over the rows of our data for _, row in long_beach_avgs.iterrows(): # Unpack columns from row month, SO2, CO = row # Draw annotation in correct place g.annotate(month, (SO2,CO)) plt.title('Long Beach avg SO2 by CO')

slide-17
SLIDE 17

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-18
SLIDE 18

Let's dig in

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

slide-19
SLIDE 19

Making your visualizations efficient

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

Nick Strayer

Instructor

slide-20
SLIDE 20

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

What is efficient?

Reduce the eort needed to see story Re-organize plots to keep focus Improve 'ink' to info ratio Don't compromise the message

slide-21
SLIDE 21

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-22
SLIDE 22

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

# Create a subplot w/ one row & two columns. f, (ax1, ax2) = plt.subplots(1, 2) # Pass each axes to respective plot sns.lineplot('month', 'NO2', 'year', ax=ax1, data=pol_by_month) sns.barplot('year', 'count', ax=ax2, data=obs_by_year)

slide-23
SLIDE 23

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Clear unnecessary legends

slide-24
SLIDE 24

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

sns.lineplot('month', 'NO2', 'year', ax=ax1, data=pol_by_month, palette='RdBu',) sns.barplot('year', 'count', 'year', ax=ax2, data=obs_by_year, palette='RdBu', dodge=False) # Remove legends for both plots ax1.legend_.remove() ax2.legend_.remove()

slide-25
SLIDE 25

Let's practice

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

slide-26
SLIDE 26

Tweaking your plots

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

Nick Strayer

Instructor

slide-27
SLIDE 27

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Looking at the small things

Put yourself into the viewer's shoes

slide-28
SLIDE 28

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Is the aesthetic appropriate?

Is the aesthetic appropriate for the context?

slide-29
SLIDE 29

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Font-sizes

Is everything legible?

slide-30
SLIDE 30

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-31
SLIDE 31

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-32
SLIDE 32

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-33
SLIDE 33

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-34
SLIDE 34

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-35
SLIDE 35

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Removing spines from plots

slide-36
SLIDE 36

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Removing spines from plots

slide-37
SLIDE 37

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-38
SLIDE 38

Let's tweak some plots

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

slide-39
SLIDE 39

Wrap-Up

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON

Nick Strayer

Instructor

slide-40
SLIDE 40

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-41
SLIDE 41

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Using color responsibly

slide-42
SLIDE 42

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-43
SLIDE 43

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

slide-44
SLIDE 44

IMPROVING YOUR DATA VISUALIZATIONS IN PYTHON

Going further

Blogs Flowing data Curated list of data visualizations. Datawrapper Blog Articles that dig deep into visualization techniques and mistakes. Twier #datavis An ongoing stream of cool projects and inspiration.

slide-45
SLIDE 45

Thank you!

IMP R OVIN G YOU R DATA VISU AL IZATION S IN P YTH ON