categorical plot types
play

Categorical Plot Types DATA VIS UALIZ ATION W ITH S EABORN Chris - PowerPoint PPT Presentation

Categorical Plot Types DATA VIS UALIZ ATION W ITH S EABORN Chris Moftt Instructor Categorical Data Data which takes on a limited and xed number of values Normally combined with numeric data Examples include: Geography (country,


  1. Categorical Plot Types DATA VIS UALIZ ATION W ITH S EABORN Chris Mof�tt Instructor

  2. Categorical Data Data which takes on a limited and �xed number of values Normally combined with numeric data Examples include: Geography (country, state, region) Gender Ethnicity Blood type Eye color DATA VISUALIZATION WITH SEABORN

  3. Plot types - show each observation DATA VISUALIZATION WITH SEABORN

  4. Plot types - abstract representations DATA VISUALIZATION WITH SEABORN

  5. Plot types - statistical estimates DATA VISUALIZATION WITH SEABORN

  6. Plots of each observation - stripplot sns.stripplot(data=df, y="DRG Definition", x="Average Covered Charges", jitter=True) DATA VISUALIZATION WITH SEABORN

  7. Plots of each observation - swarmplot sns.swarmplot(data=df, y="DRG Definition", x="Average Covered Charges") DATA VISUALIZATION WITH SEABORN

  8. Abstract representations - boxplot sns.boxplot(data=df, y="DRG Definition", x="Average Covered Charges") DATA VISUALIZATION WITH SEABORN

  9. Abstract representation - violinplot sns.violinplot(data=df, y="DRG Definition", x="Average Covered Charges") DATA VISUALIZATION WITH SEABORN

  10. Abstract representation - lvplot sns.lvplot(data=df, y="DRG Definition", x="Average Covered Charges") DATA VISUALIZATION WITH SEABORN

  11. Statistical estimates - barplot sns.barplot(data=df, y="DRG Definition", x="Average Covered Charges", hue="Region") DATA VISUALIZATION WITH SEABORN

  12. Statistical estimates - pointplot sns.pointplot(data=df, y="DRG Definition", x="Average Covered Charges", hue="Region") DATA VISUALIZATION WITH SEABORN

  13. Statistical estimates - countplot sns.countplot(data=df, y="DRG_Code", hue="Region") DATA VISUALIZATION WITH SEABORN

  14. Let's practice! DATA VIS UALIZ ATION W ITH S EABORN

  15. Regression Plots DATA VIS UALIZ ATION W ITH S EABORN Chris Mof�tt Instructor

  16. Bicycle Dataset Aggregated bicycle sharing data in Washington DC Data includes: Rental amounts Weather information Calendar information Can we predict rental amounts? DATA VISUALIZATION WITH SEABORN

  17. Plotting with regplot() sns.regplot(data=df, x='temp', y='total_rentals', marker='+') DATA VISUALIZATION WITH SEABORN

  18. Evaluating regression with residplot() A residual plot is useful for evaluating the �t of a model Seaborn supports through residplot function sns.residplot(data=df, x='temp', y='total_rentals') DATA VISUALIZATION WITH SEABORN

  19. Polynomial regression Seaborn supports polynomial regression using the order parameter sns.regplot(data=df, x='temp', y='total_rentals', order=2) DATA VISUALIZATION WITH SEABORN

  20. residplot with polynomial regression sns.residplot(data=df, x='temp', y='total_rentals', order=2) DATA VISUALIZATION WITH SEABORN

  21. Categorical values sns.regplot(data=df, x='mnth', y='total_rentals', x_jitter=.1, order=2) DATA VISUALIZATION WITH SEABORN

  22. Estimators In some cases, an x_estimator can be useful for highlighting trends sns.regplot(data=df, x='mnth', y='total_rentals', x_estimator=np.mean, order=2) DATA VISUALIZATION WITH SEABORN

  23. Binning the data x_bins can be used to divide the data into discrete bins The regression line is still �t against all the data sns.regplot(data=df,x='temp',y='total_rentals', x_bins=4) DATA VISUALIZATION WITH SEABORN

  24. Let's practice! DATA VIS UALIZ ATION W ITH S EABORN

  25. Matrix Plots DATA VIS UALIZ ATION W ITH S EABORN Chris Mof�tt Instructor

  26. Getting data in the right format Seaborn's heatmap() function requires data to be in a grid format pandas crosstab() is frequently used to manipulate the data pd.crosstab(df["mnth"], df["weekday"], values=df["total_rentals"],aggfunc='mean').round(0) DATA VISUALIZATION WITH SEABORN

  27. Build a heatmap sns.heatmap(pd.crosstab(df["mnth"], df["weekday"], values=df["total_rentals"], aggfunc='mean') ) DATA VISUALIZATION WITH SEABORN

  28. Customize a heatmap sns.heatmap(df_crosstab, annot=True, fmt="d", cmap="YlGnBu", cbar=False, linewidths=.5) DATA VISUALIZATION WITH SEABORN

  29. Centering a heatmap Seaborn support centering the heatmap colors on a speci�c value sns.heatmap(df_crosstab, annot=True, fmt="d", cmap="YlGnBu", cbar=True, center=df_crosstab.loc[9, 6]) DATA VISUALIZATION WITH SEABORN

  30. Plotting a correlation matrix Pandas corr function calculates correlations between columns in a dataframe The output can be converted to a heatmap with seaborn sns.heatmap(df.corr()) DATA VISUALIZATION WITH SEABORN

  31. Let's practice! DATA VIS UALIZ ATION W ITH S EABORN

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend