World Happiness Report Data Analysis Presented By: Jacob - - PowerPoint PPT Presentation

world happiness report data analysis
SMART_READER_LITE
LIVE PREVIEW

World Happiness Report Data Analysis Presented By: Jacob - - PowerPoint PPT Presentation

World Happiness Report Data Analysis Presented By: Jacob Bissonette, Kyle Costello, Zachary Mailloux, and Adrianna Staszewska What are we looking at? - Happiness standings of different countries - Different factors that play important roles


slide-1
SLIDE 1

World Happiness Report Data Analysis

Presented By: Jacob Bissonette, Kyle Costello, Zachary Mailloux, and Adrianna Staszewska

slide-2
SLIDE 2

What are we looking at?

  • Happiness standings of different countries
  • Happiness rankings of different

regions of the world

  • Different factors that play important roles

in a countries happiness rating

slide-3
SLIDE 3

What Does Our Data Include?

  • Results from 134 Countries between 2015 - 2019
  • Dependent Variable: Happiness Score, Scale of 1 - 10
  • Independent Variables:
  • Economic Score (GDP per Capita)
  • Health Score (Life Expectancy)
  • Freedom Score (ability to make own life choices)
  • Trust Score (higher score means greater trust/less corruption in gov’t)
  • Generosity Score
slide-4
SLIDE 4

Cleaning the Data

  • Missing countries across data sets
  • Missing independent variables across data sets
  • Zeroed values for some independent variables in specific countries
  • Added extra variable called Regions for further break down
slide-5
SLIDE 5

Approaching Our Problem

Goals

  • Predict a happiness score for a country
  • Determine trends in happiness scores with respect

to regions

  • Determine what variables have a larger deciding

factor in the happiness score

Approaches

  • Regression analysis
  • Singular/Multiple/Tree
  • Basic data visualization
  • Heat map
slide-6
SLIDE 6

Initial Analysis

slide-7
SLIDE 7
slide-8
SLIDE 8

Regression Analysis

slide-9
SLIDE 9

Regression in Excel

Examined a Multiple Regression using the entire data set from 2015-2019

R2 = 0.745

Independent Variables and Significance GDP P-Value = 5.81E-47 Life Expectancy P-Value = 8.66E-17 Freedom P-Value = 2.25E-23 Trust P-Value = 0.127080097 Generosity P-Value = 0.018402025

slide-10
SLIDE 10

Choosing our predictors

slide-11
SLIDE 11

Regression Tree Approach

slide-12
SLIDE 12

Multiple Regression Model

HappinessScore = 2.23853 + 1.34888*GDP + 1.49602*LE + 2.36287*F Multiple R2 value for training data: 0.755 Multiple R2 for testing data: 0.688

slide-13
SLIDE 13
slide-14
SLIDE 14

Comparison of Multiple Regression and Tree Methods

Multiple Regression Tree Regression

MSE 0.35 0.46 MAPE 9.33% 10.5% R2 68.8% 58.4%

slide-15
SLIDE 15

Conclusions

  • Most important factors when considering global happiness are:
  • GDP
  • Freedom
  • Life Expectancy
  • Using the Multiple Regression approach provided lower MSE values and

higher R2 values than the Tree Method approach

  • Determined that Europe, North America, and Oceania have some of the

highest happiness scores in the world

slide-16
SLIDE 16

Questions?