Gov 51: Visualizing Distributions, Matthew Blackwell, Harvard

  1. Gov 51: Visualizing Distributions. Matthew Blackwell, Harvard University

  2. Studying political efficacy
      • 2002 WHO survey of people in China and Mexico.
      • Goal: determine feelings of political efficacy.
      • Question: "How much say do you have in getting the government to address issues that interest you?"
        1. No say at all
        2. Little say
        3. Some say
        4. A lot of say
        5. Unlimited say

  10. Data
      • Load the data:
        vignettes <- read.csv("data/vignettes.csv")
        head(vignettes)
        ##   self alison jane moses china age
        ## 1    1      5    5     2     …  31
        ## 2    1      1    5     5     0  54
        ## 3    2      3    …     …     …   …
        ## 4    2      4    2     1     0  22
        ## 5    2      3    0     3     0  52
        ## 6    1      3    1     5     0  50

  11. Contingency table
      • table() shows how many units are in each category of a variable:
        table(vignettes$self)
        ##
        ##   1   2   3   4   5
        ## 327 210 130  56  58
      • prop.table() converts these counts into proportions of units:
        prop.table(table(vignettes$self))
        ##
        ##      1      2      3      4      5
        ## 0.4187 0.2689 0.1665 0.0717 0.0743
      • Useful way to visualize this information: barplot
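      The relationship between the two outputs can be checked without the data file. This is a minimal sketch that types in the counts printed by table() on the slide; the object names counts and props are illustrative and not from the original deck.

```r
# Counts of each response category, copied from the table() output on the slide.
counts <- c("1" = 327, "2" = 210, "3" = 130, "4" = 56, "5" = 58)

# prop.table() divides each count by the total number of units.
props <- prop.table(counts)
print(round(props, 4))

# The same proportions computed by hand.
print(all.equal(props, counts / sum(counts)))
```

      Rounded to four digits, these proportions should agree with the prop.table() output shown on the slide.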

  18. Barplot example
      [Figure: barplot of self-reported political efficacy (None, A little, Some, A lot, Unlimited) on the x-axis against the proportion of respondents on the y-axis, ranging from 0.0 to about 0.4]

  19. Barplots in R
      • The barplot() function can help us visualize a categorical variable:
        barplot(height = prop.table(table(vignettes$self)),
                names = c("None", "A little", "Some", "A lot", "Unlimited"),
                xlab = "Self-reported political efficacy",
                ylab = "Proportion of Respondents")
      • Arguments:
        • height: the height each bar should take (proportions in this case)
        • names: vector of labels for each category/bar
        • xlab, ylab: axis labels
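      For readers without data/vignettes.csv, the same plot can be sketched from the counts reported on the contingency table slide. This is an illustrative version, not the deck's own code: counts, props, and bar_mids are assumed names, and the full argument name names.arg is spelled out (the slide's names = should reach the same argument through R's partial argument matching).

```r
# Response counts copied from the contingency table slide.
counts <- c(327, 210, 130, 56, 58)
props <- counts / sum(counts)

# barplot() draws one bar per element of `height` and returns the
# x-coordinates of the bar midpoints.
bar_mids <- barplot(height = props,
                    names.arg = c("None", "A little", "Some", "A lot", "Unlimited"),
                    xlab = "Self-reported political efficacy",
                    ylab = "Proportion of Respondents")
print(bar_mids)
```

      The returned midpoints are handy if text or points need to be added above the bars later.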

  26. Histogram
      • Histograms visualize the density of a continuous/numeric variable.
      [Figure: histogram titled "Distribution of Respondent's Age", with Age on the x-axis (about 20 to 80) and Density on the y-axis (0.00 to 0.04)]
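      The slide does not show the code behind this figure. One plausible way to draw a density histogram in base R is hist() with freq = FALSE. The sketch below uses a made-up age vector rather than the survey data, and the labels are copied from the figure.

```r
# Hypothetical ages, made up for illustration (not the survey data).
age <- c(19, 22, 24, 27, 31, 33, 36, 38, 42, 45, 50, 52, 54, 61, 67, 74)

# freq = FALSE puts density, rather than counts, on the vertical axis.
h <- hist(age, freq = FALSE,
          xlab = "Age", ylab = "Density",
          main = "Distribution of Respondent's Age")

# hist() also returns the bins and densities it used.
print(h$breaks)
print(h$density)
```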

  27. How to create histograms?
      • How to create a histogram by hand:
        1. Create bins along the variable of interest.
        2. Count the number of observations in each bin.
        3. Compute the density, which is the bin height: $\text{density} = \dfrac{\text{proportion of observations in bin}}{\text{bin width}}$
      • The area of each bin = proportion of observations in that bin.
      • The areas of the blocks sum to 1 (100%).
      • Can lead to confusion: the height of a block can go above 1!
      • With equal-width bins, height is proportional to the proportion in the bin.
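      The by-hand steps can be sketched in R and compared with what hist() computes. The ages below are made up for illustration (not the survey data), and the 20-year bins are an arbitrary choice; all object names are assumptions.

```r
# Hypothetical ages, made up for illustration (not the survey data).
age <- c(19, 23, 25, 31, 34, 38, 41, 47, 52, 58, 63, 71)

# 1. Create bins along the variable of interest (equal width of 20 years).
breaks <- seq(0, 80, by = 20)
bin_width <- diff(breaks)

# 2. Count the number of observations in each bin.
counts <- as.numeric(table(cut(age, breaks = breaks, include.lowest = TRUE)))

# 3. Density = proportion of observations in bin / bin width.
proportion <- counts / length(age)
density <- proportion / bin_width

# The areas of the blocks (height times width) sum to 1.
print(sum(density * bin_width))

# hist() with the same breaks reports the same densities.
h <- hist(age, breaks = breaks, plot = FALSE)
print(all.equal(h$density, density))

# With narrow bins the height can exceed 1: four values in a bin of width 0.25.
narrow <- hist(c(0.05, 0.10, 0.15, 0.20), breaks = c(0, 0.25, 0.5), plot = FALSE)
print(narrow$density)
```

      The last call shows why the density scale can be confusing: the first bin holds every observation (proportion 1) but is only 0.25 wide, so its height is well above 1 while its area is still 1.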
