evaluation data science and the causal revolution
play

Evaluation, data science, and the causal revolution January 15, - PowerPoint PPT Presentation

Evaluation, data science, and the causal revolution January 15, 2020 PMAP 8521: Program Evaluation for Public Service Andrew Young School of Policy Studies Georgia State University Spring 2020 Plan for today Data science and public


  1. Evaluation, data science, and the causal revolution January 15, 2020 PMAP 8521: Program Evaluation for Public Service Andrew Young School of Policy Studies • Georgia State University Spring 2020

  2. Plan for today Data science and public service Evidence, evaluation, and causation Class details Getting staRted!

  3. Data science and public service

  4. Data and government “To responsibly unleash the power of data to benefit all Americans”

  5. How do you use all this data to make the world better?

  6. What is “statistics”? Collecting and analyzing data from a representative sample in order to make inferences about a whole population

  7. What is “data science”? Algorithms Big data Data mining Machine C l o u learning d c o m p u t i n g Neural A PR-speak for r t i f i c i a l networks i n t e “statistics” l l i g e n c e

  8. What is “data science”? Turning raw data into understanding, insight, and knowledge Collect Analyze Communicate

  9. What’s the difference? Collect Statistics Analyze Communicate

  10. What is “program evaluation”? Measuring the effect of social programs on society Data and statistics Communication Causal inference (econometrics)

  11. Evidence, evaluation, and causation

  12. What is the relationship between social science research and public policy & administration?

  13. Evidence-based medicine

  14. Modern evidence-based medicine Apply evidence to clinical treatment decisions Move away from clinical judgment and “craft knowledge” Is this good?

  15. Can we find and measure evidence for policies and programs?

  16. Evidence-based policy RAND health insurance study Oregon Medicaid expansion HUD’s Moving to Opportunity Tennessee STAR

  17. Policy evidence industry Jameel Poverty Action Lab (J-PAL) Campbell Collaboration

  18. Should we have evidence for every policy or program? No! Science vs. art/craft/intuition

  19. Where does program evaluation fit with all this? It’s a method for collecting evidence for policies and programs

  20. Types of evaluation Needs assessment Design and theory assessment Process evaluation and monitoring Impact evaluation Efficiency evaluation (CBA)

  21. # Law, of people Reduced risk factors Increased commitment PSD distributes truancy parents, No truancy who know for delinquency to school information to all families students, teachers, expectations and administrators Truancy Better grades % to all schools in the district increase in grades and Grants attendance # of 5 unexcused 1st citation mailed home 1st citations absences mailed (5 total) # of PowerPoint presentation + # of 2nd citation mailed home + Students and parents attend 5 unexcused 2nd citations Explanation of state law + truancy school absences referral to truancy school truancy school mailed Instruction on PowerSchool attendees (10 total) # of 3rd citation mailed home + PSD Attendance Court 5 unexcused 3rd citations referral to truancy court (K–10) absences mailed (15 total) # of court attendees 4th District Juvenile Court Judges (9–10) Logic Model Output Legend Input Adapted from Provo School District, “Truancy Alternative Meet with district social Program Logic Model: FY Activity Outcome plan created* worker (11–12) 2011–2012.” * Because 11th and 12th graders who receive 3rd citations are generally unable to graduate from high school, district social workers no longer attempt to increase their commitment to school. As such, any outcomes that occur as a result of the alternative plans made for these students (work study programs, career development assistance, etc.) are only tangentially related to the outcomes of the truancy program itself. The system for creating alternative plans is an entirely separate program with its own logic model, goals, and outcomes.

  22. Theories of change Increased commitment to school Three phases of No truancy Reduced risk factors truancy intervention Better grades Impact evaluation!

  23. Theory → impact Grades with program Post-program grades Outcome change Grades Grades without program Pre-program grades Program activities Program outcomes Before During After Program Program Program

  24. 4.5 Lines Average number of absences Actual 4.0 Predicted ● 3.5 ● ● ● 3.0 Colors ● ● ● ● ● 2.5 80% Confidence ● ● 2.0 95% Confidence ● ● ● 1.5 Truancy intervention ● − 10 − 5 0 5 Weeks before/after truancy intervention

  25. Godwin’s Law for statistics Correlation does not imply causation Except when it does Even if it doesn’t, this phrase is useless and kills discussion

  26. Correlation vs. causation How do we figure out How do we figure out correlation? causation? Math and statistics Philosophy. No math.

  27. How do we know if X causes Y? X causes Y if… …we intervene and change X without changing anything else… …and Y changes

  28. Y “listens to” X X isn’t the only thing that causes Y A light switch causes a light to go on, but not if bulb is burned out (no Y despite X) or if the light was already on (Y without X)

  29. Causal relationships? Lighting fireworks causes noise Rooster crows are followed by sunrise Getting an MPA increases your earnings Colds go away a few days after you take vitamin C

  30. Causation Causation = Correlation + time order + all other factors ruled out How do you know if you have it right? You need a philosophical model That’s what this class is for!

  31. The causal revolution

  32. Causal diagrams Directed acyclic graphs (DAGs) Graphical model of the process that generates the data Maps your philosophical model Fancy math (“do-calculus”) tells you what to control for to find causation

  33. Break Set up an RStudio.cloud account if you haven’t Go to https://andhs.co/rstudio to join the class workspace

  34. Ask me anything!

  35. Class details

  36. model_2sls <- iv_robust( health ~ bed_net | treatment, data = bed_nets)

  37. Class technology

  38. The tidyverse

  39. The tidyverse

  40. R code, but reads like English! strike_damages_month <- bird_strikes %>% group_by(Month) %>% summarize(total_damages = sum(Cost, na.rm = TRUE), average_damages = mean(Cost, na.rm = TRUE)) ggplot(data = strike_damages_month, mapping = aes(x = Month, y = total_damages)) + geom_col() + scale_y_continuous(labels = dollar) + labs(x = "Month", y = "Total damages", title = "Really expensive collisions happen in the fall?", subtitle = "Don't fly in August or October?", source = "Source: FAA Wildlife Strike Database")

  41. Sucking There is no way to go from knowing nothing about a subject to knowing something about a subject without going through a period of much frustration and suckiness Push through. You'll suck less. Hadley Wickham, author of ggplot2 and the tidyverse

  42. Sucking

  43. Am I making you computer scientists? No! You don’t need to be a mechanic to drive a car safely You don’t need to be a computer scientist or developer to use R safely

  44. Learning R

  45. You can do this.

  46. Goals for the class Become an expert with R Speak and do causation Design rigorous evaluations Change the world with data

  47. Prerequisites Math skills Basic algebra Computer science skills None Statistical skills Regression and differences in means (ideally; you can survive without it, though)

  48. Miscellanea

  49. Class expectations Late work Technology Participation Other?

  50. Getting staRted!

  51. Goals for the class andhs.co/survey

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend