The Model You Know: Generalizability and Predictive Power of Models - PowerPoint PPT Presentation

The Model You Know: Generalizability and Predictive Power of Models of Choice Under Uncertainty B. Douglas Bernheim Christine Exley Jeffrey Naecker Charles Sprenger 1/24/2019

Motivation ◮ Two important features of models: ◮ Interpretability/parsimony ◮ Generalizability/predictive power ◮ Risk preference models ◮ Certainly interpretable and parsimonious ◮ Known to fit well in sample but may be issues with out-of-sample prediction (eg, Camerer 1992)

Our Contribution ◮ Test out-of-sample performance of utility models in two settings: ◮ Changing stakes ◮ Increasing complexity of gambles ◮ Provide alternative data and methods to 1. Make more accurate predictions out-of-sample 2. Get better estimates of treatment effects

Typical Choice Problem

Choice Environment ◮ Choose between two lotteries, A and B ◮ Represent in two Machina triangles: ◮ Triangle 1: outcomes $1, $10, $30 ◮ exterior: up to two outcomes possible in any lotter ◮ interior: up to three outcomes possible in any lottery ◮ Triangle 2: outcomes $0, $5, $20 ◮ exterior only ◮ 199 lottery pairs total ◮ Participants see random set of 80 pairs, shown sequentially ◮ Lottery A along legs of triangle, while lottery B is along hypotenuse

Triangle 1 Triangle 2 1.00 Probabilty of Highest Payment 0.75 Exterior 0.50 0.25 0.00 1.00 0.75 Interior 0.50 0.25 0.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 Probability of Lowest Payment

Treatments Treatment Question(s) Real Which option do you prefer? [1 = option A, 0 = option B] Hypothetical Hypothetically, which option do you prefer? Hypothetical likelihood Hypothetically, how likely would you be to choose Option A over Option B? [1-5] Vicarious hypothetical How likely would a typical Stanford undergraduate student be to choose Option A over Option B? Subjective Choosing which option would indicate a greater willingness to take risks? Choosing which option would indicate better judgment? Which option is more difficult to evaluate?

Utility Models 1. Expected utility with constant relative risk aversion: � p i x α U ( p , x ) = i i 2. Cumulative prospect theory from Kahneman and Tversky (1992): U ( p , x ; g ) = ( π ( p 3 , g ) − π (0 , g )) x α 3 + ( π ( p 2 + p 3 , g ) − π ( p 3 , g )) x α 2 + ( π ( p 1 , g ) − π ( p 2 + p 3 , g )) x α 1 where p g π ( p , g ) = ( p g + (1 − p ) g ) (1 / g )

Errors Luce decision error formulation: 1 U ( A ) µ P (choose A) = 1 1 µ + U ( B ) U ( A ) µ ◮ µ → 0: no mistakes (ie all probabilities = 0 or 1) ◮ µ → ∞ : flip a coin (ie all probabilities = 1 2 ) Parameter estimates

Non-Choice Data Methods: Univariate Models ◮ Regress real choice frequency on hypothetical in triangle 1 exterior at choice problem level: real 1 i = α + β hyp 1 i + ε ◮ Then use estimated coefficients to predict real in triangle 2 exterior from hypothetical in triangle 2 exterior: � α + ˆ real 2 i = ˆ β hyp 2 i ◮ Repeat with vicarious hypothetical likelihood mean as predictor ◮ Same procedure to predict to triangle 1 interior

Non-Choice Data Methods: LASSO ◮ Large number of predictors: ◮ Means for all hypothetical and subjective questions ◮ For all Likert-scale questions, fraction of responses = 1, ≤ 2, ≤ 3, etc ◮ Use regularized regression (LASSO): � ( y i − β x i ) 2 + λ || β || 2 min β i ◮ Regularization parameter λ set using cross-validation ◮ Estimation and prediction as with univariate OLS models

Choice Probabilities Triangle 1 Triangle 1 Triangle 2 Exterior Interior Exterior 1.00 0.75 Value 0.50 0.25 0.00 0.4 0.8 1.2 1.6 0.6 0.8 1.0 1.2 0.6 0.9 1.2 Ratio of EV A to EV B Hypothetical choice mean Real choice mean

Prediction Statistics: Pooled Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.048 0.035 0.187 Prospect theory: rep agent 0.045 0.033 0.163 Expected utility: hetero agents -0.024 0.023 0.085 Prospect theory: hetero agents -0.017 0.024 0.014 Non-choice: all vars 0.012 0.013 0.267 Non-choice: all hyp vars 0.014 0.014 0.319 Non-choice: hyp mean only 0.021 0.016 0.006 Non-choice: vicarious mean only 0.011 0.019 0.016

In-Sample Performance Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.009 0.014 0.046 Prospect theory: rep agent 0.009 0.013 0.050 Expected utility: hetero agents -0.054 0.014 0.008 Prospect theory: hetero agents -0.035 0.016 0.063 Non-choice: all vars 0.000 0.013 0.264 Non-choice: all hyp vars 0.000 0.013 0.336 Non-choice: hyp mean only 0.000 0.015 0.000 Non-choice: vicarious mean only 0.000 0.019 0.000 Visualizations

Out-of-Sample Performance: Interior Label Bias Mean Squared Err Calibration Score Expected utility: rep agent -0.061 0.026 0.366 Prospect theory: rep agent -0.065 0.027 0.360 Expected utility: hetero agents -0.103 0.034 0.237 Prospect theory: hetero agents -0.111 0.041 0.349 Non-choice: all vars -0.005 0.012 0.305 Non-choice: all hyp vars -0.007 0.013 0.344 Non-choice: hyp mean only 0.005 0.015 0.060 Non-choice: vicarious mean only -0.018 0.018 0.079 Visualizations

Out-of-Sample Performance: Triangle 2 Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.234 0.088 0.342 Prospect theory: rep agent 0.226 0.079 0.291 Expected utility: hetero agents 0.114 0.030 0.182 Prospect theory: hetero agents 0.110 0.024 0.079 Non-choice: all vars 0.050 0.014 0.184 Non-choice: all hyp vars 0.062 0.017 0.208 Non-choice: hyp mean only 0.077 0.020 0.063 Non-choice: vicarious mean only 0.063 0.019 0.050 Visualizations

So What? ◮ What can we do with predictions? ◮ One answer: estimate treatment effects without observing treatment ◮ Two treatments: 1. Increase complexity 2. Decrease stakes

Exterior to Interior (Increase Complexity) Prospect theory: rep agent Prospect theory: hetero agents Non−choice: vicarious mean only Non−choice: hyp mean only Label Non−choice: all vars Non−choice: all hyp vars Expected utility: rep agent Expected utility: hetero agents Actual −0.04 −0.02 0.00 0.02 ‘Treatment Effect‘

Triangle 1 to Triangle 2 (Decrease Stakes) Prospect theory: rep agent Prospect theory: hetero agents Non−choice: vicarious mean only Non−choice: hyp mean only Label Non−choice: all vars Non−choice: all hyp vars Expected utility: rep agent Expected utility: hetero agents Actual −0.10 −0.05 0.00 0.05 0.10 ‘Treatment Effect‘

Conclusion ◮ Utility models may not be best option for predicting treatment effects ◮ Next step: Adding additional benchmark using methods from Naecker and Peysakhovich (2017) ◮ Can suggest improvements to utility models

Appendix

Utility Parameter Estimates Expected utility: hetero agents Prospect theory: hetero agents 0.020 0.015 alpha 0.010 0.005 0.000 Fraction 0.10 error 0.05 0.00 0.15 weight 0.10 0.05 0.00 0.25 0.50 0.75 1.00 0.25 0.50 0.75 1.00 Value Back

In-Sample Performance Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 0.9 0.6 0.3 ‘Real choice mean‘ 0.0 Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only 0.9 0.6 0.3 0.0 0.3 0.6 0.9 Prospect theory: hetero agents Prospect theory: rep agent 0.9 0.6 0.3 0.0 0.3 0.6 0.9 0.3 0.6 0.9 Prediction Back

Out-of-Sample Performance: Interior Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 1.25 1.00 0.75 0.50 0.25 ‘Real choice mean‘ 0.00 Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only 1.25 1.00 0.75 0.50 0.25 0.00 0.2 0.4 0.6 0.8 1.0 Prospect theory: hetero agents Prospect theory: rep agent 1.25 1.00 0.75 0.50 0.25 0.00 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0 Prediction Back

Out-of-Sample Performance: Triangle 2 Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 0.9 0.6 0.3 0.0 ‘Real choice mean‘ Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only 0.9 0.6 0.3 0.0 0.3 0.6 0.9 Prospect theory: hetero agents Prospect theory: rep agent 0.9 0.6 0.3 0.0 0.3 0.6 0.9 0.3 0.6 0.9 Prediction Back

The Model You Know: Generalizability and Predictive Power of Models - PowerPoint PPT Presentation

The Model You Know: Generalizability and Predictive Power of Models of Choice Under Uncertainty B. Douglas Bernheim Christine Exley Jeffrey Naecker Charles Sprenger 1/24/2019 Motivation Two important features of models:

Model Predictive Control Model Predictive Control of Hybrid Systems of Hybrid Systems Model

What You Dont Know What You Dont Know What You Dont Know What You Dont Know That

Generalizability Theory; Understanding Variance in Research Casperhp12345@gmail.com

Session 3 Upskilling for Predictive Analytics Travis M Short, FSA Upskilling for Predictive

(11-14) How much do you know about the internet? Make sure you stay SAFE AND SECURE ONLINE YOU

WELCOME! You need to know what you know, and know what you dont know. Then work on your areas

Know how. Know now. Know how. Know now. Please Thank our sponsor! The Nebraska Soybean Board

(power x 0) == 1 (power x (+ n 1)) == (* (power x n) x) (power x 0) == 1 (power x (+ (* 2 m)

We Know It ! We Know It ! WeKnowIt WeKnowIt Emerging, Collective Intelligence for personal,

Generalized Model Predictive Control (Discretely Generalized MPC) Sa sa V. Rakovi c, Ph.D.

Rethinking our Alignment Obsession: Can a Generalizability Perspective Help? Scott Marion, Center

Contributions to Generalizability Theory: Dick Jaegers Indirect but Strong Mentoring Effects

On the generalizability of P an .inis praty ah ara -technique to other languages

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science

Predictive Analytics for Capacity Planning HIC 2015 Andrae Gaeth What is predictive

The Art The Art when you don't know! Define what you want when you do know! of of Know

EC3062 ECONOMETRICS ELEMENTARY REGRESSION ANALYSIS We shall consider three methods for estimating

s trts r

Microarrays False Discovery Rate Prof. Tesler Math 186 Winter 2019 Prof. Tesler

Class Imbalance Multiclass Problems General Idea Original D Training data .... Step 1:

Predicting Secondary Structure of All-Helical Proteins Using Hidden Markov Support Vector

10: Biological Applications for HMMs Machine Learning and Real-world Data (MLRD) Ann Copestake

COMP364: PDB & Biopython Jrme Waldisphl, McGill University

Computational Methods in Protein Structure Prediction C.A. Floudas Department of Chemical

Sambuz

Useful Links

Newsletter

Mail Us

The Model You Know: Generalizability and Predictive Power of Models - PowerPoint PPT Presentation

The Model You Know: Generalizability and Predictive Power of Models of Choice Under Uncertainty B. Douglas Bernheim Christine Exley Jeffrey Naecker Charles Sprenger 1/24/2019 Motivation Two important features of models:

Model Predictive Control Model Predictive Control of Hybrid Systems of Hybrid Systems Model

What You Dont Know What You Dont Know What You Dont Know What You Dont Know That

Generalizability Theory; Understanding Variance in Research Casperhp12345@gmail.com

Session 3 Upskilling for Predictive Analytics Travis M Short, FSA Upskilling for Predictive

(11-14) How much do you know about the internet? Make sure you stay SAFE AND SECURE ONLINE YOU

WELCOME! You need to know what you know, and know what you dont know. Then work on your areas

Know how. Know now. Know how. Know now. Please Thank our sponsor! The Nebraska Soybean Board

(power x 0) == 1 (power x (+ n 1)) == (* (power x n) x) (power x 0) == 1 (power x (+ (* 2 m)

We Know It ! We Know It ! WeKnowIt WeKnowIt Emerging, Collective Intelligence for personal,

Generalized Model Predictive Control (Discretely Generalized MPC) Sa sa V. Rakovi c, Ph.D.

Rethinking our Alignment Obsession: Can a Generalizability Perspective Help? Scott Marion, Center

Contributions to Generalizability Theory: Dick Jaegers Indirect but Strong Mentoring Effects

On the generalizability of P an .inis praty ah ara -technique to other languages

Reproducibility &amp; Generalizability @ Twitter Strengthening Reproducibility in Network Science

Predictive Analytics for Capacity Planning HIC 2015 Andrae Gaeth What is predictive

The Art The Art when you don't know! Define what you want when you do know! of of Know

EC3062 ECONOMETRICS ELEMENTARY REGRESSION ANALYSIS We shall consider three methods for estimating

s trts r

Microarrays False Discovery Rate Prof. Tesler Math 186 Winter 2019 Prof. Tesler

Class Imbalance Multiclass Problems General Idea Original D Training data .... Step 1:

Predicting Secondary Structure of All-Helical Proteins Using Hidden Markov Support Vector

10: Biological Applications for HMMs Machine Learning and Real-world Data (MLRD) Ann Copestake

COMP364: PDB &amp; Biopython Jrme Waldisphl, McGill University

Computational Methods in Protein Structure Prediction C.A. Floudas Department of Chemical

Sambuz

Useful Links

Newsletter

Mail Us

Reproducibility & Generalizability @ Twitter Strengthening Reproducibility in Network Science

COMP364: PDB & Biopython Jrme Waldisphl, McGill University