The Model You Know: Generalizability and Predictive Power of Models - - PowerPoint PPT Presentation

the model you know generalizability and predictive power
SMART_READER_LITE
LIVE PREVIEW

The Model You Know: Generalizability and Predictive Power of Models - - PowerPoint PPT Presentation

The Model You Know: Generalizability and Predictive Power of Models of Choice Under Uncertainty B. Douglas Bernheim Christine Exley Jeffrey Naecker Charles Sprenger 1/24/2019 Motivation Two important features of models:


slide-1
SLIDE 1

The Model You Know: Generalizability and Predictive Power of Models of Choice Under Uncertainty

  • B. Douglas Bernheim

Christine Exley Jeffrey Naecker Charles Sprenger 1/24/2019

slide-2
SLIDE 2

Motivation

◮ Two important features of models:

◮ Interpretability/parsimony ◮ Generalizability/predictive power

◮ Risk preference models

◮ Certainly interpretable and parsimonious ◮ Known to fit well in sample but may be issues with out-of-sample prediction

(eg, Camerer 1992)

slide-3
SLIDE 3

Our Contribution

◮ Test out-of-sample performance of utility models in two settings:

◮ Changing stakes ◮ Increasing complexity of gambles

◮ Provide alternative data and methods to

  • 1. Make more accurate predictions out-of-sample
  • 2. Get better estimates of treatment effects
slide-4
SLIDE 4

Typical Choice Problem

slide-5
SLIDE 5

Choice Environment

◮ Choose between two lotteries, A and B ◮ Represent in two Machina triangles:

◮ Triangle 1: outcomes $1, $10, $30 ◮ exterior: up to two outcomes possible in any lotter ◮ interior: up to three outcomes possible in any lottery ◮ Triangle 2: outcomes $0, $5, $20 ◮ exterior only

◮ 199 lottery pairs total ◮ Participants see random set of 80 pairs, shown sequentially ◮ Lottery A along legs of triangle, while lottery B is along hypotenuse

slide-6
SLIDE 6

Triangle 1 Triangle 2 Exterior Interior 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00

Probability of Lowest Payment Probabilty of Highest Payment

slide-7
SLIDE 7

Treatments

Treatment Question(s) Real Which option do you prefer? [1 = option A, 0 = option B] Hypothetical Hypothetically, which option do you prefer? Hypothetical likelihood Hypothetically, how likely would you be to choose Option A over Option B? [1-5] Vicarious hypothetical How likely would a typical Stanford undergraduate student be to choose Option A over Option B? Subjective Choosing which option would indicate a greater willingness to take risks? Choosing which option would indicate better judgment? Which option is more difficult to evaluate?

slide-8
SLIDE 8

Utility Models

  • 1. Expected utility with constant relative risk aversion:

U(p, x) =

  • i

pix α

i

  • 2. Cumulative prospect theory from Kahneman and Tversky (1992):

U(p, x; g) = (π(p3, g) − π(0, g))x α

3

+ (π(p2 + p3, g) − π(p3, g))x α

2

+ (π(p1, g) − π(p2 + p3, g))x α

1

where π(p, g) = pg (pg + (1 − p)g)(1/g)

slide-9
SLIDE 9

Errors

Luce decision error formulation: P(choose A) = U(A)

1 µ

U(A)

1 µ + U(B) 1 µ

◮ µ → 0: no mistakes (ie all probabilities = 0 or 1) ◮ µ → ∞: flip a coin (ie all probabilities = 1 2)

Parameter estimates

slide-10
SLIDE 10

Non-Choice Data Methods: Univariate Models

◮ Regress real choice frequency on hypothetical in triangle 1 exterior at choice

problem level: real1i = α + βhyp1i + ε

◮ Then use estimated coefficients to predict real in triangle 2 exterior from

hypothetical in triangle 2 exterior:

  • real2i = ˆ

α + ˆ βhyp2i

◮ Repeat with vicarious hypothetical likelihood mean as predictor ◮ Same procedure to predict to triangle 1 interior

slide-11
SLIDE 11

Non-Choice Data Methods: LASSO

◮ Large number of predictors:

◮ Means for all hypothetical and subjective questions ◮ For all Likert-scale questions, fraction of responses = 1, ≤ 2, ≤ 3, etc

◮ Use regularized regression (LASSO):

minβ

  • i

(yi − βxi)2 + λ||β||2

◮ Regularization parameter λ set using cross-validation

◮ Estimation and prediction as with univariate OLS models

slide-12
SLIDE 12

Prediction Metrics

◮ Bias (average prediction error):

1 N

  • i

| reali − reali|

◮ mean-squared prediction error (MSPE):

1 N

  • i

| reali − reali|2

◮ Calibration score is |β − 1|, with estimated β in the regression equation:

reali = α + β reali + εi

slide-13
SLIDE 13

Choice Probabilities

Triangle 1 Exterior Triangle 1 Interior Triangle 2 Exterior 0.4 0.8 1.2 1.6 0.6 0.8 1.0 1.2 0.6 0.9 1.2 0.00 0.25 0.50 0.75 1.00

Ratio of EV A to EV B Value

Hypothetical choice mean Real choice mean

slide-14
SLIDE 14

Prediction Statistics: Pooled

Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.048 0.035 0.187 Prospect theory: rep agent 0.045 0.033 0.163 Expected utility: hetero agents

  • 0.024

0.023 0.085 Prospect theory: hetero agents

  • 0.017

0.024 0.014 Non-choice: all vars 0.012 0.013 0.267 Non-choice: all hyp vars 0.014 0.014 0.319 Non-choice: hyp mean only 0.021 0.016 0.006 Non-choice: vicarious mean only 0.011 0.019 0.016

slide-15
SLIDE 15

In-Sample Performance

Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.009 0.014 0.046 Prospect theory: rep agent 0.009 0.013 0.050 Expected utility: hetero agents

  • 0.054

0.014 0.008 Prospect theory: hetero agents

  • 0.035

0.016 0.063 Non-choice: all vars 0.000 0.013 0.264 Non-choice: all hyp vars 0.000 0.013 0.336 Non-choice: hyp mean only 0.000 0.015 0.000 Non-choice: vicarious mean only 0.000 0.019 0.000

Visualizations

slide-16
SLIDE 16

Out-of-Sample Performance: Interior

Label Bias Mean Squared Err Calibration Score Expected utility: rep agent

  • 0.061

0.026 0.366 Prospect theory: rep agent

  • 0.065

0.027 0.360 Expected utility: hetero agents

  • 0.103

0.034 0.237 Prospect theory: hetero agents

  • 0.111

0.041 0.349 Non-choice: all vars

  • 0.005

0.012 0.305 Non-choice: all hyp vars

  • 0.007

0.013 0.344 Non-choice: hyp mean only 0.005 0.015 0.060 Non-choice: vicarious mean only

  • 0.018

0.018 0.079

Visualizations

slide-17
SLIDE 17

Out-of-Sample Performance: Triangle 2

Label Bias Mean Squared Err Calibration Score Expected utility: rep agent 0.234 0.088 0.342 Prospect theory: rep agent 0.226 0.079 0.291 Expected utility: hetero agents 0.114 0.030 0.182 Prospect theory: hetero agents 0.110 0.024 0.079 Non-choice: all vars 0.050 0.014 0.184 Non-choice: all hyp vars 0.062 0.017 0.208 Non-choice: hyp mean only 0.077 0.020 0.063 Non-choice: vicarious mean only 0.063 0.019 0.050

Visualizations

slide-18
SLIDE 18

So What?

◮ What can we do with predictions? ◮ One answer: estimate treatment effects without observing treatment ◮ Two treatments:

  • 1. Increase complexity
  • 2. Decrease stakes
slide-19
SLIDE 19

Exterior to Interior (Increase Complexity)

Actual Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only Prospect theory: hetero agents Prospect theory: rep agent −0.04 −0.02 0.00 0.02

‘Treatment Effect‘ Label

slide-20
SLIDE 20

Triangle 1 to Triangle 2 (Decrease Stakes)

Actual Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only Prospect theory: hetero agents Prospect theory: rep agent −0.10 −0.05 0.00 0.05 0.10

‘Treatment Effect‘ Label

slide-21
SLIDE 21

Conclusion

◮ Utility models may not be best option for predicting treatment effects ◮ Next step: Adding additional benchmark using methods from Naecker and

Peysakhovich (2017)

◮ Can suggest improvements to utility models

slide-22
SLIDE 22

Appendix

slide-23
SLIDE 23

Utility Parameter Estimates

Expected utility: hetero agents Prospect theory: hetero agents alpha error weight 0.25 0.50 0.75 1.00 0.25 0.50 0.75 1.00 0.000 0.005 0.010 0.015 0.020 0.00 0.05 0.10 0.00 0.05 0.10 0.15

Value Fraction

Back

slide-24
SLIDE 24

In-Sample Performance

Prospect theory: hetero agents Prospect theory: rep agent Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 0.3 0.6 0.9 0.3 0.6 0.9 0.3 0.6 0.9 0.0 0.3 0.6 0.9 0.0 0.3 0.6 0.9 0.0 0.3 0.6 0.9

Prediction ‘Real choice mean‘

Back

slide-25
SLIDE 25

Out-of-Sample Performance: Interior

Prospect theory: hetero agents Prospect theory: rep agent Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0 0.00 0.25 0.50 0.75 1.00 1.25 0.00 0.25 0.50 0.75 1.00 1.25 0.00 0.25 0.50 0.75 1.00 1.25

Prediction ‘Real choice mean‘

Back

slide-26
SLIDE 26

Out-of-Sample Performance: Triangle 2

Prospect theory: hetero agents Prospect theory: rep agent Non−choice: all vars Non−choice: hyp mean only Non−choice: vicarious mean only Expected utility: hetero agents Expected utility: rep agent Non−choice: all hyp vars 0.3 0.6 0.9 0.3 0.6 0.9 0.3 0.6 0.9 0.0 0.3 0.6 0.9 0.0 0.3 0.6 0.9 0.0 0.3 0.6 0.9

Prediction ‘Real choice mean‘

Back