Selection Bias - What You Don’t Know Can Hurt Your Bottom Line
Gaétan Veilleux, Valen Technologies
2011 CAS Ratemaking and Product Management Seminar, March 20-22, 2011

Antitrust Notice
• The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to provide a forum for the expression of various points of view on topics described in the programs or agendas for such meetings.
• Under no circumstances shall CAS seminars be used as a means for competing companies or firms to reach any understanding – expressed or implied – that restricts competition or in any way impairs the ability of members to exercise independent business judgment regarding matters affecting competition.
• It is the responsibility of all seminar participants to be aware of antitrust regulations, to prevent any written or verbal discussions that appear to violate these laws, and to adhere in every respect to the CAS antitrust compliance policy.

“I don’t like statistics. It’s like logic, it doesn’t make any sense.”

What Is Selection Bias?
• “A type of bias caused by choosing non-random data for statistical analysis. The bias exists due to a flaw in the sample selection process, where a subset of the data is systematically excluded due to a particular attribute. The exclusion of the subset can influence the statistical significance of the test, or produce distorted results.” (Investopedia)
• Selection bias results from estimation on a subsample of individuals who have essentially elected themselves for estimation through their decision to participate in a particular program.
  – Sample selection bias occurs if those who choose not to participate are systematically different from those who do.
  – Attrition bias occurs if selected individuals are “lost” over time and those who are lost differ systematically from those who remain.

Should We Be Concerned?
• The systematic selection of a sub-sample which differs from the overall population will yield distorted empirical results for the population of interest.
• Building a model on such data without attempting to correct for the non-random sampling will yield biased estimates, or estimates that apply only to the selected sub-sample.

Example Of Misspecification
[Figure not recovered]

Selection Bias Outside of Insurance
• Economics/Econometrics
• Finance/Credit Industry
• Social Sciences
• Marketing
• Political Science
• Epidemiology
• Investment Analysis
• Insurance
• Many Others

Selection Bias In Insurance
• Do any insurance processes systematically exclude subsets of a population?
  – Pricing
  – Underwriting
  – Claims
  – Marketing
  – Customer Service
  – Customer Retention
• What is the source of the systematic selection process?

Statistical Methods

3 Modified Distributional Forms
• Truncation: A sample is drawn from a subset of a larger population of interest.
• Censoring: All values above or below some value are set to one value.
• Sample Selection (incidental truncation): A specific form of truncation.
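
The three forms can be told apart by what happens to the data. The sketch below is only an illustration: the function names, the threshold of 0.0, and the sample values are hypothetical and do not come from the presentation. It assumes truncation and censoring from below.

```python
def truncate(values, a):
    """Truncation: observations at or below a never enter the sample."""
    return [v for v in values if v > a]


def censor(values, a):
    """Censoring from below: every value at or below a is recorded as a."""
    return [max(v, a) for v in values]


def incidentally_truncate(pairs, a):
    """Sample selection: y is observed only when a second variable z exceeds a."""
    return [y for y, z in pairs if z > a]


# Hypothetical outcome values y and a hypothetical selection variable z.
outcomes = [-2.0, -0.5, 0.0, 1.5, 3.0]
selection = [0.4, -1.0, 2.0, -0.3, 1.1]

truncated = truncate(outcomes, 0.0)
censored = censor(outcomes, 0.0)
selected = incidentally_truncate(list(zip(outcomes, selection)), 0.0)
```

Truncation shortens the sample, censoring keeps its length but piles values at the limit, and sample selection drops rows based on a different variable than the one being analyzed.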

Truncated Normal Distribution
Density of a truncated random variable:
$$f(y \mid y > a) = \frac{f(y)}{\operatorname{Prob}(y > a)} = \frac{\frac{1}{\sigma}\,\phi\!\left(\frac{y - \mu}{\sigma}\right)}{1 - \Phi(\alpha)}$$
Moments:
$$E[y \mid y > a] = \mu + \sigma\lambda(\alpha), \qquad \operatorname{Var}[y \mid y > a] = \sigma^2\,[1 - \delta(\alpha)]$$
where $\alpha = (a - \mu)/\sigma$, $\lambda(\alpha) = \phi(\alpha)/[1 - \Phi(\alpha)]$ (Inverse Mills Ratio), and $\delta(\alpha) = \lambda(\alpha)[\lambda(\alpha) - \alpha]$.
Log-Likelihood:
$$\ln L = \sum_{i=1}^{N} \left( \ln[f(y_i)] - \ln[1 - \Phi(\alpha)] \right)$$

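Truncated Regression Model
$$y_i = \mathbf{x}_i'\boldsymbol{\beta} + \varepsilon_i \quad \text{where} \quad \varepsilon_i \mid \mathbf{x}_i \sim N[0, \sigma^2] \quad \text{and} \quad y_i \mid \mathbf{x}_i \sim N[\mathbf{x}_i'\boldsymbol{\beta}, \sigma^2]$$
$$E[y_i \mid y_i > a] = \mathbf{x}_i'\boldsymbol{\beta} + \sigma\,\frac{\phi[(a - \mathbf{x}_i'\boldsymbol{\beta})/\sigma]}{1 - \Phi[(a - \mathbf{x}_i'\boldsymbol{\beta})/\sigma]} = \mathbf{x}_i'\boldsymbol{\beta} + \sigma\lambda(\alpha_i)$$
where $\alpha_i = (a - \mathbf{x}_i'\boldsymbol{\beta})/\sigma$.
Marginal effects:
$$\frac{\partial E[y_i \mid y_i > a]}{\partial \mathbf{x}_i} = \boldsymbol{\beta} + \sigma\,\frac{d\lambda_i}{d\alpha_i}\,\frac{\partial \alpha_i}{\partial \mathbf{x}_i} = \boldsymbol{\beta}(1 - \delta_i), \qquad 0 < \delta_i < 1$$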
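
The marginal-effect result says that each coefficient is scaled down by the factor (1 - delta_i) in the truncated sample. As a rough numerical check, the sketch below compares that factor with a finite-difference derivative of the conditional mean with respect to the linear index x'beta. The function names and the parameter values are hypothetical.

```python
from statistics import NormalDist

_STD = NormalDist()


def inverse_mills(alpha):
    """lambda(alpha) = phi(alpha) / [1 - Phi(alpha)]."""
    return _STD.pdf(alpha) / (1.0 - _STD.cdf(alpha))


def conditional_mean(xb, sigma, a):
    """E[y | y > a] for a linear index xb = x'beta."""
    alpha = (a - xb) / sigma
    return xb + sigma * inverse_mills(alpha)


def attenuation_factor(xb, sigma, a):
    """1 - delta_i, the multiplier applied to beta in the marginal effect."""
    alpha = (a - xb) / sigma
    lam = inverse_mills(alpha)
    return 1.0 - lam * (lam - alpha)


# Hypothetical values for the linear index, sigma, and truncation point.
xb, sigma, a = 1.0, 2.0, 0.0
h = 1e-5
numeric = (conditional_mean(xb + h, sigma, a)
           - conditional_mean(xb - h, sigma, a)) / (2 * h)
analytic = attenuation_factor(xb, sigma, a)
```

The two values agree to within numerical error, and the factor lies strictly between 0 and 1, so the marginal effect in the truncated sample is smaller in magnitude than the coefficient itself.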

Censored data
• Stochastic Censoring - some observations of a dependent variable y_i are censored
  Example 1: The amount a person is willing to spend on a car is lower than the price of the least expensive car. There will be no purchase, and we do not observe the amount, y_i, they would spend.
  Example 2: Losses greater than a loss limit. If a large loss is recorded at the loss limit, the amount above the limit is not available for analysis.
• Tobit model (Tobin, 1958)

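Censored Normal Distribution (1)
Define a new y transformed from the latent variable y* as
  y = a    if y* ≤ a
  y = y*   if y* > a
Density of a censored random variable:
$$f(y) = [f(y^*)]^{d_i}\,[F(a)]^{1 - d_i}$$
where $d_i = 1$ if the observation is uncensored ($y^* > a$) and $d_i = 0$ otherwise.
Moments:
$$E[y] = a\Phi + (1 - \Phi)(\mu + \sigma\lambda)$$
$$\operatorname{Var}[y] = \sigma^2 (1 - \Phi)\left[(1 - \delta) + (\alpha - \lambda)^2\,\Phi\right]$$
where $\Phi = \Phi(\alpha)$, $\alpha = (a - \mu)/\sigma$, $\lambda = \phi(\alpha)/[1 - \Phi(\alpha)]$, and $\delta = \lambda(\lambda - \alpha)$.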

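Censored Normal Distribution (2)
Log-Likelihood:
$$\ln L = \sum_{i=1}^{N} \left\{ d_i \left[ -\ln\sigma + \ln\phi\!\left(\frac{y_i - \mu}{\sigma}\right) \right] + (1 - d_i)\,\ln\!\left[ 1 - \Phi\!\left(\frac{\mu - a}{\sigma}\right) \right] \right\}$$
Special Case: a = 0
$$E[y] = \Phi\!\left(\frac{\mu}{\sigma}\right)(\mu + \sigma\lambda)$$
where
$$\lambda = \frac{\phi(\mu/\sigma)}{\Phi(\mu/\sigma)} \quad \text{(Inverse Mills Ratio)}$$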

Standard Tobit Model
• Assumptions
  – The underlying disturbances are normally distributed
  – The data generating process that determines the censoring is the same process that determines the outcome variable
  – The dependent variable is censored at zero, i.e.
      y = 0    if y* ≤ 0
      y = y*   if y* > 0

Tobit Model
Expected Values of Possible Interest:
1. Expected value of y*, the latent variable
$$E[y^*] = X_i\beta$$
2. E[y | y > 0] – the truncated model
$$E[y \mid y > 0] = X_i\beta + \sigma\lambda(\alpha)$$
where
$$\lambda = \frac{\phi(X_i\beta/\sigma)}{\Phi(X_i\beta/\sigma)}$$
is the inverse Mills ratio.
3. E[y] – the censored model
$$E[y] = \Phi\!\left(\frac{X_i\beta}{\sigma}\right)\left[X_i\beta + \sigma\lambda(\alpha)\right]$$
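
The three expected values answer different questions, so it can help to compute them side by side for one observation. The sketch below does this and compares the truncated and censored means with a simple simulation; the function name, the seed, and the values of the linear index and sigma are hypothetical.

```python
import random
from statistics import NormalDist

_STD = NormalDist()


def tobit_expectations(xb, sigma):
    """Return (E[y*], E[y | y > 0], E[y]) for a linear index xb = X_i * beta."""
    z = xb / sigma
    lam = _STD.pdf(z) / _STD.cdf(z)   # inverse Mills ratio
    latent = xb
    truncated = xb + sigma * lam
    censored = _STD.cdf(z) * truncated
    return latent, truncated, censored


# Hypothetical linear index and disturbance standard deviation.
xb, sigma = 0.5, 2.0
latent, truncated, censored = tobit_expectations(xb, sigma)

# Simulation check: draw the latent variable, then truncate or censor at zero.
rng = random.Random(0)
draws = [xb + rng.gauss(0, sigma) for _ in range(200_000)]
positives = [d for d in draws if d > 0]
sim_truncated = sum(positives) / len(positives)
sim_censored = sum(max(0.0, d) for d in draws) / len(draws)
```

For any observation the ordering is latent < censored < truncated: censoring raises the mean by replacing negative values with zero, and truncation raises it further by discarding them.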

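Tobit Model Estimation
Log-Likelihood:
$$\ln L = \sum_{i=1}^{N} \left\{ d_i \left[ -\ln\sigma + \ln\phi\!\left(\frac{y_i - \mathbf{x}_i'\boldsymbol{\beta}}{\sigma}\right) \right] + (1 - d_i)\,\ln\!\left[ 1 - \Phi\!\left(\frac{\mathbf{x}_i'\boldsymbol{\beta}}{\sigma}\right) \right] \right\}$$
$$\ln L = \sum_{y_i > 0} -\frac{1}{2}\left[ \ln(2\pi) + \ln\sigma^2 + \frac{(y_i - \mathbf{x}_i'\boldsymbol{\beta})^2}{\sigma^2} \right] + \sum_{y_i = 0} \ln\!\left[ 1 - \Phi\!\left(\frac{\mathbf{x}_i'\boldsymbol{\beta}}{\sigma}\right) \right]$$
Why not use OLS?
• E[y] is non-linear
• OLS estimates of β are inconsistent
• OLS parameters are approximately proportional to Tobit parameters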
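
To see the OLS problem concretely, the sketch below simulates data censored at zero, evaluates the Tobit log-likelihood in its split-sum form, and fits OLS to the censored outcomes. Everything here is an assumption made for illustration: a single normally distributed regressor, the parameter values, the seed, and the helper names. It does not maximize the likelihood; it only compares the likelihood at the true parameters with the likelihood at the OLS coefficients.

```python
import math
import random
from statistics import NormalDist

_STD = NormalDist()


def tobit_loglik(b0, b1, sigma, xs, ys):
    """Tobit log-likelihood (censoring at zero) for a one-regressor model."""
    total = 0.0
    for x, y in zip(xs, ys):
        xb = b0 + b1 * x
        if y > 0:   # uncensored term: log normal density of y given xb
            total += -0.5 * (math.log(2 * math.pi) + math.log(sigma ** 2)
                             + (y - xb) ** 2 / sigma ** 2)
        else:       # censored term: 1 - Phi(xb/sigma), written as Phi(-xb/sigma)
            total += math.log(_STD.cdf(-xb / sigma))
    return total


def ols_fit(xs, ys):
    """Closed-form simple linear regression; returns (intercept, slope)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return my - slope * mx, slope


# Hypothetical data generating process.
TRUE_B0, TRUE_B1, TRUE_SIGMA = 0.5, 1.0, 1.0
rng = random.Random(2011)
xs = [rng.gauss(0, 1) for _ in range(20_000)]
ys = [max(0.0, TRUE_B0 + TRUE_B1 * x + rng.gauss(0, TRUE_SIGMA)) for x in xs]

ols_b0, ols_b1 = ols_fit(xs, ys)
share_uncensored = sum(y > 0 for y in ys) / len(ys)
ll_true = tobit_loglik(TRUE_B0, TRUE_B1, TRUE_SIGMA, xs, ys)
ll_ols = tobit_loglik(ols_b0, ols_b1, TRUE_SIGMA, xs, ys)
```

In this setup the OLS slope is pulled toward zero, landing close to the true slope times the share of uncensored observations, and the Tobit log-likelihood is higher at the true parameters than at the OLS coefficients. That proportionality is tied to the normal regressor assumed here and should be read as an approximation in general.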

Incidental Truncation
• We do not observe y due to the effect of one or more other variables
• A non-random selection process
  – Examples:
    Wage offers are observed only for those who work. Workforce participation may be affected by some unobserved variables which also affect the wage offer.
    Audit results are observed only for audited policies. The decision to audit specific policies is influenced by other variables, some observed and some not, which can also affect the audit results.

Incidental Truncation Distribution
Random variables y and z have a bivariate distribution with correlation ρ.
Incidentally truncated joint density of y and z:
$$f(y, z \mid z > a) = \frac{f(y, z)}{\operatorname{Prob}(z > a)}$$
Moments of the incidentally truncated bivariate normal distribution:
$$E[y \mid z > a] = \mu_y + \rho\,\sigma_y\,\lambda(\alpha_z), \qquad \operatorname{Var}[y \mid z > a] = \sigma_y^2\,[1 - \rho^2\,\delta(\alpha_z)]$$
where $\alpha_z = (a - \mu_z)/\sigma_z$, $\lambda(\alpha_z) = \phi(\alpha_z)/[1 - \Phi(\alpha_z)]$, and $\delta(\alpha_z) = \lambda(\alpha_z)[\lambda(\alpha_z) - \alpha_z]$.
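
A short simulation can show how selecting on z shifts the mean of y whenever the two are correlated. The sketch below draws correlated normal pairs, keeps y only when z exceeds the threshold, and compares the sample mean with the formula for E[y | z > a]. The parameter values, the seed, and the function name are hypothetical.

```python
import math
import random
from statistics import NormalDist

_STD = NormalDist()


def selected_mean(mu_y, sigma_y, rho, mu_z, sigma_z, a):
    """E[y | z > a] for bivariate normal (y, z) with correlation rho."""
    alpha_z = (a - mu_z) / sigma_z
    lam = _STD.pdf(alpha_z) / (1.0 - _STD.cdf(alpha_z))
    return mu_y + rho * sigma_y * lam


# Hypothetical population parameters and selection threshold.
MU_Y, SIGMA_Y, RHO = 0.0, 1.0, 0.6
MU_Z, SIGMA_Z, A = 0.0, 1.0, 0.5

rng = random.Random(2011)
observed = []
for _ in range(200_000):
    u, v = rng.gauss(0, 1), rng.gauss(0, 1)
    z = MU_Z + SIGMA_Z * u
    # Build y so that corr(y, z) = RHO.
    y = MU_Y + SIGMA_Y * (RHO * u + math.sqrt(1 - RHO ** 2) * v)
    if z > A:   # y is only observed for selected cases
        observed.append(y)

simulated = sum(observed) / len(observed)
theoretical = selected_mean(MU_Y, SIGMA_Y, RHO, MU_Z, SIGMA_Z, A)
no_selection_effect = selected_mean(MU_Y, SIGMA_Y, 0.0, MU_Z, SIGMA_Z, A)
```

With a positive correlation the selected sample overstates the population mean of y. When rho is zero the correction term vanishes and selection on z leaves the mean of y unchanged.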
