Session 5 A brief introduction to Predictive Modeling Lichen Bao, - - PDF document

session 5 a brief introduction to predictive modeling
SMART_READER_LITE
LIVE PREVIEW

Session 5 A brief introduction to Predictive Modeling Lichen Bao, - - PDF document

SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 | Kuala Lumpur, Malaysia Session 5 A brief introduction to Predictive Modeling Lichen Bao, Ph.D A Brief Introduction to Predictive Modeling LICHEN BAO Data Scientist, RGA Reinsurance


slide-1
SLIDE 1

SOA Predictive Analytics Seminar – Malaysia

27 Aug. 2018 | Kuala Lumpur, Malaysia

Session 5 A brief introduction to Predictive Modeling

Lichen Bao, Ph.D

slide-2
SLIDE 2

A Brief Introduction to Predictive Modeling

LICHEN BAO

Data Scientist, RGA Reinsurance Company

August 27, 2018

slide-3
SLIDE 3

Agenda

  • Overview of Predictive Modeling (PM)
  • A Case Study
  • PM for Actuaries
slide-4
SLIDE 4

Overview of Predictive Modeling (PM)

slide-5
SLIDE 5

4

Data

High quality data

Modeling

Statistical model

Prediction

Business decisions

Modeling

Statistical model

What is Predictive Modeling?

Modeling covers the statistics models and algorithms.

slide-6
SLIDE 6

5

 Linear regression model

  • Y target/response variable; Xi explanatory/predictor variable
  • βi parameters to be estimated
  • ε error term/noise

 Underlying Assumptions for a Valid LM

  • Normality, 𝜁 ~ N(0,σ2)
  • Linearity; Homogeneity-Y for population; Fixed X, error-free;

Observation independence

Review of Predictive Modeling

Linear regression and OLS may sound familiar …

slide-7
SLIDE 7

6

Review of Predictive Modeling

Linear regression and OLS may sound familiar …

 Ordinary Least Squares(OLS)

  • For a simple regression

 Identical to Maximum likelihood estimator

  • More robust and consistent approach

 Use adj R2 to compare fitness of models

1 = 𝑆𝑇𝑇

𝑈𝑇𝑇 + 𝐹𝑇𝑇 𝑈𝑇𝑇

Define 𝑆2 = 𝑆𝑇𝑇

𝑈𝑇𝑇 = 1 − 𝐹𝑇𝑇 𝑈𝑇𝑇 = 𝑗(𝑍𝑗− 𝑍𝑗)2 𝑗(𝑍𝑗− 𝑍)2 , but it is biased

Adjusted 𝑆2 = 1 − 𝐹𝑇𝑇

𝑈𝑇𝑇 ∗ 𝑜−1 𝑜−𝑙 = 1 − (1−𝑆2)∗ 𝑜−1 𝑜−𝑙

β = 𝑏𝑠𝑕 min 𝑆𝑇𝑇 = 𝑏𝑠𝑕 min 𝑗( 𝑧𝑗 − 𝑧𝑗)2 = 𝑏𝑠𝑕 min 𝑗(𝑘𝛾𝑘𝑌𝑗𝑘 − 𝑧𝑗)2 β1 = (𝑦𝑗𝑧𝑗 − 1

𝑜𝑦𝑗𝑧𝑗) (𝑦𝑗

2 − 1

𝑜(𝑦𝑗)2),

β0 = 𝑧 − β1 𝑦 β = 𝑏𝑠𝑕 m𝑏𝑦 𝑀(𝑌, 𝑍, 𝛾) = 𝑏𝑠𝑕 min −ln(𝑀 𝑌, 𝑍, 𝛾 ) = 𝑏𝑠𝑕 min 𝑗(𝑧𝑗 − 𝑧𝑗(𝜈𝑗))2

if normal distribution

  • portion that has been explained by OLS model
  • portion of TSS for the error
slide-8
SLIDE 8

7

Review of Predictive Modeling

We barely see any real application of OLS in life insurance because of the constraints.

Features of OLS

Validation of assumptions - Normal w/ constant σ2 Non-linear relationship,

  • esp. for extrapolation

Unbounded data, non- negative value

Applications in Insurance

Binomial for rate (mortality/lapse/UW, etc.), σ2 ~ r(1-r) Poisson for claim count, ~ mean Gamma for claim amount, ~ mean2

×

Unmatched

slide-9
SLIDE 9

8

Generalized Linear Model (GLM)

GLM is extensively used in insurance industry.

Major focus of PM in insurance industry Includes most distributions related to insurance Great flexibility in variance structure OLS model is a special case of GLM (Relatively) Easy to understand and communicate Multiplicative model intuitive & consistent with insurance practice

slide-10
SLIDE 10

9

Generalized Linear Model (GLM)

GLM is extensively used in insurance industry.

Random component Systematic component Link function

slide-11
SLIDE 11

10

 Random component Observations Y1, . . . , Yn are independent w/ density from the exponential family From maximum likelihood theory,  Each distribution is specified in terms of mean & variance  Variance is a function of mean 𝑔

𝑗 𝑧𝑗; 𝜄𝑗,  = 𝑓𝑦𝑞 𝑧𝑗𝜄𝑗 − 𝑐(𝜄𝑗)

𝑏𝑗() + 𝑑(𝑧𝑗, ) 𝐹 𝑍 = 𝜈 = 𝑐′ 𝜄 , 𝑤𝑏𝑠 𝑍 = 𝑐′′ 𝜄 𝑏  = 𝑏  𝑊(𝜈)

Norm

  • rmal

al Poiss

  • isson

Bin inomial Gam amma InverseGauss ssian Name 𝑂(𝜈, 2) 𝑄(𝜈) 𝐶(𝑛, 𝜌) 𝑛 𝐻(𝜈, ) 𝐽𝐻(𝜈, 2) Range (-,+) (0,+) (0,1) (0,+) (0,+) b(𝜄) 2 e ln(1+e) − ln −𝜄 −(−2𝜄)1/2 𝜈(𝜄) 𝜄 e e/(1+e) − 1/ 𝜄 (−2𝜄)−1/2 𝑊(𝜈) 1 𝜈 𝜈(1 − 𝜈) 𝜈2 𝜈3

Generalized Linear Model (GLM)

GLM is extensively used in insurance industry.

slide-12
SLIDE 12

11

 Systematic component A linear predictor 𝑗 = 𝑘 𝑦𝑗𝑘𝛾𝑘 = 𝑌𝛾 for observation i  link function 𝑗 = 𝑕(𝜈𝑗), random & systematic are connected by a smooth & invertible function Log is unique in insurance application s.t. all parameters are multiplicative

  • 𝑧 = exp( 𝑘 𝑦𝑗𝑘𝛾𝑘) = 𝑘 exp 𝑦𝑗𝑘𝛾𝑘 = 𝑘 exp 𝛾𝑘

𝑦𝑗𝑘 = 𝑘 𝑔 𝑘 𝑦𝑗𝑘

  • Consistent with most insurance practices
  • Intuitively easy to understand and communicate

Ide dentity Log Log Log Logit Rec eciprocal 𝑕(𝜈𝑗) 𝑦 ln(𝑦) ln( 𝑦 1 − 𝑦) 1/𝑦 𝑕−1(𝑗) 𝑦 𝑓𝑦

𝑓𝑦 1+𝑓𝑦

1/𝑦

Generalized Linear Model (GLM)

GLM is extensively used in insurance industry.

slide-13
SLIDE 13

12

 Inclusion of most distributions related to insurance data

  • Normal, binomial, Poisson, Gamma, inverse-Gaussian, Tweedie

Generalized Linear Model (GLM)

GLM is extensively used in insurance industry.

Random Systematic Link OLS Normal only 𝑗 =

𝑘

𝑦𝑗𝑘𝛾𝑘 𝐹 𝑧𝑗 = 𝑗 GLM Various distribution 𝑕 𝐹(𝑧𝑗) = 𝑗

 Comparison with OLS

Link function Application sample Normal General Application Poisson Claim frequency, counts Bernoulli Retention, cross-sell, underwriting rates Negative Binomial Claim severity Gamma Claim severity Tweedie Claim cost Inverse Gaussian Claim severity

slide-14
SLIDE 14

An Inventory of the Methods

Machine Learning & Statistical Techniques Ensemble method Random Forest Gradient Boosting Survey Data Analysis Genetic Algorithms Markov chain Monte Carlo Optimization Methods Sentiment Analysis Support vector machine Neural Networks / Deep learning Ada Boosting XG-boost machine Feature engineering Classification/Association Bayesian Analysis Mixed Models Analysis of Variance Multivariate Analysis Categorical Data Analysis Cluster Analysis Survival Analysis Decision Trees Non-Parametric Analysis Text mining There are plenty of statistical modeling methods out there.

slide-15
SLIDE 15

14

Predictive Modeling by Classes

There are different terminologies regarding predictive modeling.

Supervised vs. Unsupervised Learning

  • Supervised: estimate

expected value of Y given values of X. GLM, Cox, CART, MARS, Random Forests, SVM, NN, etc.

  • Unsupervised: find

interesting patterns amongst X; no target variable Y Clustering, Correlation / Principal Components / Factor Analysis Classification vs. Regression

  • Classification: to

segment observations into 2 or more

  • categories. Fraud vs.

legitimate, lapsed vs. retained, UW class

  • Regression: to predict

a continuous amount. Dollars of loss for a policy, ultimate size of claim Parametric vs. Non- Parametric

  • Parametric Statistics:

probabilistic model of data Poisson Regression(claims count), Gamma (claim amount)

  • Non-Parametric

Statistics: no probability model specified Classification trees, NN

slide-16
SLIDE 16

15

Choosing the Right Method

There is always the trade-off between interpretability and flexibility.

GLM Models

  • Logistic Regression
  • Poisson Regression

Interpretability Flexibility

This is just a sample of many algorithms available

Trade-Off Between Interpretability and Flexibility

Random Forest Gradient Boosted Trees Decision Trees

Often referred to as simple, transparent models Often referred to as “machine learning”, black-box models

slide-17
SLIDE 17

16

Choosing the Right Method

There is always the trade-off between interpretability and flexibility.

Interpretability “Transparent” Algorithms Flexibility “Black-box” Algorithms More human intervention Less human intervention More interpretable Less interpretable Require less data Require more data Faster to estimate a model Slower to estimate a model Good at handling smooth effects (e.g., age, income, etc.) Not good at handling smooth effects (e.g., age, income, etc.) The model we choose might not be a good match to reality, resulting in poor predictions. Higher predictive accuracy because functional form is derived from the data, not assumed. Less likely to overfit the data More likely to overfit the data

slide-18
SLIDE 18

17

Choosing the Right Method

Choosing the right algorithm is a combination of statistical and business considerations.

Business Considerations

  • Experience

Some business problems are well-defined and are historically modeled a specific way successfully. Example: Poisson Regression for Experience Studies

  • Know your audience

The successful business implementation of a model may require buy-in from many different groups throughout an organization. Model interpretability may be critical, particularly for analyzing experience study data.

  • Technical Implementation

Sometimes the increased accuracy in more complex models doesn’t warrant the additional technical difficulties.

Statistical Considerations

  • Dependent Variable

Knowing whether the dependent variable is available (or not), if available whether its continuous, binary, or a count helps us narrow down the appropriate algorithm.

  • Amount of Data

Powerful algorithms (e.g., random forest) require more data to work well.

  • Model Validation

Data Scientists build many models, and pick the champion model based on which model predicts new data the best (e.g., higher accuracy)

slide-19
SLIDE 19

A Case Study

slide-20
SLIDE 20

Level

  • f client

demand

Hig igh Low Medium Pre-sale Und nderw rwri riti ting In In-force man anagement Cl Claims

New rating factors Multivariate analysis Cross- sell/upsell Fraud/non- disclosure Preferred risk selection Predictive underwriting Propensity to apply & triggers Distributor quality control Propensity to complete purchase Underwriting triage Determine underwriting ratings Proactive lapse management Claims triage Customer lifetime value Competitive pricing strategy

Predictive Modeling on Value Chain

As long as there is data, there is potential to capitalize on it by predictive modeling.

slide-21
SLIDE 21

20

Set

  • bjective

Process data Fit a model Interpret model & implement Monitor & Update

What to achieve by PM Same as traditional one: understand business & data; clean & process data

  • Select proper

model for target variable; choose explanatory variables; determine if cross- terms are needed; assess model

  • Validate the

model Understand model (e.g. A/E); extract business insights; implement in business process Monitor the performance and update when necessary

Process of Predictive Modeling Projects

slide-22
SLIDE 22

The client would like to conduct multivariate experience study for their CI products, where predictive model is built and tested to better understand risk and explore for additional business insights.

  • To use advanced statistical model to better

understand risk over conventional method

  • To identify true risk drivers and their effects
  • To gain additional business insights

Objectives

  • In traditional experience study format, covering
  • 7 calendar years
  • 10 products
  • About 30 variables available for modeling
  • For ex. Policy Info, Demographic Info and Sales

Person Info

Data

  • 20 statistical significant variables in the built model
  • Income, Occupation, Marital Status, BMI

Modeling & Lift Plot

Multivariate Experience Study

slide-23
SLIDE 23

The client would like to conduct multivariate experience study for their CI products, where predictive model are built and tested to better understand risk and explore for additional business insights.

Multivariate Experience Study

  • Benchmark the traditional pricing methods
  • Could introduce new factors previously not in the

pricing basis, such as BMI, etc. Pricing

  • Products of simpler underwriting process could be

provided for customers with relatively lower risk selected by model

  • Better use the data, and might ensure previously

uninsurable, e.g., particularly to some disease Product Redefinition

slide-24
SLIDE 24

Business Goals Data Environment

  • Objective to support the profitable growth of your business
  • Resources available & strong support from senior executives
  • Sufficient quantity & high quality of data to support modeling
  • Satisfactory data depth & width
  • Able & willing to obtain data, knowledge to understand & clean data
  • Local regulatory environment & privacy laws will allow such models
  • Distribution channel can support data-driven analytic solution

Predictive Modeling Considerations

The success of a PM project needs considering many factors.

slide-25
SLIDE 25

PM for Actuaries

slide-26
SLIDE 26

25

Predictive Modeling to Actuaries

Advantages

  • Industry

knowledge - domain knowledge is a key in modeling process

  • Expertise in data

process - data is always #1 issue in data-driven application

  • Unique position in

data analytics Opportunities

  • Solid foundation in

statistics

  • Education

experience in modeling (OLS)

  • Need to pick up

new skills & thinking by education, training, and experience Outlook

  • Data analytics is

here to stay; it is changing insurance industry, and will fundamentally change how we run insurance business

  • Actuaries could

and should be on top of it and lead the change Actuaries are good candidates for predictive modeling practitioners.

slide-27
SLIDE 27

26

Predictive Modeling to Actuaries

Actuaries could master predictive modeling through different study activities. Refresh yourself with the basics of modeling Learn a modeling application / language & practice with examples Attend seminar, conference, training program, etc. Link your new skills with your job & practice if possible

slide-28
SLIDE 28

Thank You.

slide-29
SLIDE 29