Decision Trees: an application to LTC Insurance
Thibault ANTOINE Head of Critical Illness R&D Centre Guillaume BIESSY P.h.D. candidate – LTC & Disability R&D Centre
Decision Trees: an application to LTC Insurance Thibault ANTOINE - - PowerPoint PPT Presentation
Decision Trees: an application to LTC Insurance Thibault ANTOINE Head of Critical Illness R&D Centre Guillaume BIESSY P.h.D. candidate LTC & Disability R&D Centre SCOR Global Life Contents 1 Introduction 2 Decision Trees 3
Thibault ANTOINE Head of Critical Illness R&D Centre Guillaume BIESSY P.h.D. candidate – LTC & Disability R&D Centre
1
Introduction
2
Decision Trees
3
Portfolio study : Example 1
4
Portfolio study : Example 2
Contents April 2016
3
Introduction
Some examples of machine learning methods:
Those methods rely on letting the algorithm learn the structure of data instead of forcing a model. Each method has its own advantages and drawbacks
4
Introduction
Figure 1 : Overview of strengths and weaknesses for several methods. Source : Conference on Actuarial & Data Science, SCOR, November 2015.
Algorithms High number of variables Dealing with missing Data Threshold effects No distribution hypothesis No Global hierarchy between variables Predictive power Interpretation
Association Rules Sparse Regressions SVM Decision Trees Random Forests Neural Networks & Deep Learning K-nearest neighbors
Very good Good Average Poor
5
Introduction
Popular belief : machine learning is a set of obscure methods which give a slight gain of accuracy but
When to use machine learning ?
6
Introduction
Good news : in LTC we have several covariates and limited knowledge about their effect as LTC is a complex risk. This is favorable to the use of machine learning. Bad news 1 : the amount of data is limited. Bad news 2 : in LTC we work with survival data which are censored. Machine learning methods cannot be applied directly.
7
Introduction
Why using machine learning methods in LTC Insurance ?
Concrete examples :
gender, amount of premium, level of underwriting, substandard risk) : impact of underwriting, adverse selection effects, segmentation
1
Introduction
2
Decision Trees
3
Portfolio study : Example 1
4
Portfolio study : Example 2
Contents April 2016
9
Decision Trees
Why decision trees ?
Figure 2 : Example of a decision tree. The variable of interest is the life expectancy in the disabled state.
10
Decision Trees
Two families of decision trees :
dependency. To obtain a decision tree, there are two main steps :
Classification and regression trees use different methods for both steps.
11
Decision Trees – Build the Tree
For this step, a criterion to create new branches is required.
entropy and the twoing function. Other can be defined, they just need to respect a few criteria (symmetry, unique maximum and minimum) A new branch is created if it reduces the global impurity of the tree, i.e. if the impurity of both leaves is lower than the impurity of the initial leaf.
group.
12
Decision Trees – Pruning
At this step, we want to reduce the tree to avoid any overfitting effect. There are several ways to do this :
each sample and take the mean error. The choice of the final tree aim at minimizing either :
which minimizes the global mean error. This approach introduces an extra safety margin and results in even smaller trees where each leaf is significant.
1
Introduction
2
Decision Trees
3
Portfolio study : Example 1
4
Portfolio study : Example 2
Contents April 2016
14
Portfolio study : Example 1 – Portfolio of annuitants
Available covariates :
Characteristics of portfolio
DISCLAIMER: For confidentiality purpose the scale of the figures has been changed The figures have been obtained on a subset of our portfolios The definitions used are not “standard”
15
Portfolio study : Example 1 – Data representation
16
Portfolio study : Example 1 – Tree
Time spent in LTC Nb of insured in the cluster Nb of Deaths
17
Portfolio study : Example 1 – Full Tree
18
Portfolio study : Example 1 – Pruned Tree
19
Portfolio study : Example 1 – Over pruned Tree
1
Introduction
2
Decision Trees
3
Portfolio study : Example 1
4
Portfolio study : Example 2
Contents April 2016
21
Portfolio study : Example 2 – Portfolio of annuitants
Available covariates :
Characteristics of the portfolio
DISCLAIMER: For confidentiality purpose the scale of the figures has been changed The figures have been obtained on a subset of our portfolios The definitions used are not “standard”
22
Portfolio study : Example 2– Data representation
23
Portfolio study : Example 2– Full Tree on pathologies
24
Portfolio study : Example 2– Pruned on pathologies
25
Portfolio study : Example 2– Full Tree All variables
26
Portfolio study : Example 2– Pruned Tree All variables
27
Conclusion
At least, decision trees are an interesting tool for the actuary. Results are easy to share, especially with non-actuaries. Application to those methods extends to other risk, among them disability. Disability offers a better context to apply those methods : more data, more covariates, limited benefit period