Modelling micro-level insurance claim counts using Markov-modulated - - PowerPoint PPT Presentation

modelling micro level insurance claim counts using markov
SMART_READER_LITE
LIVE PREVIEW

Modelling micro-level insurance claim counts using Markov-modulated - - PowerPoint PPT Presentation

Alan Xian, UNSW Sydney Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes 4th European Actuarial Journal Conference, 2018 Benjamin Avanzi, Greg Taylor, Bernard Wong and Alan Xian 11 September


slide-1
SLIDE 1

Alan Xian, UNSW Sydney

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

4th European Actuarial Journal Conference, 2018 Benjamin Avanzi, Greg Taylor, Bernard Wong and Alan Xian 11 September 2018

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-2
SLIDE 2

Alan Xian, UNSW Sydney Introduction

The Big Question

How do you model something

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-3
SLIDE 3

Alan Xian, UNSW Sydney Introduction

The Big Question

How do you model something that you can’t actually model?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-4
SLIDE 4

Alan Xian, UNSW Sydney Introduction

The Big Question

How do you model something that you can’t actually model?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-5
SLIDE 5

Alan Xian, UNSW Sydney Introduction

Presentation Outline

1 Introduction to insurance claims analysis

Why is accuracy important? What improvements can be made to current methods?

2 Markov-modulated non-homogeneous Poisson processes

(MMNPP)

How can we use MMNPPs for claims analysis? How do we calibrate the model?

3 Simulation Case Study

Is the calibration accurate?

4 Empirical Case Study

Does it work on real insurance data? What insights can we gain from the model?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-6
SLIDE 6

Alan Xian, UNSW Sydney Introduction

Why is it important to get it right?

Insights from insurance claims analysis are used as inputs for:

1 Reserving and Capital Management 2 Premium Liability Estimation 3 Pricing and Rate-making 4 Claims Management 5 Business Strategy Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-7
SLIDE 7

Alan Xian, UNSW Sydney Introduction

How is it (usually) done in practice?

Key step: Claims are aggregated and discretised into triangles (for example, by accident year i and development year j)

Figure: Aggregate Claims Modelling: Key Step

Why is this approach so prevalent? Computationally cheap, flexible and tractable.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-8
SLIDE 8

Alan Xian, UNSW Sydney Introduction

Macro-modelling Problems

However, there are issues with this approach documented in the literature: Problems resulting from small sample sizes (Renshaw [1994], Verdonck, Wouwe, and Dhaene [2009]) Problems with underlying processes and assumptions (Halliwell [2007], Taylor [2011], Taylor and McGuire [2004]) Problems with practical implementation (Kunkler [2004], Liu and Verrall [2009], Parodi [2014]) However, the key concern is the potentially material information is unnecessarily discarded due “excessive" aggregation/discretisation.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-9
SLIDE 9

Alan Xian, UNSW Sydney Introduction

A relationship between motor claims and rainfall?

The Pearson correlation between the two lines shown below is a statistically significant 32%.

  • 30
  • 20
  • 10

10 20 30 40 50

1-Oct-10 16-Oct-10 31-Oct-10 15-Nov-10 30-Nov-10 15-Dec-10 30-Dec-10

2 4 6 8 10 12

Rainfall (mm per day) 3-Day Moving Average of Claim Counts (t,t+1,t+2) Rainfall

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-10
SLIDE 10

Alan Xian, UNSW Sydney Introduction

How do we model things that are hard (or materially infeasible) to model?

In our rainfall example, the impact on the granular daily claim frequencies was overdispersion and persistence. Our main idea: Model what information/drivers you can and proxy the impact of the rest!

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-11
SLIDE 11

Alan Xian, UNSW Sydney Introduction

Micro-level claims analysis - A delicate balancing act

Insurance data is investigated at a greater level of detail so that more information can be extracted. However, there are some important trade-offs to consider:

1 Prediction Accuracy 2 Model Complexity 3 Interpretability 4 Modellability 5 Computational Feasibility 6 Materiality

Is there a micro-level model for managing all of these elements?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-12
SLIDE 12

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

A solution! - Markov-modulated Poisson processes

Siméon méon Denis is Poi

  • isson

sson

Figure: A Markov-modulated Poisson process (MMPP)

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-13
SLIDE 13

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Or perhaps more formally...

Figure: The “Formal” Definition for a MMPP

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-14
SLIDE 14

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Where are these processes used?

These processes are commonly used to fit clustered or "bursty" processes where there are hidden components/drivers. Fields that use these models include

1 Natural sciences (Lu [2012], Thayakaran and Ramesh

[2013a], Thayakaran and Ramesh [2013b], Langrock, Borchers, and Skaug [2013])

2 Signals and telecommunications (Scott and Smyth [2003],

Pan, Rao, Agarwal, and Gelfand [2016])

3 Finance and economics (Nasr and Maddah [2015]) 4 Queueing, inventory and reliability theory (Arts [2017],

Landon, Özekici, and Soyer [2013])

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-15
SLIDE 15

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

MMPPs in actuarial literature - relatively undeveloped

There are few papers applying MMPPs to insurance claim analysis:

1 Guillou, Loisel, and Stupfler [2013]: Applied an MMPP model

to insurance data incorporating a claim severity model

2 Guillou, Loisel, and Stupfler [2015]: Extended the claims

count model component to allow for seasonality Why is the literature underdeveloped? We think that there are three barriers to practical implementation that need to be addressed first...

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-16
SLIDE 16

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Domestic motor claim numbers over time

1 2 3 4 5 6

Years

Daily claim counts over time Figure:

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-17
SLIDE 17

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Domestic motor claim numbers over time

1 2 3 4 5 6

Years

Daily claim counts over time Figure:

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-18
SLIDE 18

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Domestic motor claim frequencies and number of policyholders over time

1 2 3 4 5 6

Years

Daily claim counts and number of policyholders

  • ver time

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-19
SLIDE 19

Alan Xian, UNSW Sydney Markov-modulated Poisson processes

Barriers to practical implementation

What extensions are required for the MMPP micro-level model to be realistic and practicably useful?

1 Flexible risk exposure 1

Periodic (seasonality)

2

Non-periodic (number of policyholders, structural changes)

2 Numerical stability during the calibration process 3 Reasonable computation times for large insurance data sets Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-20
SLIDE 20

Alan Xian, UNSW Sydney Markov-modulated non-homogeneous Poisson processes

Our Solution: a Markov-modulated non-homogeneous Poisson process!

Figure: A Markov-modulated non-homogeneous Poisson process

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-21
SLIDE 21

Alan Xian, UNSW Sydney Markov-modulated non-homogeneous Poisson processes

Questions to answer

1 How do we define the MMNPP model?

i.e. How do we introduce our flexible exposure measure?

2 How do we calibrate this new model? 1

What about issues of numerical instability?

2

Is it computationally feasible for large insurance data sets?

3 How would this model be implemented in practice? 4 Does it work for real-world insurance data? Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-22
SLIDE 22

Alan Xian, UNSW Sydney Question 1: How do we define the MMNPP model?

Question 1: How do we define the MMNPP model?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-23
SLIDE 23

Alan Xian, UNSW Sydney Markov-modulated non-homogeneous Poisson processes

Model Notation

1 γ❼t➁ is a general exposure measure (what we can model) 2 M❼t➁ is the Hidden Markov chain (what we can’t model) 3 Y ❼t➁ is the conditional Poisson process for claim arrivals Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-24
SLIDE 24

Alan Xian, UNSW Sydney Markov-modulated non-homogeneous Poisson processes

Again, a bit more formally...

Figure: The “Formal” Definition for a MMNPP

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-25
SLIDE 25

Alan Xian, UNSW Sydney Question 2: How do we calibrate this model?

Question 2: How do we calibrate this model?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-26
SLIDE 26

Alan Xian, UNSW Sydney The adapted EM Algorithm

Theoretical contributions - A new EM algorithm

We derived a new EM algorithm, adapting results from Rydén [1996] and Roberts, Ephraim, and Dieguez [2006]. This algorithm

1 Resolved issues of numerical stability of large data sets,

allowing the model to be implemented in standard software such as R or MATLAB.

2 Implemented several computational improvements/shortcuts

that drastically reduced calibration times.

3 Allowed for easy extraction of several statistical quantities

  • f interest (more on this later...)

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-27
SLIDE 27

Alan Xian, UNSW Sydney Question 3: How would this model be implemented in practice?

Question 3: How would this model be implemented in practice?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-28
SLIDE 28

Alan Xian, UNSW Sydney MMNPP Model implementation

Toy example - Raw observed claim frequencies

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-29
SLIDE 29

Alan Xian, UNSW Sydney MMNPP Model implementation

Toy example - After accounting for risk exposure relativities

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-30
SLIDE 30

Alan Xian, UNSW Sydney MMNPP Model implementation

Toy example - After fitting the MMNPP model

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-31
SLIDE 31

Alan Xian, UNSW Sydney MMNPP Model implementation

Toy example - Final Model Outputs

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-32
SLIDE 32

Alan Xian, UNSW Sydney Question 4: Does it work?

Question 4: Does it work?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-33
SLIDE 33

Alan Xian, UNSW Sydney Simulation case study

A simulation case study

Each period was set to be 100 time units. Period Poisson λ Exposure "Base" λ "True State" 1 0.5 1 0.5 1 2 1 1 1 2 3 1 2 0.5 1 4 2 1 2 3 5 10 10 1 2 6 10 20 0.5 1 7 10 5 2 3 8 0.75 1.5 0.5 1 9 0.1 0.1 1 2 10 0.1 0.05 2 3

Table: Simulated data set parameters

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-34
SLIDE 34

Alan Xian, UNSW Sydney Simulation case study

Simulation calibration results

Regime True Poisson Intensity Calibrated Poisson Intensity 1 0.5 0.482 2 1 1.013 3 2 1.965

Table: True intensities versus calibrated intensities

We can also extract the probability of being in each state/regime at each claim arrival time.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-35
SLIDE 35

Alan Xian, UNSW Sydney Simulation case study

State/Regime filtering

Claim arrival times Most likely regime 100 200 300 400 500 600 700 800 900 1000 1 2 3

Figure: Plot of the most likely regime at each claim arrival time

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-36
SLIDE 36

Alan Xian, UNSW Sydney Does it work on actual insurance data?

Does it work on actual insurance data?

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-37
SLIDE 37

Alan Xian, UNSW Sydney Real insurance case study

The AUSI data set

The Allianz, University of New South Wales, Suncorp and Insurance Australia Group (AUSI) dataset is supported by the Australian Research Council and several major industry partners. It consists of 4 lines of business

Home Motor Public Liability Compulsory Third Party

Daily records Policy and Claim Characteristics In the following, we will be looking at the Domestic Motor LoB.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-38
SLIDE 38

Alan Xian, UNSW Sydney Real insurance case study

What exposure measures should we adjust for?

An over-dispersed Poisson GLM was fit to daily claims data over a period of 6 years. Covariates were tested for statistical significance and where appropriate, bucketed for parsimony. The final model used the following features:

1 Number of policies in force 2 Various forms of seasonality: 1

Weekday/Weekend

2

Public Holiday

3

Month

4

Day of Month

3 Days since start of the period of investigation Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-39
SLIDE 39

Alan Xian, UNSW Sydney Real insurance case study

Do states/regimes actually exist in the data?

Some statistical results seem to indicate that even after consideration of the factors on the previous slide, regimes still exist. The dispersion parameter for the ODP GLM was 3.16, indicating large over-dispersion. A runs test was applied to the adjusted frequencies, indicating persistence. An order selection methodology based on white-noise residual testing was applied, and the resulting optimal order was 4.

If regimes did not exist, the optimal order should be 1.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-40
SLIDE 40

Alan Xian, UNSW Sydney Real insurance case study

Final results - estimates and E-step estimators

The calibrated regime transition and claim intensities are Q ❁ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❃ ✏0.39 0.09 0.28 0.02 0.00 ✏0.04 0.04 0.00 0.06 0.40 ✏0.46 0.00 1.00 0.00 0.00 ✏1.00 ❂ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❄ , λ ❼135,177,204,518➁. We can also extract some other information from the model Quantity Estimator Number of changes to each state ❁ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❃ ✏20 7 12 1 4 ✏79 75 14 73 ✏87 1 ✏1 ❂ ❆ ❆ ❆ ❆ ❆ ❆ ❆ ❄ Proportion of time in each state (2.3%, 88.9%, 8.7%, 0.0%) Proportion of claims in each state (1.7%, 88.5%, 9.8%, 0.0%)

Table: Other statistical quantities of interest

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-41
SLIDE 41

Alan Xian, UNSW Sydney Real insurance case study

Final results - Regime filtering

Claim arrival times (days) Most likely regime

200 400 600 800 1000 1200 1400 1600 1800 2000 2200 1 2 3 4

Figure: Plot of the regime with the highest probability per day

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-42
SLIDE 42

Alan Xian, UNSW Sydney Conclusion

Conclusion

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-43
SLIDE 43

Alan Xian, UNSW Sydney Conclusion

Summary

1 Can improvements can be made to current claims analysis

methods? ✟

We can incorporate more detailed information, balanced against theoretical and practical considerations such as model tractability, model complexity and computational feasibility.

2 Can we calibrate the model? ✟

The adapted EM algorithm Scaling algorithms to deal with numerical stability Various methods to reduce computation times

3 Is the calibration accurate? (Simulation study) ✟ 4 Does it work on real insurance data? ✟ Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-44
SLIDE 44

Merci de votre attention!

slide-45
SLIDE 45

Alan Xian, UNSW Sydney

References I

Joachim Arts. A multi-item approach to repairable stocking and expediting in a fluctuating demand environment. European Journal of Operational Research, 256(1):102–115, 2017.

  • A. Guillou, S. Loisel, and G. Stupfler. Estimating the parameters of

a seasonal markov-modulated poisson process. Statistical Methodology, 26(0):103 – 123, 2015. Armelle Guillou, Stéphane Loisel, and Gilles Stupfler. Estimation of the parameters of a markov-modulated loss process in insurance. Insurance: Mathematics and Economics, 53(2):388–404, 2013.

  • L. J. Halliwell. Chain-ladder bias: Its reason and meaning.

Casualty Actuarial Society, 1(2):214–247, 2007.

  • M. Kunkler. Modelling zeros in stochastic reserving models.

Insurance: Mathematics and Economics, 34(1):23–35, 2004.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-46
SLIDE 46

Alan Xian, UNSW Sydney

References II

Joshua Landon, Süleyman Özekici, and Refik Soyer. A markov modulated poisson model for software reliability. European Journal of Operational Research, 229(2):404–410, 2013. Roland Langrock, David L Borchers, and Hans J Skaug. Markov-modulated nonhomogeneous poisson processes for modeling detections in surveys of marine mammal abundance. Journal of the American Statistical Association, 108(503): 840–851, 2013. Huijuan Liu and Richard Verrall. Predictive distributions for reserves which separate true ibnr and ibner claims. ASTIN Bulletin, 39:35–60, 2009. Shaochuan Lu. Markov modulated poisson process associated with state-dependent marks and its applications to the deep

  • earthquakes. Annals of the Institute of Statistical Mathematics,

64(1):87–106, 2012.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-47
SLIDE 47

Alan Xian, UNSW Sydney

References III

Walid W Nasr and Bacel Maddah. Continuous (s, s) policy with mmpp correlated demand. European Journal of Operational Research, 246(3):874–885, 2015. Jiangwei Pan, COM Vinayak Rao, EDU Pankaj K Agarwal, and Alan E Gelfand. Markov-modulated marked poisson processes for check-in data. In Proceedings of The 33rd International Conference on Machine Learning, pages 2244–2253, 2016.

  • P. Parodi. Triangle-free reserving. British Actuarial Journal, 19

(01):168–218, 2014.

  • A. E Renshaw. Modelling the claims process in the presence of
  • covariates. ASTIN Bulletin, 24(2):265–299, 1994.

William J.J. Roberts, Y. Ephraim, and E. Dieguez. On rydén’s em algorithm for estimating mmpps. Signal Processing Letters, IEEE, 13(6):373–376, 2006.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-48
SLIDE 48

Alan Xian, UNSW Sydney

References IV

Tobias Rydén. An em algorithm for estimation in markov-modulated poisson processes. Computional Statistics & Data Analysis, 21(4):431–447, 1996. S.L. Scott and P. Smyth. The markov modulated poisson process and markov poisson cascade with applications to web traffic

  • modelling. Bayesian Statistics, 7, 2003.

Greg Taylor. Maximum likelihood and estimation efficiency of the chain ladder. Astin Bulletin, 41(01):131–155, 2011. Greg Taylor and Gráinne McGuire. Loss reserving with glms: A case study. Casualty Actuarial Society Discussion Paper Program, Applying and Evaluating Generalised Linear Models, 2004.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes

slide-49
SLIDE 49

Alan Xian, UNSW Sydney

References V

  • R. Thayakaran and N. I. Ramesh. Multivariate models for rainfall

based on markov modulated poisson processes. Hydrology Research, 44(4):631–643, 2013a. ISSN 0029-1277. doi: 10.2166/nh.2013.180. URL http://hr.iwaponline.com/content/44/4/631. R Thayakaran and NI Ramesh. Markov modulated poisson process models incorporating covariates for rainfall intensity. Water Science and Technology, 67(8):1786–1792, 2013b. Charles Van Loan. Computing integrals involving the matrix

  • exponential. 1977.
  • T. Verdonck, M. V. Wouwe, and J. Dhaene. A robustification of

the chain-ladder method. North American Actuarial Journal, 13 (2):280–298, 2009.

Modelling micro-level insurance claim counts using Markov-modulated non-homogeneous Poisson processes