severity modeling of extreme insurance claims for
play

Severity Modeling of Extreme Insurance Claims for Tariffication - PowerPoint PPT Presentation

Severity Modeling of Extreme Insurance Claims for Tariffication Sascha Desmettre (joint work with C. Laudag, J. Wenzel) OICA 2020 - Online International Conference in Actuarial Science, Data Science and Finance April 28-29, 2020 S.


  1. Severity Modeling of Extreme Insurance Claims for Tariffication Sascha Desmettre (joint work with C. Laudagé, J. Wenzel) OICA 2020 - Online International Conference in Actuarial Science, Data Science and Finance April 28-29, 2020 S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 1 / 15

  2. Motivation Expected Claim Severity ◮ Usually modeled via generalized linear models (GLMs) based on gamma distribution (see e.g. [Ohlsson & Johansson (10), Wüthrich (17)]). Limitations ◮ Extreme claim sizes in data � The Gamma CDF is not heavy-tailed! Concentration on body of distribution may lead to ◮ bias predictions ◮ missing robustness in predictions � Extreme Value Theory might help! S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 2 / 15

  3. Modeling Framework Claim severity : Positive iid random RVs X 1 , X 2 , · · · ∼ X Claim frequency : Positive discrete RV N , where N ind. of X Features like car brand, age of driver or power of car affects damage. Vector of tariff features : R = ( R 1 , . . . , R d ) with positive RVs R i Tariff cell : Concrete combination of tariff features, e.g. 60 kW 80 kW . . . 18 years Cell 11 Cell 12 . . . 19 years Cell 21 . . . Cell 22 . . . ... . . . . . . r = (19 years, 80 kW) What is the expected claim severity for a specific tariff cell r ? E ( X | R = r ) Total damage in the given time period: E ( S | R = r ) = E ( N | R = r ) · E ( X | R = r ) S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 3 / 15

  4. Censoring by Insured Sum Primary insurers only pay for damages up to a specified amount. ◮ Considered as tariff feature R I . The actual damage Y may be larger than the insured sum. � Claim severity is then given by X := min( Y , R I ) . Insurer only observes realizations for X , i.e. right-censored data. � Determine the distribution of Y based on this censored data. S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 4 / 15

  5. Threshold Severity Model (TSM) Split the distribution of Y at a certain threshold u > 0. � Body and tail of the claim size distribution can be modeled separately. Notation for a given tariff cell r : ◮ H r cdf for the body with parameter vector Θ H ◮ G r cdf for the tail with parameter vector Θ G ◮ q r prob. of exceeding the given threshold u with parameter vector Θ q Assumptions to obtain a contiuous distribution function: ◮ H r ( u ; Θ H ) > 0 ◮ G r ( u ; Θ G ) = 0 S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 5 / 15

  6. Concrete Specification of the TSM Distribution function of Y with parameter vector Θ = (Θ H , Θ G , Θ q ):  0 , y ≤ 0 ,    (1 − q r (Θ q )) H r ( y ;Θ H ) F r ( y ; Θ) = , 0 < y ≤ u , H r ( u ;Θ H )   (1 − q r (Θ q )) + q r (Θ q ) G r ( y ; Θ G ) , y > u .  Note: Threshold u independent of tariff cell r However, the exceeding probability depends on insured sum: 1 q r (Θ q ) = 1 + e − ( δ 0 + δ I r I ) with Θ q = δ. � ˆ � ˆ Θ H , ˆ Θ G , ˆ Θ = Θ q is estimated via maximizing the log-likelihood. � Obtain desired expectation for a tariff cell r by [ X = min( Y , R I )]: � r I � � � � �� y ; ˆ r I ; ˆ E ˆ Θ (min( Y , R I ) | R = r ) = yf r Θ dy + r I 1 − F r Θ . 0 S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 6 / 15

  7. Recall: X := min( Y , R I ). S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 7 / 15

  8. Estimators for Basic and Extreme Claim Sizes Use concrete distributions for the conditional distribution functions below and above the threshold for a tariff cell r . Claim severity below the given threshold: ◮ Use general regression methods, i.e., a generalized linear model (GLM). ◮ Assume a gamma distribution for H r . ◮ In particular, the conditional distribution function P ( Y ≤ y | Y ≤ u , R = r ) = H r ( y ; Θ H ) H r ( u ; Θ H ) , 0 < y ≤ u , describes a truncated gamma distribution. Claim severity above the given threshold: ◮ Apply the peaks-over-threshold approach from extreme value theory. ◮ I.e., the conditional distribution function P ( Y ≤ y | Y > u , R = r ) = G r ( y ; Θ G ) , y > u , is approximated by the generalized Pareto distribution (GPD). S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 8 / 15

  9. Basic Claim Sizes: Truncated Gamma GLM We assume that for all covariates r ∈ R d ≥ 0 we have ( Y | Y ≤ u , R = r ) ∼ G ( φ, θ r , u ) with φ > 0 , θ r < 0 , i.e., they are truncated gamma distributed with dispersion φ , threshold u and scale θ r , depending on the tariff features r . GLM to model conditional distribution function of X = min( Y , R I ): P ( X ≤ x | X ≤ u , R = r ) = H r (min( x , u ); Θ H ) . H r ( u ; Θ H ) � d ′ θ ( b u ( ., ˆ φ ) ) g � − − − − − − → E ( X | X ≤ u , R = r ) − → α 0 + r i α i , i =1 with � 1 φ − 1 exp � � � − θ u θ u − u ′ := b ′ ( θ ) + φ φ ( b u ( θ, φ )) . � � 1 φ , − θ u γ φ S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 9 / 15

  10. Extreme Claim Sizes We are looking at the excess distribution: F u ( y , r ) = P ( Y ≤ y | Y > u , R = r ) = G r ( y ; Θ G ) , y > u . Theorem of Pickands, Balkema and de Haan: � � lim sup � F u ( x ) − G ξ,β ( u ) ( x ) � = 0 . � � u ↑ x F 0 < x < x F − u Application to Y with Θ G = ( ξ, β ) provides approximation : G r ( y ; Θ G ) = G ξ,β ; u ( y ) = G ξ,β ( y − u ) , y > u . Conditional distribution function of X := min( Y , R I ): P ( X ≤ x | X > u , R = r ) = G ξ,β (min ( x , r I ) − u ) , x > u . S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 10 / 15

  11. Simulation Study Goal: Show that the TSM outperforms the classical gamma GLM when fitting to simulated claim sizes from other regression models. � Use heavy-tailed regression models based on the log-normal and Burr Type XII distributions to generate claim sizes. Present and compare the predictions stemming from the gamma GLM and the TSM w.r.t. the different scenarios. Setting: ◮ Set the index of the insured sum to 1 and denote it by v (= r 1 = r I ). ◮ Insured sums: 5 million, 20 million, 50 million. ◮ Second tariff feature taking integer values from 1 to 10. [E.g. mileage or the car’s power; denoted by w (= r 2 )]. � 30 tariff cells in total. S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 11 / 15

  12. Simulation Study: Log-Normal Regression 1 Simulate a normal random variable Z ∼ N ( µ, σ ) with mean µ = α 0 + α 1 v + α 1 w and standard deviation σ > 0. 2 Obtain the log-normal random variable by X = e Z . 3 In order to obtain a significant influence of the insured sum, we use the following parameters in this scenario: α 0 = 5 . 5 , α 1 = 4 × 10 − 8 , α 2 = 0 . 02 , σ = 2 . 75 . 4 Compare the classical gamma GLM with the TSM in this log-normal setting. S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 12 / 15

  13. Simulation Study: Burr Regression 1 Simulate claim sizes from a Burr Type XII distribution, i.e, Y ∼ Burr ( β, λ, τ ) with density fucntion λβ λ τ y τ − 1 f B ( y ; β, λ, τ ) = ( β + y τ ) λ +1 , y > 0 , β, λ, τ > 0 . 2 To incorporate tariff cells, we use a regression for the parameter β , i.e., we obtain the conditional distribution ( Y | R = r ) ∼ Burr ( β ( r ) , λ, τ ) with β ( r ) := exp ( τ ( α 0 + α 1 v + α 1 w )) . 3 Parameter values in this scenario: α 0 = 8 , α 1 = 4 × 10 − 8 , α 2 = 0 . 02 , λ = 1 . 5 , τ = 0 . 7 ( ⇒ heavy tails) . 4 Compare the cl. gamma GLM with the TSM in this Burr-type setting. S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 13 / 15

  14. Results - Observed Statistics Quantify the relative deviation between the true ( µ i ) and predictive mean ( ˆ µ i ) of a specific tariff cell. Calculate (weighted) averages of the relative differences for every scenario w.r.t. all tariff cells: 30 30 z 1 := 1 | ˆ µ i − µ i | m i | ˆ µ i − µ i | � � ¯ , ¯ z 2 := . 30 µ i m µ i i =1 i =1 Simulated Claims Model ¯ ¯ z 1 z 2 Log-Normal Gamma GLM 53.31% 14.58% Log-Normal TSM 21.67% 13.35% Burr Gamma GLM 74.82% 23.51% Burr TSM 17.78% 5.59% S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 14 / 15

  15. Conclusion and Outlook TSM combines idea of GLMs with EVT for tariffication. Allows for simple interpretations. Robust against Log-Normal and Burr claim sizes. Outperforms the classical gamma-based GLM. Further tariff features for excess distribution. Usage of different thresholds. Transfer to risk management. S. Desmettre Modeling of Extreme Insurance Claims April 28-29, 2020 15 / 15

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend