Using Predictive Analytics to Detect F Fraudulent Claims d l Cl i - - PowerPoint PPT Presentation

using predictive analytics to detect f fraudulent claims
SMART_READER_LITE
LIVE PREVIEW

Using Predictive Analytics to Detect F Fraudulent Claims d l Cl i - - PowerPoint PPT Presentation

Using Predictive Analytics to Detect F Fraudulent Claims d l Cl i May 17, 2011 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL Experience the Pinnacle Difference! Predictive Analysis for Fraud Claim fraud is


slide-1
SLIDE 1

Using Predictive Analytics to Detect F d l Cl i Fraudulent Claims

May 17, 2011 Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Spring Meeting Palm Beach, FL

Experience the Pinnacle Difference!

slide-2
SLIDE 2

Predictive Analysis for Fraud

Claim fraud is increasing, focus on fraud is magnified magnified There are special investigators in the industry that are good at detecting fraud g g As good as they are, they can’t review every claim and detect all fraud di i l i b i h i Predictive analytics can bring the expertise to bear on all claims Predictive analytics can enhance the work of Predictive analytics can enhance the work of investigators by uncovering complexities the human eye may miss

slide-3
SLIDE 3

Claim Fraud is Increasing, and the Focus Cl i F d i I i W ll

  • n Claim Fraud is Increasing as Well
slide-4
SLIDE 4

Increasing Claim Fraud – 2011 Headlines

March 30 – Suspicious claims rise 34% in Florida April 17 – The Battle Against Insurance Fraud in Georgia p g g April 26 – Insurance Groups Stress Need for N.Y. No-Fault Reform at Hearing April 26 – PIP Bills Crash in Florida May 2 – Four Women Booked with Insurance Fraud in Louisiana Louisiana May 5 – Council Woman Gets Jail Time for Insurance Fraud May 6 - Allstate Files $4 Million Insurance Fraud Case in New York May 8 – Questionable Claims on the Rise in Oklahoma (+15%) l May 12 – NY State Must Stand Against No Fault Car Insurance Fraud

slide-5
SLIDE 5

Increase in Questionable Claims

4,232 4,016

4 000 4,500

3,613

3,000 3,500 4,000

1,654 1,825 1,578 1,982

2,000 2,500

446 867 647 1,082 1,082 734 867 868

500 1,000 1,500

  • 500

Tampa Miami Orlando New York City Los Angeles 2008 2009 2010 Source: National Insurance Crime Bureau

slide-6
SLIDE 6

Fraud Detection Process Fraud Detection Process

slide-7
SLIDE 7

Geneal Fraud Identification Process

Identify triggers that alert the claim adjuster to potential fraud (fraud indicators) Rely on claim adjusters to identify potentially y j y p y fraudulent claims (recognition, intuition) Potentially fraudulent claims are referred to Potentially fraudulent claims are referred to SIU Smaller group of SIU investigators handle the Smaller group of SIU investigators handle the investigation of fraudulent claims

slide-8
SLIDE 8

Recognition (I’ve Seen This Before)

Examples

Repeat offenders Repeat offenders Provider/patient/attorney combinations

Approach pp

Advisory claim database Experience of adjuster

Disadvantages

Assumes adjuster has seen it before Aliases Fraud becomes smarter

slide-9
SLIDE 9

Fraud Indicators

Rules based system Identify known or potential fraud scenarios Advantages

Easy to implement and modify Easy to understand Effective to attack specific problems

Disadvantages

Doesn’t detect new and unknown fraud Creates smarter fraud

slide-10
SLIDE 10

Fraud Indicators - Examples

Distance between claimant’s home address and medical provider Multiple medical opinions/providers Certain claim types (e g soft tissue) Certain claim types (e.g., soft tissue) Changing providers for the same treatment (possibly correlated with other claim activity) y High number of treatments for type of injury Abnormally long treatment time off for the type of injury Accident severity does not correlate with severity of injury

slide-11
SLIDE 11

Intuition (Something Smells Funny)

Something about the claim doesn’t seem right to the adjuster, and it is referred to the SIU Relies on ability and experience of adjuster to y p j see suspicious cases Inexperienced adjusters will not have the Inexperienced adjusters will not have the ability to detect suspicious as well

slide-12
SLIDE 12

As Good as the SIU Is… As Good as the SIU Is…

slide-13
SLIDE 13

Concerns with the Current Process Claim referral can be inconsistent – heavy dependence on claim adjuster False positives p Claim adjuster may not be aware of all suspicious relationships suspicious relationships Not all historical fraud has been identified P i iti ti f t ti ll f d l t l i Prioritization of potentially fraudulent claims

slide-14
SLIDE 14

Using Predictive Analytics to Address These Concerns

Predictive analysis of historical referrals ( i t t f l ) (consistent referrals) Predictive analysis of historical fraudulent claims (false positives) (false positives) Association analysis (recognition of claim patterns) patterns) Clustering Methods (missed claims, prioritization)

K-mean clustering K mean clustering Kohonen self-organizing maps

PRIDIT (consistent referrals, prioritization) ( , p )

slide-15
SLIDE 15

Analysis of Historical Referrals

Target: history of claim referrals to SIU Independent Factors: details of claim Models Tested

Decision tree Neural network

Linear regression

Linear regression Ensemble

Result: given the history of claim referrals the Result: given the history of claim referrals, the likelihood that a new claim should be referred to SIU based on the claim characteristics

slide-16
SLIDE 16

Decision Tree

Most serious injury: neck sprain/strain Claimant's hospital treatment: did not go,

  • utpatient
  • utpatient

Arbitration: non-binding Impact severity to claimant's vehicle: none, Impact severity to claimant s vehicle: none, minor Was claimant represented by an attorney? Y

slide-17
SLIDE 17

Regression: First Report of Claim

First Report of Claim

1.0834 1.3155 1.1742 1.2000 1.4000 1.0000 0.8000 1.0000 0.4000 0.6000 0 0000 0.2000 0.0000 Insured Claimant Attorney Other

slide-18
SLIDE 18

Referral Score

18 0%

Referral Score

14.0% 16.0% 18.0% 8.0% 10.0% 12.0% 4.0% 6.0% 8.0% 0.0% 2.0% 0.00 0.04 0.08 0.12 0.16 0.20 0.24 0.28 0.32 0.36 0.40 0.44 0.48 0.52 0.56 0.60 0.64 0.68 0.72 0.76

slide-19
SLIDE 19

Analysis of Historical Fraudulent Claims

Target: history of actionable claim referrals to SIU Independent Factors: details of claim Independent Factors: details of claim Models Tested

Decision tree Neural network Linear regression Ensemble Ensemble

Result:

given the history of claim referrals, the likelihood that

action will be taken on a new claim based on the claim action will be taken on a new claim based on the claim characteristics

Comparison to referral claims

slide-20
SLIDE 20

Decision Tree Comparison – Variable Importance po ta ce

Variable Actionable Importance SIU Referral Importance Ratio Variable Importance Importance Ratio Central City 1.000 0.464 46.4% Replace:Claimant's state of residence 0.967 1.000 103.5% Impact severity to claimant's vehicle 0.962 0.828 86.2% Was claimant represented by an attorney? 0.850 0.905 106.4% Policy coverage limits per person 0.750 0.411 54.9% Arbitration 0.547 0.368 67.2% Most serious injury 0 530 0 375 70 9% Most serious injury 0.530 0.375 70.9% Settlement_lag 0.456 0.000 0.0% Who reported injury to insurer 0.439 0.374 85.3% Most expensive injury 0.423 0.239 56.5% DRAGE 0.312 0.306 98.0% Lawsuit status 0.295 0.000 0.0% Driver, other violation 0.285 0.000 0.0% Amount Spent on Medical Professionals 0 255 0 412 161 6% Amount Spent on Medical Professionals 0.255 0.412 161.6%

slide-21
SLIDE 21

Difference in Referred vs. Actionable Claims

50 0%

Referred Minus Actionable

35.7% 43.5%

35.0% 40.0% 45.0% 50.0%

D i s t

Sh ld h 20.0% 25.0% 30.0% 35.0%

t r i b u

Should have been referred? False Positives

0.0% 0.0% 0.0% 0.1% 0.2% 0.4% 0.3% 0.2% 3.9% 5.9% 3.2% 2.2% 1.8% 1.2% 0.2% 0.8% 0.2% 0.2%

0 0% 5.0% 10.0% 15.0%

u t i

  • n

0.0%

  • 1.00
  • 0.90
  • 0.80
  • 0.70
  • 0.60
  • 0.50
  • 0.30
  • 0.20
  • 0.10

0.00 0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 1.00

n

Difference Difference

slide-22
SLIDE 22

Association Analysis (recognition of patterns) patte s)

Technique used in market basket analysis Identification of items that occur together in the same record Produces event occurrence as well as confidence interval around the occurrence likelihood C l d l i ll hi h Can lead to sequence analysis as well, which considers timing and ordering of events

slide-23
SLIDE 23

Association Analysis Measurements

Support – how often items occur together C fid t th f i ti

Transactions that contain items A & B All transactions

Confidence – strength of association

Transactions that contain items A & B Transactions that contain item A

Expected Confidence – proportion of items that satisfy right side of rule satisfy right side of rule

Transactions that contain item B All transactions All transactions

slide-24
SLIDE 24

Association Analysis Output

slide-25
SLIDE 25

Association Output Example

slide-26
SLIDE 26

Self – Organizing Maps

Topological mapping from input space to l t clusters Observations from the input space are d t i d id mapped onto an organized grids Neurons are determined initially, and as i t d t th id th inputs are mapped to the grids the neurons are adjusted A i t i t h d t th id ll th As a input is matched to the grid, all the neurons around that grid are updated

slide-27
SLIDE 27

SOM – SIU Indicator

slide-28
SLIDE 28

Clustering/Segmentation

Unsupervised classification technique Groups data into set of discrete clusters or contiguous groups of cases Performs disjoint cluster analysis on the basis of Euclidean distances computed from one or more quantitative input variables and cluster seeds quantitative input variables and cluster seeds Objects in each cluster tend to be similar, objects in different clusters tend to be dissimilar different clusters tend to be dissimilar Can be used as a dimension reduction technique

slide-29
SLIDE 29

Cluster Evaluation – Suspicion Scores Root Mean Square Standard Deviation – variability of claims within a cluster Distance to Nearest Cluster – group of outlier g p claims Distance from Cluster Seed – the distance of Distance from Cluster Seed the distance of the claim from the average Review of cluster summary statistics Review of cluster summary statistics

slide-30
SLIDE 30

Homeowner Contents Analysis

Claim values by detailed category

Replacement cost value Depreciation Number of items Age

Property characteristics (age, bathrooms, bedrooms) Coverage details (coverage C) Insured demographics (age, education, income)

slide-31
SLIDE 31

Cluster Proximities – All Causes of Loss

slide-32
SLIDE 32

Cumulative Distribution – Distance to Nearest Cluster (Theft) ea est C uste ( e t)

slide-33
SLIDE 33

Review of Cluster Summary Statistics

slide-34
SLIDE 34

Distance from Cluster Mean

90

Public Adjuster

80.8 70 80 90 40 50 60 18.3 20 30 40 2.3 2.5 10 Theft Fire Water Weather Other Theft Fire Water Weather Other

slide-35
SLIDE 35

Final Fraud Calculations

Factor Name Description Input Value insured_kids_2 Y, N, or U u peril_2 Cause of Loss Fire public_adjuster 0 or 1 IMP_REP_Coverage_C Coverage C Amount 190,500 IMP_REP_Insured_Home_Bathrooms Number of Bathrooms 2 IMP_REP_Insured_Home_Bedrooms Number of Bedrooms 3 IMP REP Insured Home SqFt Square Footage 1 412 IMP_REP_Insured_Home_SqFt Square Footage 1,412 IMP_REP_Insured_Home_YearBuilt Year Built 1973 IMP_REP_Insured_Homeowner Homeowner (Y or N) Y IMP_REP_acvloss_rcttotal Ratio of ACV Loss to RCT Total 1.13 IMP_REP_create_lag Delay in Creating Record 9 IMP_REP_insured_age_2 Insured Age 50 IMP_REP_insured_educationlevel_2 Years of Education 12 IMP_REP_insured_homevalue_calc_r Home Value Calculation Rounded 149 IMP_REP_insured_yearsinhome_2 Insured Years in Home 6

Suspicion Score Root Mean Square Error 99.7% Distance to Nearest Cluster 99.4% Distance from Mean 96.5% Combined 98.3%

slide-36
SLIDE 36

PRIDIT Comparison

slide-37
SLIDE 37

Wrap Up - Predictive Analysis for Fraud

Claim fraud is increasing, focus on fraud is magnified magnified There are special investigators in the industry that are good at detecting fraud g g As good as they are, they can’t review every claim and detect all fraud di i l i b i h i Predictive analytics can bring the expertise to bear on all claims Predictive analytics can enhance the work of Predictive analytics can enhance the work of investigators by uncovering complexities the human eye may miss