Manchester UK Professor Jorge Ribeiro Patrick Ribeiro 1 SAS/ETS - - PowerPoint PPT Presentation

manchester uk
SMART_READER_LITE
LIVE PREVIEW

Manchester UK Professor Jorge Ribeiro Patrick Ribeiro 1 SAS/ETS - - PowerPoint PPT Presentation

Survival Data Mining using Enterprise Miner and Proportional Hazard Cox Model 25 th June 2015 Manchester UK Professor Jorge Ribeiro Patrick Ribeiro 1 SAS/ETS Econometrics Time Series Enterprise Miner 13.2 PROC ARIMA Survival Analysis


slide-1
SLIDE 1

1

Survival Data Mining using Enterprise Miner and Proportional Hazard Cox Model 25th June 2015 Manchester – UK Professor Jorge Ribeiro Patrick Ribeiro

slide-2
SLIDE 2

2

Survival Analysis Node

2

Enterprise Miner 13.2

Simulation Studio 13.2

SAS/OR

Operational Research

SAS/ETS

Econometrics Time Series PROC ARIMA PROC AUTOREG

slide-3
SLIDE 3

3

Model 1 - Time to Next Purchase Survival Discrete Model

slide-4
SLIDE 4

4

slide-5
SLIDE 5

5

Survival Analysis Node

5

Enterprise Miner 13.2

slide-6
SLIDE 6

6

1.1 - Model 1 - Time to Next Purchase Survival Discrete Model

slide-7
SLIDE 7

7

1.2 - Model 1 - Time to Next Purchase

“People are much more likely to get on a bus if they know where it is going”.

Steps Plan

slide-8
SLIDE 8

8

1.2 - Model 1 - Time to Next Purchase

slide-9
SLIDE 9

9

1.2 - Model 1 - Time to Next Purchase

slide-10
SLIDE 10

10 10

1.2 - Model 1 - Time to Next Purchase

slide-11
SLIDE 11

11 11

Final Model - Hazard Function

slide-12
SLIDE 12

12 12

Final Model - Benefit graph

slide-13
SLIDE 13

13 13

Final Model

slide-14
SLIDE 14

14 14

slide-15
SLIDE 15

15 15

1.2 - Model 1 - Time to Next Purchase

slide-16
SLIDE 16

16

PROC ARIMA / PROC AUTOREG

16

SAS/ETS – Econometrics Time Series

slide-17
SLIDE 17

17 17

SAS/ETS – Econometrics Time Series

The Cross-Correlation Function

t t

H L 

4

Jan

t t

H L 

1 t t

H L 

2

Oct Nov Dec Jan Feb Lag Apr Dec Jan Feb Mar Dec

slide-18
SLIDE 18

18

PROC ARIMA / PROC AUTOREG

18

SAS/ETS – Econometrics Time Series

Primary Event Variables Royal Wedding Bank Holiday Price Marketing Campaign Point/Pulse Step Ramp tevent

slide-19
SLIDE 19

19

PROC ARIMA / PROC AUTOREG

19

SAS/ETS – Econometrics Time Series

slide-20
SLIDE 20

20

Simulation Studio 13.2

20

SAS/OR – Operational Research

slide-21
SLIDE 21

21

Simulation Studio 13.2

21

slide-22
SLIDE 22

22 22

2 - Model 2 - Call Centre Demand Call Centre Demand Model

slide-23
SLIDE 23

23 23

2.1 - Model 2 – Call Centre

Wait Time Goal = 30 Wait Time Max = 90

slide-24
SLIDE 24

24 24

2.2 - Model 2 – Call Centre

Wait Time Goal = 30 Wait Time Max = 90

slide-25
SLIDE 25

25 25

2.3 - Model 2 – Call Centre

Wait Time Goal = 30 Wait Time Max = 90

slide-26
SLIDE 26

26 26

2.4 - Model 2 – Call Centre

Wait Time Goal = 30 Wait Time Max = 90

slide-27
SLIDE 27

27 27

2.5 - Model 2 – Call Centre

slide-28
SLIDE 28

28 28

3.1 - Model 3 – Stress Test and Scenario Analysis

slide-29
SLIDE 29

29 29

slide-30
SLIDE 30

30 30

62 days for data preparation 6 days for modelling

slide-31
SLIDE 31

31

slide-32
SLIDE 32

32

Step 1 – Economic variables

Economic Variables Unemployment GDP Inflation Cash rate Credit availability House prices Commercial property prices Commodity prices Swap rates Equity prices

slide-33
SLIDE 33

33

Cox Proportional Hazards Model

1 1

 

{ ... }

( ) ( )

i k ik

X X i

h t e h t

 

Baseline Hazard function - involves time but not predictor variables Linear function of a set of predictor variables - does not involve time

...

slide-34
SLIDE 34

34

Step 3 – Model

PROC PHREG DATA = MODEL COVSANDWICH(AGGREGATE); CLASS Risk ; MODEL (START,END)*DEFAULT(0) = Risk P1GDP UNEMPLOYMENT; ID CUSTOMER_ID; HAZARDRATIO Risk / DIFF=REF; HAZARDRATIO P1GDP / UNITS = 1 2 3 5; HAZARDRATIO UNEMPLOYMENT / UNITS = 1 2 3 5; RUN;

PD_Band Risk 1 to 5 1 6 to 11 5 12 to 16 09 17 to 18 12 19 to 20 15

slide-35
SLIDE 35

35

SAS Results

For each 1 unit increase in the GDP, the Hazard of Default goes down by an estimated 16.7 %.

0.18257

e 0.833

 100*(0.833 1) 16.7%   

slide-36
SLIDE 36

36

SAS Results

For each 1 unit increase in the Unemployment, the Hazard of Default increases by an estimated 25.5 %.

0.22684

e 1.255  100*(1.255 1) 25.5%  

Risk

slide-37
SLIDE 37

37

SAS Results A customer in the Band 01 has a ONLY 8.7%

the risk of Default (or - 91.3%) compared to a customer in the Band 15 (the reference Band).

2.44279

e 0.087

 100*(0.087 1) 91.3%   

HAZARD RATIO (BAND 01) 0.087 HAZARD RATIO (BAND 15)   

slide-38
SLIDE 38

38

100*(0.087 1) 91.3%   

HAZARD RATIO (BAND 01) 0.087 HAZARD RATIO (BAND 15)   

HAZARDRATIO Risk / DIFF=REF;

SAS Results A customer in the Band 01 has a ONLY 8.7%

the risk of Default (or - 91.3%) compared to a customer in the Band 15 (the reference Band).

Risk

slide-39
SLIDE 39

39

SAS Results

Output 7

PROC PHREG DATA = MODEL COVSANDWICH(AGGREGATE); CLASS Risk (PARAM=REF REF='15') ; MODEL (START,END)*DEFAULT(0) = Risk P1GDP UNEMPLOYMENT; ID CUSTOMER_ID; HAZARDRATIO P1GDP / UNITS = 1 2 3 5; HAZARDRATIO UNEMPLOYMENT / UNITS = 1 2 3 5; RUN;

100*(0.694 1) 30.6%    100*(0.578 1) 42.2%    100*(0.401 1) 59.9%   

slide-40
SLIDE 40

40

SAS Results

Output 8

PROC PHREG DATA = MODEL COVSANDWICH(AGGREGATE); CLASS Risk (PARAM=REF REF='15'); MODEL (START,END)* DEFAULT(0) = Risk P1GDP UNEMPLOYMENT; ID CUSTOMER_ID; HAZARDRATIO P1GDP / UNITS = 1 2 3 5; HAZARDRATIO UNEMPLOYMENT / UNITS = 1 2 3 5; RUN;

100*(1.574 1) 57.4%   100*(1.975 1) 97.5%   100*(3.109 1) 210.9%  

slide-41
SLIDE 41

41

Survival Function

Scenario Analysis 1 P1GDP=1.1; Unemployment=6; Scenario Analysis 2 P1GDP=0.8; Unemployment=10;

slide-42
SLIDE 42

42

Forecast under Scenario

slide-43
SLIDE 43

43 43

Go Further

Introduction to Survival Analysis using PH Cox Models Applying Survival Analysis for Business

 

slide-44
SLIDE 44

44 44

Go Further

Survival Data Mining Programming Approach Survival Data Mining Using Enterprise Miner

 

slide-45
SLIDE 45

45 45

Go Further – Books

slide-46
SLIDE 46

46 46

Go Further – Books

slide-47
SLIDE 47

47

Questions

47

www.modellingtraining.com info@modellingtraining.com

  • SAS code
  • Results
  • PDF

Email: Web page: Tel: 01943 430241 07880 474564