Lecture 17: Survival Analysis -- Cox proportional Hazards Ani - - PowerPoint PPT Presentation

lecture 17 survival analysis cox proportional hazards
SMART_READER_LITE
LIVE PREVIEW

Lecture 17: Survival Analysis -- Cox proportional Hazards Ani - - PowerPoint PPT Presentation

Lecture 17: Survival Analysis -- Cox proportional Hazards Ani Manichaikul amanicha@jhsph.edu 14 May 2007 1 Survival Analysis n Suppose we have designed a study to estimate survival after chemotherapy treatment for patients with a certain


slide-1
SLIDE 1

1

Lecture 17: Survival Analysis

  • - Cox proportional Hazards

Ani Manichaikul amanicha@jhsph.edu 14 May 2007

slide-2
SLIDE 2

2

Survival Analysis

n Suppose we have designed a study to

estimate survival after chemotherapy treatment for patients with a certain cancer

n Patients received chemotherapy

between 1990 and 1994 and were followed until death or the year 2000, whichever occurred first

slide-3
SLIDE 3

3

Survival Analysis

n In this study the event of interest is

death

n The time clock starts as soon as the

subject finishes his/her chemotherapy treatments

slide-4
SLIDE 4

4

Dies 1990 1995 2000

Survival Analysis

slide-5
SLIDE 5

5

Dies 1990 1995 2000

Patient one enters in 1990, dies in 1995: Patient one survives five years

Survival Analysis

slide-6
SLIDE 6

6

1990 1995 2000

Lost to Follow-up

Survival Analysis

slide-7
SLIDE 7

7

1990 1995 2000

Patient two enters in 1991, drops out in 1997: Patient two is lost to follow-up after six years

Survival Analysis

Lost to Follow-up

slide-8
SLIDE 8

8

1990 1995 2000

Withdrawn Alive (Administratively Censored)

Survival Analysis

slide-9
SLIDE 9

9

1990 1995 2000

Patient three enters in 1993, is still alive at end of study: Patient three is still alive after seven years

Survival Analysis

Withdrawn Alive

slide-10
SLIDE 10

10

Survival Analysis

n Patient:

n 1:

1990

→ 1995 5 years

n 2:

1991

→ 1997 6+ years

n 3:

1993

→ 2000 7+ years

n Patients two and three are called

censored observations

slide-11
SLIDE 11

11

Central Problem

n Estimation of the survival curve n S(t) = Proportion surviving at least to

time t or beyond

slide-12
SLIDE 12

12

Approaches

n Life table method

n Grouped in intervals

n Kaplan-Meier (1958)

n Ungrouped data n Small samples

slide-13
SLIDE 13

13

) _ _ (Pr ) ( ) ( ) ( ) ( Time Event evious t n t y t n t S S ×         − =

Kaplan-Meier Estimate

n Curve can be estimated at each event,

but not at censoring times

n y(t) = # events at time t n n(t) = # subjects at risk for event at

time t

slide-14
SLIDE 14

14

) _ _ (Pr ) ( ) ( ) ( ) ( Time Event evious t n t y t n t S S ×         − =

Proportion of original sample making it to time t

Kaplan-Meier Estimate

n Curve can be estimated at each event,

but not at censoring times

slide-15
SLIDE 15

15

Proportion surviving to time t who survive beyond time t

Kaplan-Meier Estimate

n Curve can be estimated at each event,

but not at censoring times

) _ _ (Pr ) ( ) ( ) ( ) ( Time Event evious t n t y t n t S S ×         − =

slide-16
SLIDE 16

16

n Start estimate at first event time

n No Chemotherapy Group: Time = 5

833 . 12 10 12 2 12 ) 5 ( ) 5 ( ) 5 ( ) 5 ( = = − =         − = n y n S

Kaplan-Meier Estimate

slide-17
SLIDE 17

17

n No Chemotherapy group: Time= 8

n 2nd event time

666 . 833 . 10 8 ) 833 (. 10 2 10 ) 5 ( ) 8 ( ) 8 ( ) 8 ( ) 8 ( = × = ×       − = ×         − = S n y n S

Kaplan-Meier Estimate

slide-18
SLIDE 18

18

Kaplan-Meier Estimate

n Skip over censoring times: Remove

from number at risk for next event time

n Continue through final event time

slide-19
SLIDE 19

19

slide-20
SLIDE 20

20

slide-21
SLIDE 21

21

Kaplan-Meier Estimate

n Graph is a step function n “Jumps” at each observed event time n Nothing is assumed about curved shape

between each observed event time

slide-22
SLIDE 22

22

Kaplan-Meier Estimate

slide-23
SLIDE 23

23

Confidence Interval for S(t)

Greenwood’s Formula Complementary log-log transformation

slide-24
SLIDE 24

24

Greenwood’s Formula

n Variance of S(t) n Standard Error

( )

− =

t t j j j j j

j

y n n y t S t S Var

: 2

) ( ˆ )] ( ˆ [

)] ( ˆ [ ) ( SEGW t S Var t =

slide-25
SLIDE 25

25

95% Confidence Interval

n Using Greenwood’s formula, and

approximate 95% CI for S(t) is

n There is a “problem”: the 95%

Confidence Interval is not constrained to lie within the interval (0,1)

) ( SE * 96 . 1 ) ( ˆ

GW t

t S ±

slide-26
SLIDE 26

26

Alternative Confidence Interval

n Complementary log-log transformation n Variance of CLL:

)] ( ˆ log log[ ) ( ˆ t S t − = υ

( ) ( )

2 : :

log )] ( ˆ ( Var[                 − − =

∑ ∑

≤ ≤ t t j j j j j t t j j j j j

j j

y n n y y n n y t υ

)] ( ˆ ( Var[ (t) SECLL t υ =

slide-27
SLIDE 27

27

95% CI based on complementary log-log transformation

n Use CLL to obtain 95% confidence

interval on S(t)

n Get 95% CI for :

) (t ν

) ( SE * 96 . 1 ) ( ˆ

CLL t

t ± υ

slide-28
SLIDE 28

28

n Transform back to get 95% for S(t):

Use the inverse transformation to get the 95% CI for S(t):

( )

(t)

  • e

e ) (

ν

= t S

( ) ( )

( )

) ( SE * 96 . 1 ) ( CLL SE * 96 . 1 ) ( ˆ ) ( CLL SE * 96 . 1 ) ( ˆ

)] ( ˆ [ ] e , e [

  • e
  • e

t CLL t t t t

e

t S

± − +

=

υ υ

slide-29
SLIDE 29

29

Back to the AML Data

slide-30
SLIDE 30

30

Kaplan-Meier Estimates

slide-31
SLIDE 31

31

95% CI: Greenwood

n VarGreenwood(13)] = 0.8182

= (0.116)2

n 95% CI Greenwood

= .818 1.96* (.116) = (.586, 1.05) 1.05 is out of Range!

      + 9 * 10 1 10 * 11 1

±

slide-32
SLIDE 32

32

Better 95% CI CLL transformation

n n

605 . 1 )] ( ˆ log log[ ) ( ˆ − = − = t S t υ

708 . ) 13 ( 502 . 04027 . 0202 . 10 9 log 11 10 log 90 1 110 1 )] 13 ( ˆ ( Var[

2

= = =       +       + =

CLL

SE υ

slide-33
SLIDE 33

33

Better 95% CI CLL transformation

n 95% CLL for S(13)

Does not contain 1!

[ ]

) 952 ,. 437 (. 818 .

) 708 (. * 96 . 1

= =

±

e

slide-34
SLIDE 34

34

95% CI for S(t) in the maintained

  • n chemotherapy group
slide-35
SLIDE 35

35

95% CI for S(t) in the not maintained on chemo group

slide-36
SLIDE 36

36

Regression in Survival Analysis

n The Kaplan-Meier estimate and log-rank

tests are great ways to compare survival between groups without making too many assumptions.

n But…we also want a simple summary

measure that compares groups Solution: Regression Analysis

slide-37
SLIDE 37

37

Regression in Survival Analysis

n The regression model for the hazard

function (instantaneous incidence rate) as a function of p explanatory X variables is specified as:

n log hazard: n hazard:

p p X

X X t X t β β β λ λ + + + + = ... ) ( log ) ; ( log

2 2 1 1

( )( ) (

)

( )

β β β β

λ λ λ

X X X X

e t e e e t X t

p p

) ( ... ) ( ) ; (

2 2 1 1

= =

(Vector of X’s)

slide-38
SLIDE 38

38

Interpretations

n 0(t): Hazard (incidence) rate as a

function of time when all X’s are zero

n

  • ften must center Xs to make 0(t)

interpretable

n exp{ 1} : the relative hazard associated

with a 1 unit change in X1 (i.e., X1+ 1 - vs- X1), holding other Xs constant, independent of time

slide-39
SLIDE 39

39

Interpretations: Relative Risk

n exp{ 1} : the relative risk for X1+ 1 -vs-

X1, holding other Xs constant, independent of time

n Other s have similar interpretations

slide-40
SLIDE 40

40

Interpretations

n e: “multiplies” the baseline hazard

0(t) by the same amount regardless of

the time t.

n This is therefore a “proportional

hazards” model

n the effect of any (fixed) X is the same at

any time during follow-up

slide-41
SLIDE 41

41

Note

n is the focus whereas 0(t) is a nuisance

variable

n David Cox (1972) showed how to estimate

without having to assume a model for 0(t)

n “Semi-parametric”

n 0(t) is the baseline hazard

n “non-parametric part of the model

n are the regression coefficients

n “parametric” part of the model

slide-42
SLIDE 42

42

Why Cox Proportional Hazards Model is Different?

n It uses the partial likelihood, not the

likelihood

n We do not assume a particular

distribution for the failure time; we only assume proportional hazards

slide-43
SLIDE 43

43

Results from AML Data

n Semi-parametric model for the hazard

(incidence) rate for the AML data

n Where

n i(t) is the hazard for person i at week t n 0(t) is the hazard if Xi = 0 (not maintained

group), and is the multiplicative effect of Xi= 1 (maintained group)

β

λ λ

i

X i

e t t ) ( ) ( =

slide-44
SLIDE 44

44

Results from AML Data:

n e-0.812 = 0.44: relative rate of AML

relapse maintained vs not maintained

n 1/.44 = 2.25 relative rate of AML relapse

not-maintained vs maintained

n 95% CI: [e-.812-1.96* .521, e-.812+ 1.96* .521]

n (1.22, 6.25)

slide-45
SLIDE 45

45

Example: CABG surgery

n

Cox model to compare two treatments, controlling for several predictors (Fisher and Van Belle, 1993)

n

Compare surgical (CABG) with medical treatment for left main coronary heart disease

n

Use mortality (time to death) as the response variable

n

Control for 7 risk factors (age at baseline and 6 coronary status measures) in making the comparison

n

Time variable is time from treatment initiation to death or censoring due to the end of the study or lost to follow-up

slide-46
SLIDE 46

46

Variables

slide-47
SLIDE 47

47

Cox PH: CABG Surgery

n Model for the log hazard rate (incidence

  • f death):

n Model for the hazard rate

8 8 2 2 1

... ) ( log ) ; ( log X X X t X t β β β λ λ + + + + =

( )

8 8 2 2 1 1

... 0 )

( ) ; (

X X X

e t X t

β β β

λ λ

+ + +

=

slide-48
SLIDE 48

48

Cox Model Results

slide-49
SLIDE 49

49

Question:

n What is the relative risk of death for the

CABG group compared to the medical group, adjusting for age and other risk factors?

slide-50
SLIDE 50

50

Answer (Interpretation):

n e-1.0777 = .34 n 66% reduction in the risk of death for

  • therwise comparable patients treated

with CABG compared with patients treated medically

n note the coding 2= CABG, 1= Medical

gives the same results as 1= CABG and 0= Medical

slide-51
SLIDE 51

51

What is the interpretation of each coefficient?

n CHFSCR: Controlling for type of treatment

and other risk factors, the risk of death, as estimated from a Cox model, is e.2985 = 1.35 times higher per unit difference in CHF score

n AGE: Controlling for type of treatment and

  • ther risk factors, the risk of death, as

estimated from a Cox model, is e.0423 = 1.04 times higher per year of age

slide-52
SLIDE 52

52

Interpretation

n HYPTEN: Controlling for type of treatment

and other risk factors, the risk of death, as estimated from a Cox model, is e-.5428 = 0.58 times lower for patients who have a history of hypertension compared with those who do not

n Anyone know why a history of hypertension

should lower risk following treatment?

slide-53
SLIDE 53

53

Question:

n What is the relative risk death for

(A) a medically treated 45-year old

  • vs-

(B) a surgically treated 75 year old who otherwise have comparable risk factors?

slide-54
SLIDE 54

54

Answer:

n log hazard for (A) =

const + 1* (-1.0777) + 45* (.0423) = const + .8258

n log hazard for (B) =

const + 2* (-1.0777) + 75* (.0423) = const + 1.017

slide-55
SLIDE 55

55

Interpretation:

n Difference in log hazards, (B) -vs- (A):

(const+ 1.017) - (const+ .8258) = .1913

n Relative Risk, (B) -vs- (A):

e.1913 = 1.21 higher risk for older, surgically treated patient than for younger, medically treated patient

n Is the assumption of “otherwise comparable risk

factors” reasonable?

slide-56
SLIDE 56

56

Question:

n How much higher is the risk of a 70

year old patient compared with a 60 year old patient, assuming treatment and other coronary risk factors are the same?

slide-57
SLIDE 57

57

Answer:

n The estimated difference in log hazards for

two patients whose ages differ by 10 years, holding other predictors fixed is 10×AGE= 10 ×.0423 = .423 RR = e.423 = 1.53

n A ten year difference in the age at initiation

  • f treatment increases the risk of subsequent

mortality by 50%

slide-58
SLIDE 58

58

Another Question:

n How would you determine whether the

mortality advantage of CABG over medical treatment was greater for younger patients than for older patients?