1
Lecture 17: Survival Analysis
- - Cox proportional Hazards
Lecture 17: Survival Analysis -- Cox proportional Hazards Ani - - PowerPoint PPT Presentation
Lecture 17: Survival Analysis -- Cox proportional Hazards Ani Manichaikul amanicha@jhsph.edu 14 May 2007 1 Survival Analysis n Suppose we have designed a study to estimate survival after chemotherapy treatment for patients with a certain
1
2
n Suppose we have designed a study to
n Patients received chemotherapy
3
n In this study the event of interest is
n The time clock starts as soon as the
4
5
6
7
8
9
10
n Patient:
n 1:
n 2:
n 3:
n Patients two and three are called
11
n Estimation of the survival curve n S(t) = Proportion surviving at least to
12
n Life table method
n Grouped in intervals
n Kaplan-Meier (1958)
n Ungrouped data n Small samples
13
n Curve can be estimated at each event,
n y(t) = # events at time t n n(t) = # subjects at risk for event at
14
n Curve can be estimated at each event,
15
n Curve can be estimated at each event,
16
n Start estimate at first event time
n No Chemotherapy Group: Time = 5
17
n No Chemotherapy group: Time= 8
n 2nd event time
18
n Skip over censoring times: Remove
n Continue through final event time
19
20
21
n Graph is a step function n “Jumps” at each observed event time n Nothing is assumed about curved shape
22
23
24
n Variance of S(t) n Standard Error
≤
t t j j j j j
j
: 2
25
n Using Greenwood’s formula, and
n There is a “problem”: the 95%
GW t
26
n Complementary log-log transformation n Variance of CLL:
2 : :
log )] ( ˆ ( Var[ − − =
≤ ≤ t t j j j j j t t j j j j j
j j
y n n y y n n y t υ
27
n Use CLL to obtain 95% confidence
n Get 95% CI for :
CLL t
28
n Transform back to get 95% for S(t):
(t)
ν
( )
) ( SE * 96 . 1 ) ( CLL SE * 96 . 1 ) ( ˆ ) ( CLL SE * 96 . 1 ) ( ˆ
t CLL t t t t
± − +
υ υ
29
30
31
n VarGreenwood(13)] = 0.8182
n 95% CI Greenwood
+ 9 * 10 1 10 * 11 1
32
n n
2
CLL
33
n 95% CLL for S(13)
) 708 (. * 96 . 1
±
34
35
36
n The Kaplan-Meier estimate and log-rank
n But…we also want a simple summary
37
n The regression model for the hazard
n log hazard: n hazard:
p p X
X X t X t β β β λ λ + + + + = ... ) ( log ) ; ( log
2 2 1 1
β β β β
X X X X
p p
2 2 1 1
(Vector of X’s)
38
n 0(t): Hazard (incidence) rate as a
n
n exp{ 1} : the relative hazard associated
39
n exp{ 1} : the relative risk for X1+ 1 -vs-
n Other s have similar interpretations
40
n e: “multiplies” the baseline hazard
n This is therefore a “proportional
n the effect of any (fixed) X is the same at
41
n is the focus whereas 0(t) is a nuisance
n David Cox (1972) showed how to estimate
n “Semi-parametric”
n 0(t) is the baseline hazard
n “non-parametric part of the model
n are the regression coefficients
n “parametric” part of the model
42
n It uses the partial likelihood, not the
n We do not assume a particular
43
n Semi-parametric model for the hazard
n Where
n i(t) is the hazard for person i at week t n 0(t) is the hazard if Xi = 0 (not maintained
β
i
X i
44
n e-0.812 = 0.44: relative rate of AML
n 1/.44 = 2.25 relative rate of AML relapse
n 95% CI: [e-.812-1.96* .521, e-.812+ 1.96* .521]
n (1.22, 6.25)
45
n
n
n
n
n
46
47
n Model for the log hazard rate (incidence
n Model for the hazard rate
8 8 2 2 1
8 8 2 2 1 1
... 0 )
X X X
β β β
+ + +
48
49
n What is the relative risk of death for the
50
n e-1.0777 = .34 n 66% reduction in the risk of death for
n note the coding 2= CABG, 1= Medical
51
n CHFSCR: Controlling for type of treatment
n AGE: Controlling for type of treatment and
52
n HYPTEN: Controlling for type of treatment
n Anyone know why a history of hypertension
53
n What is the relative risk death for
54
n log hazard for (A) =
n log hazard for (B) =
55
n Difference in log hazards, (B) -vs- (A):
n Relative Risk, (B) -vs- (A):
n Is the assumption of “otherwise comparable risk
56
n How much higher is the risk of a 70
57
n The estimated difference in log hazards for
n A ten year difference in the age at initiation
58
n How would you determine whether the