Fitting smooth-in-time prognostic risk functions via logistic - - PowerPoint PPT Presentation

fitting smooth in time prognostic risk functions via
SMART_READER_LITE
LIVE PREVIEW

Fitting smooth-in-time prognostic risk functions via logistic - - PowerPoint PPT Presentation

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary Fitting smooth-in-time prognostic risk functions via logistic regression James A. Hanley 1 Olli S. Miettinen 1 1 Department of


slide-1
SLIDE 1

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

Fitting smooth-in-time prognostic risk functions via logistic regression

James A. Hanley1 Olli S. Miettinen1

1Department of Epidemiology, Biostatistics and Occupational Health,

McGill University

International Society for Clinical Biostatistics Prague, August 2009

slide-2
SLIDE 2

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

OUTLINE

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

slide-3
SLIDE 3

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-4
SLIDE 4

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-5
SLIDE 5

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-6
SLIDE 6

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-7
SLIDE 7

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-8
SLIDE 8

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY PROFILE-SPECIFIC RISK FUNCTIONS?

  • 5-year Cumulative Incidence or 5-year Risk, of stroke for

78 yr. white female with isolated hypertension (Systolic Pressure=180) if treat / do not treat hypertension ???

  • Most reports of RCTs are for “average" profile, and use

hazard/incidence ratios (HRs) rather than risk differences

  • For an individual patient,

HR = IDR = 0.65 not helpful.

  • Risk0−5 =

CI0−5 = 8.2% if Tx = 0 (don’t treat);

  • Risk0−5 =

CI0−5 = 5.2% if Tx = 1 (treat), more helpful

  • but need risks specific to the profile (unless profile is near

the centre of profiles included in trial).

slide-9
SLIDE 9

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

5-year Cumulative Incidence / Risk of Stroke:

1 2 3 4 5 5 10 15 Prospective time (years) Cumulative incidence (%) (a.0): 80 year old black male, SBP=180 (a.1) (b.1): 65 year old white female, SBP=160 (b.0) semi−parametric (Cox) proposed

<- High-risk, untreated <- High-risk, treated <- Low-risk, untreated <- Low-risk, treated

slide-10
SLIDE 10

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHAT WE WISHED TO DO

  • Model the hazard (h), or incidence density (ID), as a

smooth function of

  • set of prognostic indicators
  • choice of intervention
  • prospective time.
  • Estimate the parameters of this function.
  • Calculate profile-specific risk/cumulative incidence,

CIx(t) from this function:

  • CIx(t) = 1 − exp{−

Hx(t)} = 1 − exp{− t hx(u)du}.

slide-11
SLIDE 11

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHAT WE WISHED TO DO

  • Model the hazard (h), or incidence density (ID), as a

smooth function of

  • set of prognostic indicators
  • choice of intervention
  • prospective time.
  • Estimate the parameters of this function.
  • Calculate profile-specific risk/cumulative incidence,

CIx(t) from this function:

  • CIx(t) = 1 − exp{−

Hx(t)} = 1 − exp{− t hx(u)du}.

slide-12
SLIDE 12

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHAT WE WISHED TO DO

  • Model the hazard (h), or incidence density (ID), as a

smooth function of

  • set of prognostic indicators
  • choice of intervention
  • prospective time.
  • Estimate the parameters of this function.
  • Calculate profile-specific risk/cumulative incidence,

CIx(t) from this function:

  • CIx(t) = 1 − exp{−

Hx(t)} = 1 − exp{− t hx(u)du}.

slide-13
SLIDE 13

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHAT WE WISHED TO DO

  • Model the hazard (h), or incidence density (ID), as a

smooth function of

  • set of prognostic indicators
  • choice of intervention
  • prospective time.
  • Estimate the parameters of this function.
  • Calculate profile-specific risk/cumulative incidence,

CIx(t) from this function:

  • CIx(t) = 1 − exp{−

Hx(t)} = 1 − exp{− t hx(u)du}.

slide-14
SLIDE 14

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

SMOOTH-IN-TIME HAZARD FUNCTIONS

  • Hjort, 1992, International Statistical Review
  • Reid N. A Conversation with Sir David Cox.

1994, Statistical Science.

  • Royston and Parmar, 2002, Statistics in Medicine
slide-15
SLIDE 15

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

SMOOTH-IN-TIME HAZARD FUNCTIONS

  • Hjort, 1992, International Statistical Review
  • Reid N. A Conversation with Sir David Cox.

1994, Statistical Science.

  • Royston and Parmar, 2002, Statistics in Medicine
slide-16
SLIDE 16

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-17
SLIDE 17

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-18
SLIDE 18

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-19
SLIDE 19

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-20
SLIDE 20

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-21
SLIDE 21

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-22
SLIDE 22

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-23
SLIDE 23

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-24
SLIDE 24

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-25
SLIDE 25

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FORM

log{h(x, t)} = g(x, t, β) ⇐ ⇒ h(x, t) = eg(x,t,β)

  • x is a realization of the covariate vector X, representing

the patient profile P, and possible intervention I.

  • β : a vector of parameters with unknown values,
  • g() includes constant 1, variates for P, I and t;
  • g() can have product terms involving P, I, and t.
  • g() must be ‘linear’ in parameters, in ‘linear model’ sense.

————–

  • ‘proportional hazards’ if no product terms involving t & I
  • If t is represented by a linear term (so that ‘time to event’

∼ Gompertz), then CIp, i(t) has a closed smooth form.

  • If t is replaced by log t, then ‘time to event’ ∼ Weibull.
slide-26
SLIDE 26

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FITTING

  • Unable to find a ready-to-use ML procedure within the

common statistical packages.

  • Likelihood becomes quite involved even if no censored
  • bservations.
  • Albertsen & Hanley(’98); Efron(’88, ’02); Carstensen(’00):
  • divide ‘survival time’ of each subject into time-slices;
  • treat # of events in each ∼ Binomial / Poisson.
slide-27
SLIDE 27

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FITTING

  • Unable to find a ready-to-use ML procedure within the

common statistical packages.

  • Likelihood becomes quite involved even if no censored
  • bservations.
  • Albertsen & Hanley(’98); Efron(’88, ’02); Carstensen(’00):
  • divide ‘survival time’ of each subject into time-slices;
  • treat # of events in each ∼ Binomial / Poisson.
slide-28
SLIDE 28

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FITTING

  • Unable to find a ready-to-use ML procedure within the

common statistical packages.

  • Likelihood becomes quite involved even if no censored
  • bservations.
  • Albertsen & Hanley(’98); Efron(’88, ’02); Carstensen(’00):
  • divide ‘survival time’ of each subject into time-slices;
  • treat # of events in each ∼ Binomial / Poisson.
slide-29
SLIDE 29

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FULLY-PARAMETRIC MODEL: FITTING

  • Unable to find a ready-to-use ML procedure within the

common statistical packages.

  • Likelihood becomes quite involved even if no censored
  • bservations.
  • Albertsen & Hanley(’98); Efron(’88, ’02); Carstensen(’00):
  • divide ‘survival time’ of each subject into time-slices;
  • treat # of events in each ∼ Binomial / Poisson.
slide-30
SLIDE 30

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-31
SLIDE 31

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-32
SLIDE 32

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-33
SLIDE 33

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-34
SLIDE 34

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-35
SLIDE 35

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-36
SLIDE 36

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-37
SLIDE 37

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTING: OUR APPROACH

  • An extension of the method of Mantel (1973) to binary
  • utcomes that deals with time dimension.
  • Mantel’s problem:
  • (c =)165 ‘cases’ of Y = 1,
  • 4000 instances of Y = 0.
  • Associated regressor vector X for each of the 4165
  • A logistic model for Prob(Y = 1 | X)
  • A computer with limited capacity.
slide-38
SLIDE 38

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-39
SLIDE 39

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-40
SLIDE 40

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-41
SLIDE 41

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-42
SLIDE 42

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-43
SLIDE 43

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-44
SLIDE 44

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

MANTEL ’S SOLUTION

  • Form a reduced dataset containing...
  • All c instances (cases) of Y = 1
  • Random sample of the Y = 0 observations
  • Fit the same logistic model to this reduced dataset.

“Such sampling will tend to leave the dependence of the log odds on the variables unaffected except for an additive constant.” Anderson (Biometrika, 1972) had noted this too.

  • Outcome(Choice)-based sampling common in Epi, Marketing, etc...
slide-45
SLIDE 45

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA TO EXPLAIN OUR APPROACH

Systolic Hypertension in Elderly Program (SHEP)

.......................... SHEP Cooperative Research Group (1991). .......................... Journal of American Medical Association 265, 3255-3264.

  • 4,701 persons with complete data on P = {age, sex, race,

and systolic blood pressure} and I = {active/placebo}.

  • Study base of B = 20, 894 person-years of follow-up;

c = 263 events ("cases") of stroke identified.

slide-46
SLIDE 46

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA TO EXPLAIN OUR APPROACH

Systolic Hypertension in Elderly Program (SHEP)

.......................... SHEP Cooperative Research Group (1991). .......................... Journal of American Medical Association 265, 3255-3264.

  • 4,701 persons with complete data on P = {age, sex, race,

and systolic blood pressure} and I = {active/placebo}.

  • Study base of B = 20, 894 person-years of follow-up;

c = 263 events ("cases") of stroke identified.

slide-47
SLIDE 47

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA TO EXPLAIN OUR APPROACH

Systolic Hypertension in Elderly Program (SHEP)

.......................... SHEP Cooperative Research Group (1991). .......................... Journal of American Medical Association 265, 3255-3264.

  • 4,701 persons with complete data on P = {age, sex, race,

and systolic blood pressure} and I = {active/placebo}.

  • Study base of B = 20, 894 person-years of follow-up;

c = 263 events ("cases") of stroke identified.

slide-48
SLIDE 48

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA TO EXPLAIN OUR APPROACH

Systolic Hypertension in Elderly Program (SHEP)

.......................... SHEP Cooperative Research Group (1991). .......................... Journal of American Medical Association 265, 3255-3264.

  • 4,701 persons with complete data on P = {age, sex, race,

and systolic blood pressure} and I = {active/placebo}.

  • Study base of B = 20, 894 person-years of follow-up;

c = 263 events ("cases") of stroke identified.

slide-49
SLIDE 49

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA TO EXPLAIN OUR APPROACH

Systolic Hypertension in Elderly Program (SHEP)

.......................... SHEP Cooperative Research Group (1991). .......................... Journal of American Medical Association 265, 3255-3264.

  • 4,701 persons with complete data on P = {age, sex, race,

and systolic blood pressure} and I = {active/placebo}.

  • Study base of B = 20, 894 person-years of follow-up;

c = 263 events ("cases") of stroke identified.

slide-50
SLIDE 50

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STUDY BASE, and the 263 cases

1 2 3 4 5 6 7 1000 2000 3000 4000 5000 6000

t: Years since Randomization Persons

  • No. of Persons

Being Followed

STUDY BASE

− 20,894 person−years [B=20,894 PY] − 10,982,000,000 person−minutes (approx) − infinite number of person−moments

c = 263 events (Y=1) in this infinite number

  • f person−moments
  • infinite number
  • f person−moments

with Y=0

slide-51
SLIDE 51

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

OUR APPROACH

  • Base series: representative (unstratified) sample of base.
  • b: size of base series
  • B: amount of population-time constituting study base.
  • B(x, t): population-time element in study base

Pr(Y = 1|x, t) Pr(Y = 0|x, t) = h(x, t) × B(x, t) b × [B(x, t)/B] = h(x, t) × (B/b),

  • log(B/b) is an offset [a regression term with known coefficient of 1].

→ logistic model, with t having same status as x, and offset, directly yields h(x, t) = IDx,t = exp{ g(x, t)}.

slide-52
SLIDE 52

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

b: no. of instances of Y = 0 ; c: no. of instances of Y = 1

  • Mantel (1973)...

little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.

  • With 2008 computing, we can use a b/c ratio as high as 100,

and thereby extract virtually all of the information in the base.

slide-53
SLIDE 53

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

b: no. of instances of Y = 0 ; c: no. of instances of Y = 1

  • Mantel (1973)...

little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.

  • With 2008 computing, we can use a b/c ratio as high as 100,

and thereby extract virtually all of the information in the base.

slide-54
SLIDE 54

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

b: no. of instances of Y = 0 ; c: no. of instances of Y = 1

  • Mantel (1973)...

little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.

  • With 2008 computing, we can use a b/c ratio as high as 100,

and thereby extract virtually all of the information in the base.

slide-55
SLIDE 55

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

b: no. of instances of Y = 0 ; c: no. of instances of Y = 1

  • Mantel (1973)...

little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.

  • With 2008 computing, we can use a b/c ratio as high as 100,

and thereby extract virtually all of the information in the base.

slide-56
SLIDE 56

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

OUR HAZARD MODEL FOR SHEP DATA

log[h] = ΣβkXk, where X1 = Age (in yrs) - 60 X2 = Indicator of male gender X3 = Indicator of Black race X4 = Systolic BP (in mmHg) - 140 ...................................................................... X5 = Indicator of active treatment ...................................................................... X6 = T ...................................................................... X7 = X5 × X6. (non-proportional hazards)

slide-57
SLIDE 57

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

OUR HAZARD MODEL FOR SHEP DATA

log[h] = ΣβkXk, where X1 = Age (in yrs) - 60 X2 = Indicator of male gender X3 = Indicator of Black race X4 = Systolic BP (in mmHg) - 140 ...................................................................... X5 = Indicator of active treatment ...................................................................... X6 = T ...................................................................... X7 = X5 × X6. (non-proportional hazards)

slide-58
SLIDE 58

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-59
SLIDE 59

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-60
SLIDE 60

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-61
SLIDE 61

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-62
SLIDE 62

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-63
SLIDE 63

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-64
SLIDE 64

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-65
SLIDE 65

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

PARAMETER ESTIMATION

  • Formed person-moments dataset pertaining to:
  • case series of size c = 263 (Y = 1)

and

  • (randomly-selected) base series of size b = 26, 300

(Y = 0).

  • Each of 26,563 rows contained realizations of
  • X1, . . . , X7
  • Y
  • offset = log(20, 894/26, 300).
  • Logistic model fitted to data in the two series.
slide-66
SLIDE 66

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATASET: c = 263; b = 10 × 263

  • 1

2 3 4 5 6 1000 2000 3000 4000

Time Persons

slide-67
SLIDE 67

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTED VALUES

Proposed Cox logistic regression regression βage−60 0.041 0.041 0.041 βImale 0.257 0.258 0.259 βIblack 0.302 0.301 0.303 βSBP−140 0.017 0.017 0.017 .................... βIActive treatment

  • 0.200
  • 0.435
  • 0.435

.................... β0

  • 5.390
  • 5.295

βt

  • 0.014
  • 0.057

βt×IActive treatment

  • 0.107
  • Fitted logistic function represents log[hx(t)]
  • → cumulative hazard HX(t), and, thus, X-specific risk.
slide-68
SLIDE 68

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTED VALUES

Proposed Cox logistic regression regression βage−60 0.041 0.041 0.041 βImale 0.257 0.258 0.259 βIblack 0.302 0.301 0.303 βSBP−140 0.017 0.017 0.017 .................... βIActive treatment

  • 0.200
  • 0.435
  • 0.435

.................... β0

  • 5.390
  • 5.295

βt

  • 0.014
  • 0.057

βt×IActive treatment

  • 0.107
  • Fitted logistic function represents log[hx(t)]
  • → cumulative hazard HX(t), and, thus, X-specific risk.
slide-69
SLIDE 69

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTED VALUES

Proposed Cox logistic regression regression βage−60 0.041 0.041 0.041 βImale 0.257 0.258 0.259 βIblack 0.302 0.301 0.303 βSBP−140 0.017 0.017 0.017 .................... βIActive treatment

  • 0.200
  • 0.435
  • 0.435

.................... β0

  • 5.390
  • 5.295

βt

  • 0.014
  • 0.057

βt×IActive treatment

  • 0.107
  • Fitted logistic function represents log[hx(t)]
  • → cumulative hazard HX(t), and, thus, X-specific risk.
slide-70
SLIDE 70

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FITTED VALUES

Proposed Cox logistic regression regression βage−60 0.041 0.041 0.041 βImale 0.257 0.258 0.259 βIblack 0.302 0.301 0.303 βSBP−140 0.017 0.017 0.017 .................... βIActive treatment

  • 0.200
  • 0.435
  • 0.435

.................... β0

  • 5.390
  • 5.295

βt

  • 0.014
  • 0.057

βt×IActive treatment

  • 0.107
  • Fitted logistic function represents log[hx(t)]
  • → cumulative hazard HX(t), and, thus, X-specific risk.
slide-71
SLIDE 71

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-72
SLIDE 72

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-73
SLIDE 73

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-74
SLIDE 74

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-75
SLIDE 75

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-76
SLIDE 76

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

ESTIMATED 5-YEAR RISK OF STROKE

Risk I h(t) H(5) CI(5) ∆ [ ID(t) ] [ 5

0 hx(t)dt ]

[ 1 − e−H(5) ] Low e−4.86−0.014t 0.037 0.036 1 e−5.06−0.124t 0.024 0.024 1.2% High 0.16 1 0.10 6% Overall 0.076 1 0.049 2.7% Low: 65 year old white female with a SBP of 160 mmHg. High: 80 year old black male with a SBP of 180 mmHg

slide-77
SLIDE 77

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

1 2 3 4 5 5 10 15 Prospective time (years) Cumulative incidence (%) (a.0): 80 year old black male, SBP=180 (a.1) (b.1): 65 year old white female, SBP=160 (b.0) semi−parametric (Cox) proposed

slide-78
SLIDE 78

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

Points

1 2 3 4 5 6 7 8 9 10

Age

60 65 70 75 80 85 90 95 100

Male

1

Black

1

SBP

155 165 175 185 195 205 215

I

1

t

6

I.t

6 5 4 3 2 1

Total Points

2 4 6 8 10 12 14 16 18 20 22

Linear Predictor

−6 −5.5 −5 −4.5 −4 −3.5 −3 −2.5

5−year Risk (%) if not treated

3 4 5 6 7 8 9 12 15 18

5−year Risk (%) if treated

2 3 4 5 6 7 8 9 12 15 18

slide-79
SLIDE 79

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 1. FEATURES
  • Smooth-in-t h(t)—and CI’s– not new; fitting procedure is.
  • Keys: 1. representative sampling of the base; 2. offset.
  • b/c =100 feasible and adequate.
slide-80
SLIDE 80

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 1. FEATURES
  • Smooth-in-t h(t)—and CI’s– not new; fitting procedure is.
  • Keys: 1. representative sampling of the base; 2. offset.
  • b/c =100 feasible and adequate.
slide-81
SLIDE 81

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 1. FEATURES
  • Smooth-in-t h(t)—and CI’s– not new; fitting procedure is.
  • Keys: 1. representative sampling of the base; 2. offset.
  • b/c =100 feasible and adequate.
slide-82
SLIDE 82

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 1. FEATURES
  • Smooth-in-t h(t)—and CI’s– not new; fitting procedure is.
  • Keys: 1. representative sampling of the base; 2. offset.
  • b/c =100 feasible and adequate.
slide-83
SLIDE 83

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-84
SLIDE 84

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-85
SLIDE 85

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-86
SLIDE 86

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-87
SLIDE 87

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-88
SLIDE 88

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-89
SLIDE 89

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 2. MODELLING POSSIBILITIES

Log-linear modelling for hx(t) via logistic regression ...

  • Standard methods to assess model fit.
  • Wide range of functional forms for the t-dimension of hx(t).
  • Effortless handling of censored data.
  • Flexibility in modeling non-proportionality over t.
  • Splines for h(t) rather than hr(t).
slide-90
SLIDE 90

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-91
SLIDE 91

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-92
SLIDE 92

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-93
SLIDE 93

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-94
SLIDE 94

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-95
SLIDE 95

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 3. CLINICAL POSSIBILITIES / DESIDERATA
  • PDAs (personal digital assistants) → online information.
  • Profile-specific risk estimates for various interventions.
  • Already, online calculators: risk of MI, Breast/Lung Cancer;

probability of extra-organ spread of cancer.

  • RCT reports should contain: suitably designed risk

function, fitted parameters of hx(t), and risk function.

  • (Offline:) risk scores → risks via nomogram/table.
slide-96
SLIDE 96

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-97
SLIDE 97

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-98
SLIDE 98

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-99
SLIDE 99

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-100
SLIDE 100

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-101
SLIDE 101

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-102
SLIDE 102

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • 4. SUMMARY
  • Profile-specific risk (CI) functions are important.
  • Two paths to CI, via...
  • Steps-in-time S0(t)
  • Smooth-in-time IDx(t).
  • New simple estimation method for broad class of

smooth-in-time ID / hazard functions.

  • Biostatistics & Epidemiology methods: a little more unified?
slide-103
SLIDE 103

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FUNDING / CO-ORDINATES / SOFTWARE

Natural Sciences and Engineering Research Council of Canada James.Hanley@McGill.CA http://www.biostat.mcgill.ca/hanley

http:/ p: /ww ww www.m w.m w.m mcgill.ca/ ca/ a epi epi epi epi-bi biost

  • st

s at- at- a occh/g /g grad/bi b ostatisti t cs/

slide-104
SLIDE 104

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-105
SLIDE 105

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-106
SLIDE 106

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-107
SLIDE 107

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-108
SLIDE 108

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-109
SLIDE 109

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-110
SLIDE 110

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-111
SLIDE 111

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THE ETIOLOGIC STUDY IN EPIDEMIOLOGY

  • Aggregate of population-time: ‘study base.’
  • All instances of event in study base identified → study’s

‘case series’ of person-moments, characterized by Y = 1.

  • Study base – infinite number of person-moments – sampled

→ corresponding ‘base series,’ characterized by Y = 0.

  • Document potentially etiologic antecedent, modifiers of

incidence-density ratio, & confounders.

  • Fit Logistic model

.............................................................................................

  • With our approach . . .
  • → Incidence density, hx(u) in study base.
  • → CIx(t) = 1 − exp{−Hx(t)} = 1 − exp{−

t

0 hx(u)du}.

slide-112
SLIDE 112

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

  • ● ●
  • ● ● ●
  • ● ●
  • ● ● ●
  • 5

10 15 20 25 −6.0 −5.6 −5.2 −4.8

intercept sample

  • ● ● ●
  • ● ●
  • ● ● ●
  • ● ● ● ● ● ●

5 10 15 20 25 0.02 0.04 0.06

age sample

  • ● ●
  • ● ● ● ●
  • 5

10 15 20 25 0.0 0.2 0.4

male sample

  • ● ● ● ●
  • ● ●
  • ● ● ● ● ● ● ● ● ● ● ● ●

5 10 15 20 25 0.0 0.2 0.4 0.6

black sample

  • ● ● ● ●
  • ● ●
  • ● ● ●
  • ● ● ● ● ● ● ●
  • 5

10 15 20 25 0.005 0.015 0.025

sbp sample

  • ● ● ● ●
  • ● ● ●
  • ● ●
  • ● ● ● ●

5 10 15 20 25 −0.8 −0.4 0.0 0.4

tx sample

  • ● ●
  • ● ●
  • ● ● ●
  • ● ● ● ● ●
  • ● ●

5 10 15 20 25 −0.15 −0.05 0.05

t sample

  • ● ● ● ● ●
  • ● ● ●
  • ● ●
  • ● ●
  • ● ●

5 10 15 20 25 −0.3 −0.1 0.0 0.1

tx*t sample

  • ● ●
  • ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ● ●
  • 5

10 15 20 25 0.10 0.15 0.20

5 year Risk sample

STABILITY ?

Point and (95% confidence) interval estimates of hazard function, and of 5-year risk for a specific (untreated) high-risk

  • profile. Fits are based
  • n 25 different random

samples of b =26,300 from the infinite number of person-moments in the study base, and same c = 263 cases each run.

slide-113
SLIDE 113

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATASET FOR LOGISTIC REGRESSION

(SCHEMATIC)

1 69 1 0 166 1 0.57 1 69 0 1 161 0 1.79 1 85 0 1 184 0 3.39 0 69 0 0 182 0 1.70 0 73 0 1 167 1 2.02 0 73 1 0 199 0 0.62 0 81 1 0 161 0 1.16 0 70 0 1 185 0 1.11 0 72 0 0 172 1 3.56 Y Age B M SBP I t

1000 2000 3000 4000 5000 1 2 3 4 5 6 Prognostic time (years) Persons

slide-114
SLIDE 114

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DATA ANALYZED BY EFRON, 1988

Arm A [ time-to-recurrence of head & neck cancer ]

  • Cum. Inc. estimates – K-M, Efron & Proposed

10 20 30 40 50 Months

Efron Proposed

Cumulative Incidence Incidence Density

0.05 0.1 0.15 0.2 0.4 0.6 0.8 1

  • Inc. density estimates – Efron & Proposed

Arm A vs. Arm B

.

10 20 30 40 50 Months Cumulative Incidence Incidence Density

0.05 0.1 0.15 0.2 0.4 0.6 0.8 1

A B A: Radiation Alone B: Radiation + Chemotherapy A B

.

slide-115
SLIDE 115

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-116
SLIDE 116

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-117
SLIDE 117

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-118
SLIDE 118

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-119
SLIDE 119

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-120
SLIDE 120

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-121
SLIDE 121

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY THIS CULTURE?

Predominant use of the semi-parametric ‘Cox model.’

  • Time is considered as a non-essential element.
  • Primary focus is on hazard ratios.
  • Form of hazard per se as function of time left unspecified.
  • Attention deflected from estimates of profile-specific CI.
  • Many unaware that software provides profile-specific CI.
slide-122
SLIDE 122

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-123
SLIDE 123

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-124
SLIDE 124

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-125
SLIDE 125

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-126
SLIDE 126

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-127
SLIDE 127

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

DIFFERENT CULTURE

Practice of reporting estimates of profile-specific probability more common when no variable element of time of outcome.

  • Estimates can be based on logistic regression.
  • Examples
  • (“Framingham-based”) estimated 6-year risk for Myocardial

Infarction as function of set of prognostic indicators;

  • estimated probability that prostate cancer is
  • rgan-confined, as a function of diagnostic indicators.
slide-128
SLIDE 128

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-129
SLIDE 129

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-130
SLIDE 130

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-131
SLIDE 131

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-132
SLIDE 132

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-133
SLIDE 133

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

COX MODEL

Hazard modelled, semi-parametrically, as hx(t) = [exp(βx)]λ0(t),

  • T = t: a point in prognostic time,
  • β : vector of parameters with unknown values;
  • X = x : vector of realizations for variates based on

prognostic indicators and interventions;

  • λ0(t) : hazard as a function – unspecified – of t

corresponding to x = 0.

slide-134
SLIDE 134

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-135
SLIDE 135

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-136
SLIDE 136

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-137
SLIDE 137

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-138
SLIDE 138

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-139
SLIDE 139

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-140
SLIDE 140

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

FROM ˆ β TO PROFILE-SPECIFIC CI’s

  • Obtain

S0(t) { the complement of CI0(t) }.

  • Estimate risk (cum. incidence) CIx(t) for a particular

determinant pattern X = x as CIx(t) = 1 − S0(t)

exp( ˆ βx)

.

  • Breslow suggested an estimator of λ0(t) that gives a

smooth estimate of CIx(t). However, step function estimators of Sx(t), with as many steps as there are distinct failure times in the dataset, are more easily derived, and the only ones available in most packages.

  • Step-function S0(t) estimators: “Kaplan-Meier” type

(“Breslow”) and Nelson-Aalen. heuristics: jh, Epidemiology 2008

  • Clinical Trials article (Julien & Hanley, 2008) encourages

investigators to make more use of these for ‘profiling’.

slide-141
SLIDE 141

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-142
SLIDE 142

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-143
SLIDE 143

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-144
SLIDE 144

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-145
SLIDE 145

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-146
SLIDE 146

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-147
SLIDE 147

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-148
SLIDE 148

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-149
SLIDE 149

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

THESE ARE NOT ISOLATED CASES

Survey of RCT’s : Jan - June 2006 : NEJM, JAMA, The Lancet:

  • Survival statistics from clinical trials – and non-randomised

studies – limited to the “average” patient

  • Cox regression used merely to ensure ‘fairer comparisons’
  • Seldom used to provide profile-specific estimates of

survival and survival differences

  • Despite range of risk profiles in each study, and common

use of Cox regression, none presented info. that would allow reader to assess Tx-specific risk for a specific profile, e.g., for a specific age-sex combination.

slide-150
SLIDE 150

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY STUDY PROFILE-SPECIFIC RISK FUNCTIONS

  • I. Prob[surv. benefit] if man, aged 58, PSA 9.1, ‘Gleason 7’

prostate cancer, selects radical over conservative Tx?

  • Cannot turn info. into surv. ∆ for men with pt’s profile.
  • II. Report of classic RCT: ?? 5-year risk of stroke for a

65-year old white woman with a SBP of 160 mmHg and how much it is lowered if she were to take anti-hypertensive drug treatment.

  • Report did not provide information from which to estimate

the risk, and risk difference, for this specific profile.

slide-151
SLIDE 151

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY STUDY PROFILE-SPECIFIC RISK FUNCTIONS

  • I. Prob[surv. benefit] if man, aged 58, PSA 9.1, ‘Gleason 7’

prostate cancer, selects radical over conservative Tx?

  • Cannot turn info. into surv. ∆ for men with pt’s profile.
  • II. Report of classic RCT: ?? 5-year risk of stroke for a

65-year old white woman with a SBP of 160 mmHg and how much it is lowered if she were to take anti-hypertensive drug treatment.

  • Report did not provide information from which to estimate

the risk, and risk difference, for this specific profile.

slide-152
SLIDE 152

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY STUDY PROFILE-SPECIFIC RISK FUNCTIONS

  • I. Prob[surv. benefit] if man, aged 58, PSA 9.1, ‘Gleason 7’

prostate cancer, selects radical over conservative Tx?

  • Cannot turn info. into surv. ∆ for men with pt’s profile.
  • II. Report of classic RCT: ?? 5-year risk of stroke for a

65-year old white woman with a SBP of 160 mmHg and how much it is lowered if she were to take anti-hypertensive drug treatment.

  • Report did not provide information from which to estimate

the risk, and risk difference, for this specific profile.

slide-153
SLIDE 153

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY STUDY PROFILE-SPECIFIC RISK FUNCTIONS

  • I. Prob[surv. benefit] if man, aged 58, PSA 9.1, ‘Gleason 7’

prostate cancer, selects radical over conservative Tx?

  • Cannot turn info. into surv. ∆ for men with pt’s profile.
  • II. Report of classic RCT: ?? 5-year risk of stroke for a

65-year old white woman with a SBP of 160 mmHg and how much it is lowered if she were to take anti-hypertensive drug treatment.

  • Report did not provide information from which to estimate

the risk, and risk difference, for this specific profile.

slide-154
SLIDE 154

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

WHY STUDY PROFILE-SPECIFIC RISK FUNCTIONS

  • I. Prob[surv. benefit] if man, aged 58, PSA 9.1, ‘Gleason 7’

prostate cancer, selects radical over conservative Tx?

  • Cannot turn info. into surv. ∆ for men with pt’s profile.
  • II. Report of classic RCT: ?? 5-year risk of stroke for a

65-year old white woman with a SBP of 160 mmHg and how much it is lowered if she were to take anti-hypertensive drug treatment.

  • Report did not provide information from which to estimate

the risk, and risk difference, for this specific profile.

slide-155
SLIDE 155

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STATISTICS AND THE AVERAGE PATIENT

  • For a patient,

HR = IDR = 0.6 not very helpful.

  • Cumulative Incidence:
  • CI0−10 = 15% if Tx = 0; 10% if Tx = 1, more helpful.
  • Not specific to this particular type of patient, if grade &

stage {of Pr Ca} or age/race/sex/SPB {SHEP Study} not near the typical of those in trial.

  • EXAMPLES I. and II. ARE NOT ISOLATED /MADE-UP

...

  • cf. Julien & Hanley ’07
slide-156
SLIDE 156

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STATISTICS AND THE AVERAGE PATIENT

  • For a patient,

HR = IDR = 0.6 not very helpful.

  • Cumulative Incidence:
  • CI0−10 = 15% if Tx = 0; 10% if Tx = 1, more helpful.
  • Not specific to this particular type of patient, if grade &

stage {of Pr Ca} or age/race/sex/SPB {SHEP Study} not near the typical of those in trial.

  • EXAMPLES I. and II. ARE NOT ISOLATED /MADE-UP

...

  • cf. Julien & Hanley ’07
slide-157
SLIDE 157

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STATISTICS AND THE AVERAGE PATIENT

  • For a patient,

HR = IDR = 0.6 not very helpful.

  • Cumulative Incidence:
  • CI0−10 = 15% if Tx = 0; 10% if Tx = 1, more helpful.
  • Not specific to this particular type of patient, if grade &

stage {of Pr Ca} or age/race/sex/SPB {SHEP Study} not near the typical of those in trial.

  • EXAMPLES I. and II. ARE NOT ISOLATED /MADE-UP

...

  • cf. Julien & Hanley ’07
slide-158
SLIDE 158

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STATISTICS AND THE AVERAGE PATIENT

  • For a patient,

HR = IDR = 0.6 not very helpful.

  • Cumulative Incidence:
  • CI0−10 = 15% if Tx = 0; 10% if Tx = 1, more helpful.
  • Not specific to this particular type of patient, if grade &

stage {of Pr Ca} or age/race/sex/SPB {SHEP Study} not near the typical of those in trial.

  • EXAMPLES I. and II. ARE NOT ISOLATED /MADE-UP

...

  • cf. Julien & Hanley ’07
slide-159
SLIDE 159

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

STATISTICS AND THE AVERAGE PATIENT

  • For a patient,

HR = IDR = 0.6 not very helpful.

  • Cumulative Incidence:
  • CI0−10 = 15% if Tx = 0; 10% if Tx = 1, more helpful.
  • Not specific to this particular type of patient, if grade &

stage {of Pr Ca} or age/race/sex/SPB {SHEP Study} not near the typical of those in trial.

  • EXAMPLES I. and II. ARE NOT ISOLATED /MADE-UP

...

  • cf. Julien & Hanley ’07
slide-160
SLIDE 160

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.

slide-161
SLIDE 161

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.

slide-162
SLIDE 162

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.

slide-163
SLIDE 163

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.

slide-164
SLIDE 164

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.

slide-165
SLIDE 165

Introduction Smooth-in-time hazard functions How we fit fully-parametric hazard model Illustration Comments/Summary

How large should b be on relation to c?

Mantel (1973)... [our notation, and slight change of wording]

By the reasoning that cb/(c + b) [= (1/c + 1/b)−1] measures the relative information in a comparison of two averages based on sample sizes of c and b respectively, we might expect by analogy, which would of course not be exact in the present case, that this approach would result in only a moderate loss of information. (The practicing statistician is generally aware of this kind of thing. There is little to be gained by letting the size of one series, b, become arbitrarily large if the size of the other series, c, must remain fixed.)

  • With 2008 computing, we can use a b/c ratio as high as 100.
  • b/c = 100 → Var[ˆ

β]b/c=100 = 1.01 × Var[ˆ β]b/c=∞, i.e. 1% ↑

  • Var[ˆ

β] ∝ 1/c + 1/100c rather than 1/c + 1/∞.