[PPT] - Shape Constrained Nonparametric Baseline Estimators in the Cox Model PowerPoint Presentation

SLIDE 1

Leiden University

The Netherlands

Leiden University

Shape Constrained Nonparametric Baseline Estimators in the Cox Model

Joint work with Rik Lopuha¨ a (TU Delft)

Tina Nane, Center for Science and Technology Studies.

SLIDE 2

Leiden University

Basic concepts in survival analysis

Events of interest - death, onset (relapse) of a disease, etc
Let X ∼ F denote the survival time, with density f
Functions that characterize the distribution of X
The survival function S(x) = P(X > x)
The hazard function

λ(x) = lim

∆x↓0

P(x ≤ X < x + ∆x|X ≥ x) ∆x = f (x) S(x)

The cumulative hazard function Λ(x) =

x

0 λ(u) du

Let C ∼ G denote the censoring time
Let Z denote the covariate (age, weight, treatment)

SLIDE 3

Leiden University

The Cox proportional hazards model

Right-censored data (Ti, ∆i, Zi), for i = 1, . . . , n
T = min(X, C) denotes the follow-up time
∆ = {X ≤ C} is the censoring indicator
The covariate vector Z ∈ Rp is time invariant
X|Z⊥C|Z
The Cox model

λ(x|z) = λ0(x)eβ′

0z,

where

λ0 is the underlying baseline hazard function
β0 ∈ Rp is the vector of the underlying regression coefficients

SLIDE 4

Leiden University

Assumptions

X ∼ F, C ∼ G, T ∼ H
F, G are assumed absolutely continuous.
(A.1) Let τF, τG and τH be the end points of the support of

F, G, H. Then τH = τG < τF

(A.2) There exists ε > 0 such that

sup

|β−β0|≤ε

❊

|Z|2 e2β′Z

< ∞, where | · | denotes the Euclidean norm

SLIDE 5

Leiden University

Estimating monotone baseline hazards in the Cox model

The NPMLE ˆ

λn of a nondecreasing baseline hazard

Let T(1) ≤ · · · ≤ T(n) denote the ordered follow-up times
For β fixed, maximize the (log)likelihood function over all

nondecreasing baseline hazards and obtain ˆ λn(x; β)

zero, for x < T(1)
constant on [T(i), T(i+1)), for i = 1, 2, . . . , n − 1
∞, for x ≥ T(n)
Replace β in ˆ

λn(x; β) by ˆ βn, the maximum partial likelihood estimator

We propose ˆ

λn(x) = ˆ λn(x; ˆ βn) as our estimator of λ0

SLIDE 6

Leiden University

Estimating monotone baseline hazards in the Cox model

Grenander-type estimator
1. Start from the Breslow estimator Λn of the

baseline cumulative hazard Λ0

100 200 300 400 500 600 700 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 survival times baseline cumulative hazards estimates

SLIDE 7

Leiden University

Estimating monotone baseline hazards in the Cox model

2. Take its Greatest Convex Minorant (GCM)

Λn

100 200 300 400 500 600 700 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 survival times baseline cumulative hazards estimates

SLIDE 8

Leiden University

Estimating monotone baseline hazards in the Cox model

3. The Grenander-type estimator ˜

λn is defined as the left-hand slope of Λn

200 400 600 0.000 0.005 0.010 0.015 0.020 survival times baseline hazards estimates

SLIDE 9

Leiden University

Estimating monotone baseline hazards in the Cox model

Another estimator of a nondecreasing baseline hazard was

proposed by Chung and Chang (1994)

Consistency: ˆ

λC

n (x) → λ0(x) a.s.

No limiting distribution available

SLIDE 10

Leiden University

Estimating monotone baseline hazards in the Cox model

Comparison between the three baseline hazard estimators

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 survival times hazard function estimates

1000 Weibull(3/2,1) observations

LS estimator CC estimator MLE estimator True hazard

SLIDE 11

Leiden University

Estimating monotone baseline densities in the Cox model

Grenander-type estimator of a monotone baseline density f0
Since

F0(x) = 1 − e−Λ0(x)

We propose

Fn(x) = 1 − e−Λn(x), where Λn is the Breslow estimator.

Define the nonincreasing Grenander-type estimator ˜

fn as the left derivative of the Least Concave Majorant (LCM) of Fn

SLIDE 12

Leiden University

Pointwise consistency

Theorem 1 (Lopuha¨

a & Nane, 2013a)

Assume that (A.1) and (A.2) hold and that λ0 is nondecreasing on [0, ∞) and f0 is nonincreasing on [0, ∞). Then, for any x0 ∈ (0, τH), λ0(x0−) ≤ lim inf

n→∞

ˆ λn(x0) ≤ lim sup

n→∞

ˆ λn(x0) ≤ λ0(x0+), λ0(x0−) ≤ lim inf

n→∞

˜ λn(x0) ≤ lim sup

n→∞

˜ λn(x0) ≤ λ0(x0+), f0(x0+) ≤ lim inf

n→∞

˜ fn(x0) ≤ lim sup

n→∞

˜ fn(x0) ≤ f0(x0−), with probability one. The values λ0(x0−), f0(x0−) and λ0(x0+), f0(x0+) denote the left (right) limit of the baseline hazard and density function at x0.

SLIDE 13

Leiden University

Asymptotic distribution

Typical features for isotonic estimators
n1/3 rate of convergence
non-normal limiting distribution
Groeneboom (1985) recipe
1. Define an inverse process
2. Use the switching relationship
3. Use the Hungarian embedding (KMT construction) to derive

the limiting distribution of the inverse process

4. Obtain the limiting distribution of the monotone estimator

SLIDE 14

Leiden University

Asymptotic distribution

For the Grenander-type estimator ˜

λn

1. Inverse process

Un(a) = argmin

x∈[0,T(n)]

{Λn(x) − ax} , for a > 0, where argmin denotes the largest location of the minimum

2. For any a > 0, the following switching relationship holds

Un(a) ≥ x ⇔ ˜ λn(x) ≤ a, with probability one

SLIDE 15

Leiden University

Asymptotic distribution

For a fixed x0,

P

n1/3

˜ λn(x0) − λ0(x0)

> a
= P
n1/3

Un(λ0(x0) + n−1/3a) − x0

< 0
Moreover

n1/3 Un(λ0(x0) + n−1/3a) − x0

= argmin

x∈In(x0)

{❩n(x) − ax}, where In(x0) = [−n1/3x0, n1/3(T(n) − x0)] and for x ∈ In(x0) ❩n(x) = n2/3 Λn(x0 + n−1/3x) − Λ0(x0 + n−1/3x)

− [Λn(x0) − Λ0(x0)]

+ Λ0(x0 + n−1/3x) − Λ0(x0) − n−1/3λ0(x0)x

SLIDE 16

Leiden University

Asymptotic distribution

3. No embedding available for the Breslow estimator
3’. Linearization result of the Breslow estimator

(Lopuha¨ a & Nane, 2013b)

Let Φ(β0, x) = ❊[{T ≥ x}eβ′

0Z]

Theorem 2 (Lopuha¨

a & Nane, 2013a)

Assume (A.1) and (A.2) and let x0 ∈ (0, τH). Suppose that λ0 is nondecreasing on [0, ∞) and continuously differentiable in a neighborhood of x0, with λ0(x0) = 0 and λ′

0(x0) > 0. Then,

n1/3

Φ(β0, x0)

4λ0(x0)λ′

0(x0)

1/3 ˜ λn(x0) − λ0(x0)

→d argmin

t∈❘

{W (t) + t2}, where W is a standard two-sided Brownian motion

riginating from zero.

SLIDE 17

Leiden University

Asymptotic distribution

Theorem 3 (Lopuha¨

a & Nane, 2013a)

Assume (A.1) and (A.2) and let x0 ∈ (0, τH). Suppose that λ0 is nondecreasing on [0, ∞) and continuously differentiable in a neighborhood of x0, with λ0(x0) = 0 and λ′

0(x0) > 0. Then,

n1/3

Φ(β0, x0)

4λ0(x0)λ′

0(x0)

1/3 ˆ λn(x0) − λ0(x0)

→d argmin

t∈❘

{W (t) + t2}, where W is a standard two-sided Brownian motion originating from zero.

SLIDE 18

Leiden University

Asymptotic distribution

Theorem 4 (Lopuha¨

a & Nane, 2013a)

Assume (A.1) and (A.2) and let x0 ∈ (0, τH). Suppose that f0 is nonincreasing on [0, ∞) and continuously differentiable in a neighborhood of x0, with f0(x0) = 0 and f ′

0(x0) < 0. Let F0 be the

baseline distribution function. Then, n1/3

Φ(β0, x0)

4f0(x0)f ′

0(x0)[1 − F0(x0)]

1/3 ˜ fn(x0) − f0(x0)

→d argmin

t∈❘

{W (t) + t2}, where W is a standard two-sided Brownian motion originating from zero.

SLIDE 19

Leiden University

Hypothesis testing

Likelihood ratio test of H0 : λ0(x0) = θ0 versus

H1 : λ0(x0) = θ0

Let Lβ(λ0) the (log)likelihood function
For fixed β ∈ ❘p, x0 ∈ (0, τH) and θ0 ∈ (0, ∞) fixed

maximize Lβ(λ0) under H0

Propose ˆ

λ0

n(x) = ˆ

λ0

n(x; ˆ

βn) as the constrained NPMLE

SLIDE 20

Leiden University

Hypothesis testing

Recall ˆ

λn(x) = ˆ λn(x; ˆ βn), the unconstrained NPMLE estimator of a nonincreasing λ0

By Theorem 3,

n1/3 ˆ λn(x0) − λ0(x0)

→d

4λ0(x0)λ′

0(x0)

Φ(β0, x0)

1/3

argmin

t∈❘

{W (t) + t2} ≡ C(x0)argmin

t∈❘

{W (t) + t2} ≡ C(x0) 2 g(0), where g(x) is the slope at x of the GCM of {❲(t) + t2}

SLIDE 21

Leiden University

Hypothesis testing

Similarly, it can be shown that

n1/3 ˆ λ0

n(x0) − λ0(x0)

→d

C(x0) 2 g0(0), where g0 is the constrained slope process of the GCM of {❲(t) + t2}

SLIDE 22

Leiden University

Hypothesis testing

Banerjee & Wellner (2001)

SLIDE 23

Leiden University

Hypothesis testing

Banerjee & Wellner (2001)

SLIDE 24

Leiden University

Hypothesis testing

Replacing β by ˆ

βn in Lβ(λ0) gives 2 log ξn(θ0) = 2Lˆ

βn(ˆ

λn) − 2Lˆ

βn(ˆ

λ0

n)

Theorem 5 (Nane, 2013)

Suppose that (A.1) and (A.2) hold and let x0 ∈ (0, τH). Assume that λ0 is nondecreasing on [0, ∞) and continuously differentiable in a neighborhood of x0, with λ0(x0) = 0 and λ′

0(x0) > 0. Then, under the null hypothesis,

2 log ξn(θ0) →d ❉, where ❉ =

[(g(u))2 − (g0(u))2]du.

SLIDE 25

Leiden University

Interval estimation

Pointwise confidence intervals for λ0(x0)
Likelihood ratio method

{θ : 2 log ξn(θ) ≤ q(❉, 1 − α) where q(❉, 1 − α) is the (1 − α)th quantile of ❉ (Banerjee & Wellner, 2005)

Asymptotic distribution

[ˆ λn(x0)−n−1/3 ˆ Cn(x0)q(❩, 1−α/2), ˆ λn(x0)+n−1/3 ˆ Cn(x0)q(❩, 1−α/2)], where ❩ = argmin{❲(t) + t2} and q(❩, 1 − α/2) is the (1 − α/2)th quantile of the distribution ❩ (Groeneboom & Wellner, 2001)

SLIDE 26

Leiden University

Current research

Citation analysis
Event of interest - time to first citation, 5th citation, etc
Time frame - first five years after publication (field specific)
Censored data
Covariates - document type, collaboration type, number of

authors, number of pages, etc

Nondecreasing baseline hazard

SLIDE 27

Leiden University

References

BANERJEE, M. and WELLNER, J.A. 2001. Likelihood ratio tests for monotone functions.

The Annals of Statistics. Vol 29, No. 6, 1699 − 1731.

BANERJEE, M. and WELLNER, J.A. 2005. Score statistics for current status data:

comparisons with likelihood ratio and Wald statistics. International Journal od

Biostatistics. Vol 1, Art. 1, 29 pp. (electronic).
CHUNG, D. and CHANG, M.N. 1994. An isotonic estimator of the baseline hazard

function in Cox’s regression model under order restriction. Statistics & Probability letters.

No. 21, 223 − 228.
GREONEBOOM, P. 1985. Estimating a monotone density, In proceedings of the Berkeley

conference in honor of Jerzy Neyman and Jack Kiefer , 2, 539 − 555.

GREONEBOOM, P. and WELLNER, J.A. 2001. Computing Chernoff’s distribution.

Journal of Computational and Graphical Statistics. No. 10, 388 − 400.

KOML ´

OS,J.,MAJOR,P. and TUSN´ ADY, G. 1975. An approximation of partial sums of independent r.v.’s and the sample d.f. Z.Wahrsch. verw. Gebiete, No. 32, 111 − 131.

LOPUHA¨

A, H.P. and NANE, G.F. 2013a. Shape constrained nonparametric estimators of the baseline distribution in Cox proportional hazards model. Scandinavian Journal of

Statistics. Vol. 40, No. 3, 619 − 646.
LOPUHA¨

A, H.P. and NANE, G.F. 2013b. An asymptotic linear representation for the Breslow estimator. Communications in Statistics - Theory and Methods. No. 42, 1314–1324.

NANE, G.F. 2013. A likelihood ratio test for monotone baseline hazard functions in the

Cox model. Submitted. arxiv:1304.1295v1.