Bayesian Piecewise Constant Hazard Model
1.1 Bayesian survival modelling using the piecewise con- stant hazard model
Bayesian inference may be implemented to the piecewise constant hazard model with time varying covariate effects. Much work has been done on piecewise constant hazard models where the hazard parameters are independent between intervals. See, for example Genest & Kalbfleisch (1998);Ibrahim et al. (2001). Kim et al. (2007) proposed a class of semi-parametric models for survival data in which they constructed a dynamic model for piecewise constant hazard functions over a finite partition of the time axis. Gamerman (1991) introduced the dynamic generalised linear model into survival analysis where he applied it to the piecewise constant hazard model.
Hence, the cut points for the piecewise constant hazard model need to be chosen.
1.1.1 Choosing the cut points of time intervals for the piecewise constant hazard model
The piecewise constant hazard model requires the discretisation of the time axis into intervals and thereby choosing the number and locations of the cut points. Some authors have suggested defining the intervals as beginning and ending at the observed failure times while Kalbfleisch & Prentice (1973) suggested selecting intervals independently of the data.
West (1992), in an attempt to define intervals, suggested shorter intervals over the first few years since deaths in cancer data are common in early stages and longer intervals in the later years since there are fewer deaths. One possibility is to choose our cut points deliberately so that each interval has the same number of deaths. A researcher would
time interval using prior judgements.
Suppose the time axis is discretised to have ten time intervals so that it can be assumed that 10% of the events (e.g deaths) occur in each interval. The event times are supposed to follow approximately an exponential distribution. The survival function of an exponential distribution with parameter λ is given by exp {−λt}.
Given the cut points τ1 < τ2 < . . . < τJ −1, the probability of surviving until τj will be exp {−λτj} = 1 − 0.1j
so
−λτj = log (1 − 0.1j) and
τj = −1
λlog (1 − 0.1j)
Let the mean of the exponential distribution (1λ) be ν and ζ = 1J. So,
τj = −ν log (1 − jζ) (1.1)
1.1.2 Construction of the likelihood for piecewise constant hazard model Suppose that associated with every patient is a time which could either be a death or censoring time. Every interval is associated with three different groups of patients, patients who died during the interval, patients who were censored during the interval and patients who survived the interval. The contribution of the likelihood from the patients in every interval will depend on the three different groups of patients. The likelihood contribution L of the patients is given as
L =
n
Y
i=1 J
Y
j=1
Li,j
where
Li,j =
1 if ti < τj−1
λδi,ji,jexp {−λ(ti− τj−1)} if τj−1 ≤ ti< τj
exp {−λ(τj− τj−1)} if ti ≥ τj where n is the number of patients
J is the number of time intervals
ti is the event or censoring time of the ith patient.
λi,j is the hazard of the ith patient in the jth interval.
δi,j is the indicator of death or censoring of the ith patient in the jth interval.
Within each interval the conditional survival distribution given that the patient is alive
and uncensored at time τj−1, is exponential since the hazard is constant.
exponential distribution with probability density function fi(ti|λi,j) given by
fi(ti|λi,j) = λi,jexp {−λi,jti} and survival function Si(ti|λi,j) given by
Si(ti|λi,j) = exp {−λi,jti}
The likelihood contribution of patients L for each interval is given by
L = Y
i∈Dj
λi,jexp {−λi,j(ti− τj)}Y
i∈Cj
exp {−λi,j(ti− τj)}Y
i∈Fj
exp {−λi,j(τj+1− τj)}
where Dj is the set of patients who died in the jth interval and have a time which is the difference between the patient’s death time and the time of the start of the interval.
Cj is the set of patients who were censored in the jthinterval and have a time which is the difference between the patient’s censored time and the time of the start of the interval.
Fj is the set of patients who neither died or censored in the jth interval but survived until the j + 1th interval and have a time which is the length of the interval.
The logarithm of the likelihood ` for the jth interval is given by
` = X
i∈Dj
log λi,j− X
i∈Dj∪Cj
λi,j(ti− τj) −X
i∈Fj
λi,j(τj+1− τj)
1.1.3 Construction of prior distribution for the parameters of piecewise constant hazard model
In a Bayesian context, there is an advantage of constructing a prior that makes the hazard parameters in neighbouring intervals to be correlated. In practice, a piecewise constant hazard function which has prior distribution in which the parameters are correlated over time could be used. Genest & Kalbfleisch (1978) and Ibrahim et al. (2001) made the prior distribution of the parameters independent between time intervals. However, it would be reasonable to think that the hazards in the intervals which are closely together are likely to be similar. McKeague & Tighiouart (2000) gave a dependent prior using a Markov random field.
The prior used in the illustration of the piecewise constant hazard model is assumed to take the form of a realisation of a stochastic process which could either be stationary or non stationary. If the prior is made stationary then each parameter gets the same variance in each time period. For example, an autoregressive process with autoregressive parameter
which governs how strong the correlation between time periods will be might be used. The autoregressive parameter ρ > 0 might be selected to give positive autocorrelation. Then, a first order autoregressive process is chosen so that for any given value of autoregressive parameter ρ, the process is given by
βs− µ = ρ(βs−1− µ) + s
where s is normally distributed with mean zero and variance σ2 (Chatfield, 2004).
This can be rearranged this as
Yt= µ + φ(Yt−1− µ) + t By repeated substitution, the following shall be obtained
Yt= µ + t+ φt−1+ φ2t−2+ ....
Yt= µ +
∞
X
p=0
φpt−p
The expectation of βj, E(βj) is µ . The variance of the process is
Var(βs) = σ2
∞
X
p=0
ρ2p= σ2 1 − ρ2
Making the process stationary makes the prior variance the same in all intervals. The value of ρ can also be chosen following the recommendations by Revie et al. (2003) on the construction of correlations. If ρ = 1, the process will be non-stationary. In some other cases, it is feasible to assume that more knowledge is possessed at the starting time and less prior experience as the time goes on and for this reason the variance is allowed to increase or decrease as time increases. The most flexible thing that can possibly be done is to have a different hazard for each possible combination of the categories of covariates. For instance, the continous covariates may be converted to categorical or ordered categories.
1.1.4 Construction of prior distribution for the frailty variance
Supposing that the logarithm of the frailty has a normal distribution with zero mean and a variance of σf2. The prior distribution for the frailty variance σ2f can be constructed.It is then feasible to suppose that the precision τf = σf−2 has a gamma prior distribution.
If the parameters remained constant over all time intervals then an individual’s lifetime would have an exponential distribution since the hazard λ of the individual is constant.
The individual’s mean lifetime will be 1/λ. Doubling λ would half the mean lifetime and
correspond to a log-frailty of log 2. Suppose, for example, σf = log 2 then this corresponds to τf = (log 2)1 2. Thus, the prior mean for τf is set to be (log 2)1 2. So, if τf ∼ Ga(af, bf),
then af
bf = 1
(log 2)2 (1.2)
The prior variance of τf is abf2 f
and the coefficient of variation is √1a
f. af is chosen to reflect our prior uncertainty in τf. Since little prior information is known on τf, a small value for af is chosen. However, so that the prior density of τf is at τf, af > 1, say 1.1, is chosen.
Combining this with Equation 6.2 gives bf = 0.53.
1.2 Application: Bayesian survival modelling using the piece- wise constant hazard model
The Bayesian survival modelling using the piecewise constant hazard model to the two example data sets described in previous chapter. Equation (6.1) to the two example data sets to choose the cut points is applied. Based on a prior assessment of the mean survival life time of a patient, ν is set as 3 years in the SNLG data set, and hence get the cut points are
0.105, 0.223, 0.357, 0.511, 0.693, 0.916, 1.204, 1.609, 2.303.