7.4 The Piece-Wise Exponential Model
We will consider fitting a proportional hazards model of the usual form
| (7.13) |
7.4.1 A Piece-wise Constant Hazard
Consider partitioning duration into J intervals with cutpoints 0 = t0 < t1 < < tJ = . We will define the j-th interval as [tj-1,tj), extending from the (j-1)-st boundary to the j-th and including the former but not the latter.
We will then assume that the baseline hazard is constant within each interval, so that
| (7.14) |
Clearly, judicious choice of the cutpoints should allow us to approximate reasonably well almost any baseline hazard, using closely-spaced boundaries where the hazard varies rapidly and wider intervals where the hazard changes more slowly.

Figure 7.1 shows how a Weibull distribution with l = 1 and p = 0.8 can be approximated using a piece-wise exponential distribution with boundaries at 0.5, 1.5 and 3.5. The left panel shows how the piece-wise constant hazard can follow only the broad outline of the smoothly declining Weibull hazard yet, as shown on the right panel, the corresponding survival curves are indistinguishable.
7.4.2 A Proportional Hazards Model
let us now introduce some covariates in the context of the proportional hazards model in Equation 7.13, assuming that the baseline hazard is piece-wise constant as in Equation 7.14. We will write the model as
| (7.15) |
Taking logs, we obtain the additive log-linear model
| (7.16) |
The model can be extended to introduce time-varying covariates and time-dependent effects, but we will postpone discussing the details until we study estimation of the simpler proportional hazards model.
7.4.3 The Equivalent Poisson Model
Holford (1980) and Laird and Oliver (1981), in papers produced independently and published very close to each other, noted that the piece-wise proportional hazards model of the previous subsection was equivalent to a certain Poisson regression model. We first state the result and then sketch its proof.
Recall that we observe ti, the total time lived by the i-th individual, and di, a death indicator that takes the value one if the individual died and zero otherwise. We will now define analogous measures for each interval that individual I goes through. You may think of this process as creating a bunch of pseudo-observations, one for each combination of individual and interval.
First we create measures of exposure. Let tij denote the time lived by the i-th individual in the j-th interval, that is, between tj-1 and tj. If the individual lived beyond the end of the interval, so that ti > tj, then the time lived in the interval equals the width of the interval and tij = tj-tj-1. If the individual died or was censored in the interval, i.e. if tj-1 < ti < tj, then the timed lived in the interval is tij = ti-tj-1, the difference between the total time lived and the lower boundary of the interval. We only consider intervals actually visited, but obviously the time lived in an interval would be zero if the individual had died before the start of the interval and ti < tj-1.
Now we create death indicators. Let dij take the value one if individual I dies in interval j and zero otherwise. Let j(i) indicate the interval where ti falls, i.e. the interval where individual I died or was censored. We use functional notation to emphasize that this interval will vary from one individual to another. If ti falls in interval j(i), say, then dij must be zero for all j < j(i) (i.e. all prior intervals) and will equal di for j = j(i), (i.e. the interval where individual I was last seen).
Then, the piece-wise exponential model may be fitted to data by treating the death indicators dij's as if they were independent Poisson observations with means
|
|
Thus, the piece-wise exponential proportional hazards model is equivalent to a Poisson log-linear model for the pseudo observations, one for each combination of individual and interval, where the death indicator is the response and the log of exposure time enters as an offset.
It is important to note that we do not assume that the dij have independent Poisson distributions, because they clearly do not. If individual I died in interval j(i), then it must have been alive in all prior intervals j < j(i), so the indicators couldn't possibly be independent. Moreover, each indicator can only take the values one and zero, so it couldn't possibly have a Poisson distribution, which assigns some probability to values greater than one. The result is more subtle. It is the likelihood functions that coincide. Given a realization of a piece-wise exponential survival process, we can find a realization of a set of independent Poisson observations that happens to have the same likelihood, and therefore would lead to the same estimates and tests of hypotheses.
The proof is not hard. Recall from Section 7.2.2 that the contribution of the i-th individual to the log-likelihood function has the general form
|
Under the piece-wise exponential model, the first term in the log-likelihood can be written as
|
The cumulative hazard in the second term is an integral, and can be written as a sum as follows
|
One slight lack of symmetry in our results is that the hazard leads to one term on dij(i)loglij(i), but the cumulative hazard leads to j(i) terms, one for each interval from j = 1 to j(i). However, we know that dij = 0 for all j < j(i), so we can add terms on dijloglij for all prior j's; as long as dij = 0 they will make no contribution to the log-likelihood. This trick allows us to write the contribution of the i-th individual to the log-likelihood as a sum of j(i) contributions, one for each interval visited by the individual:
|
The final step is to identify the contribution of each pseudo-observation, and we note here that it agrees, except for a constant, with the likelihood one would obtain if dij had a Poisson distribution with mean μij = tijlij. To see this point write the Poisson log-likelihood as
|
This result generalizes the observation made at the end of Section 7.2.2 noting the relationship between the likelihood for censored exponential data and the Poisson likelihood. The extension is that instead of having just one `Poisson' death indicator for each individual, we have one for each interval visited by each individual.
Generating pseudo-observations can substantially increase the size of the dataset, perhaps to a point where analysis is impractical. Note, however, that the number of distinct covariate patterns may be modest even when the total number of pseudo-observations is large. In this case one can group observations, adding up the measures of exposure and the death indicators. In this more general setting, we can define dij as the number of deaths and tij as the total exposure time of individuals with characteristics xi in interval j. As usual with Poisson aggregate models, the estimates, standard errors and likelihood ratio tests would be exactly the same as for individual data. Of course, the model deviances would be different, representing goodness of fit to the aggregate rather than individual data, but this may be a small price to pay for the convenience of working with a small number of units.
7.4.4 Time-varying Covariates
It should be obvious from the previous development that we can easily accommodate time-varying covariates provided they change values only at interval boundaries. In creating the pseudo-observations required to set-up a Poisson log-likelihood, one would normally replicate the vector of covariates xi, creating copies xij, one for each interval. However, there is nothing in our development requiring these vectors to be equal. We can therefore redefine xij to represent the values of the covariates of individual I in interval j, and proceed as usual, rewriting the model as
|
Of course, splitting observations further increases the size of the dataset, and there will usually be practical limitations on how far one can push this approach, even if one uses grouped data. An alternative is to use simpler indicators such as the mean value of a covariate in an interval, perhaps lagged to avoid predicting current hazards using future values of covariates.
7.4.5 Time-dependent Effects
It turns out that the piece-wise exponential scheme lends itself easily to the introduction of non-proportional hazards or time-varying effects, provided again that we let the effects vary only at interval boundaries.
To fix ideas, suppose we have a single predictor taking the value xij for individual I in interval j. Suppose further that this predictor is a dummy variable, so its possible values are one and zero. It doesn't matter for our current purpose whether the value is fixed for the individual or changes from one interval to the next.
In a proportional hazards model we would write
|
To allow for a time-dependent effect of the predictor, we would write
|
|
To sum up, we can accommodate non-proportionality of hazards simply by introducing interactions with duration. Obviously we can also test the assumption of proportionality of hazards by testing the significance of the interactions with duration. We are now ready for an example.
Continue with 7.5. Infant and Child Mortality in Colombia

