
Censored Interquantile Regression Model with Time-Dependent Covariates

Pages 1592-1603 | Received 14 Jun 2021, Accepted 17 Apr 2023, Published online: 22 Jun 2023

Abstract

Conventionally, censored quantile regression stipulates a specific, pointwise conditional quantile of the survival time given covariates. Despite its model flexibility and straightforward interpretation, the pointwise formulation oftentimes yields rather unstable estimates across neighboring quantile levels with large variances. In view of this phenomenon, we propose a new class of quantile-based regression models with time-dependent covariates for censored data. The proposed models aim to capture the relationship between the failure time and the covariate processes of a target population that falls within a specific quantile bracket. The pooling of information within a homogeneous neighborhood facilitates more efficient estimates and hence more consistent conclusions on the statistical significance of the variables concerned. This new formulation can also be regarded as a generalization of the accelerated failure time model for survival data in the sense that it relaxes the assumption of global homogeneity for the error at all quantile levels. By introducing a class of weighted rank-based estimation procedures, our framework allows a quantile-based inference on the covariate effect with a less restrictive set of assumptions. Numerical studies demonstrate that the proposed estimator outperforms existing alternatives under various settings in terms of smaller empirical biases and standard deviations. A perturbation-based resampling method is also developed to approximate the asymptotic distribution of the parameter estimates. Finally, consistency and weak convergence of the proposed estimator are established via empirical process theory. Supplementary materials for this article are available online.

1 Introduction

Censored quantile regression has been a popular alternative to the Cox proportional hazards model (Cox 1972, 1975) in survival analysis. This broad class of models allows a greater degree of flexibility as it can capture the heterogeneity of covariate effects without assuming that the relative risks are constant over time. It also provides better interpretability as the model directly relates a specific, pointwise quantile of the failure time to a set of candidate covariates. There is a rich literature discussing quantile regression models for censored survival data. In particular, Powell (1984, 1986) studied least absolute deviation estimation for the censored regression model. Under the independent censoring assumption, Ying, Jung, and Wei (1995) derived a median regression procedure for survival data. With the less restrictive assumption of conditionally independent censoring, Portnoy (2003) and Peng and Huang (2008) provided novel adaptations of the Kaplan-Meier (KM) and Nelson-Aalen (NA) estimators, respectively, upon which corresponding inference procedures for censored quantiles can be implemented. To avoid the recursive algorithms required in the aforementioned procedures, Wang and Wang (2009) insightfully developed a locally weighted procedure that adopts the redistribution-of-mass idea for a kernel-based, local reweighting scheme. All these approaches were, however, designed for time-independent covariates only.

In many real applications, since survival events occur over time, there is a natural need to handle time-dependent covariates such as the blood pressures or drug dosages recorded for individual patients. Statistical theory for the Cox proportional hazards model readily extends to time-dependent covariates; see, for example, Andersen and Gill (1982), Kalbfleisch and Prentice (2002), and Martinussen and Scheike (2006), among others. The classic accelerated failure time model has also been extended to accommodate time-dependent covariates in Lin and Ying (1995), Zeng and Lin (2007), and Tseng, Hsieh, and Wang (2005), among others. For censored quantile regression with time-dependent covariates, there have been only a few results to the best of our knowledge. One interesting work is Gorfine, Goldberg, and Ritov (2017), whose estimating equation was developed upon an inverse probability weighting scheme under the independent censoring assumption.

As pointed out in Peng and Huang (2008), censored quantile regression models reduce to the accelerated failure time model (Robins and Tsiatis 1992; Lin and Ying 1995) if the regression parameters are constant for all quantile values $\tau \in (0,1)$. In other words, when the homogeneity assumption holds for all $\tau$'s, the two models are equivalent. This observation motivates us to explore the uncharted region between the two ends of the spectrum. Indeed, our new proposal attempts to seek a reasonable compromise between model flexibility and estimation efficiency. Because of the homogeneity assumption, the accelerated failure time model can pool the information provided by all the observations for estimating a reduced set of model parameters. As we can see in Example 4 in Section 5, if one can delicately leverage the local homogeneity shared between the specific quantile level concerned and its neighboring levels, the problem of obtaining volatile point estimates for quantile regression parameters, frequently observed in many existing studies, can be alleviated.

Recall that the censored quantile regression model studies the influence of the covariates on the failure time at designated quantile levels, such as the median and the quartiles, while the accelerated failure time model examines the overall covariate effect provided the aforementioned homogeneity assumption holds for all quantile levels. None of these models, however, can quantify the covariate effects over a specific group in the sample, say the top 25% or the bottom 25% of a population in terms of length of survival after covariate adjustment. For instance, to measure the effectiveness of a new drug, investigators are interested in quantifying its effect on the survival times of individuals who are among the shortest-lived 25% in the population given the covariates, instead of its effect on an individual who represents the lower quartile of the failure time distribution. In such scenarios, a localized censored quantile regression model with parameters representing the treatment effect over the designated range of quantiles should appear more attractive, whereas the classic quantile regression model only quantifies the treatment effect at particular quantile levels.

The main contribution of this work is the development of a quantile-based version of the accelerated failure time model for censored data with time-dependent covariates, under which we relax the homogeneity requirement under the conditionally independent censoring assumption. Moreover, our approach also provides an alternative perspective on the study of quantile regression problems. The pooling effect enjoyed by our model can remedy a drawback that many existing proposals for censored quantile regression share. Besides numerical stability, our model also offers an alternative perspective for researchers whose primary goal is to quantify the covariate effects on the failure time over an interval of quantiles rather than at a specific level. It should be emphasized that although there are similar works in the literature that discuss methods of combining neighboring quantile coefficient estimates to yield more stable estimates, to the best of our knowledge, none can be straightforwardly extended to our specific problem. In Zheng, Peng, and He (2015), for example, the aggregation is carried out via a penalty term based on an overall sum of the magnitudes of the neighboring coefficients concerned, but the parameter estimates still capture the covariate effects on the response variable at specific quantile levels. Like Zheng, Peng, and He (2015), the proposals of Zou and Yuan (2008) and Kai, Li, and Zou (2010) are also designed for uncensored observations, in which case the celebrated check function can be readily applied to provide initial estimates upon which smoothing or other aggregation is introduced to achieve the inference goals.

The rest of this article is organized as follows: Section 2 outlines the model framework and establishes the corresponding estimating equation for the model parameters. Section 3 outlines the asymptotic results of the proposed estimator, while Section 4 provides a perturbation-based resampling method and a two-stage testing procedure for inference. A set of simulation results is presented in Section 5, followed by an analysis of the Stanford heart transplant data in Section 6. A short concluding discussion is given in Section 7. All proofs of the asymptotic results and the justification of the resampling method are presented in the appendix. A sample of the MATLAB code used in our numerical studies is provided online as supplementary material.

2 Methodology

For $i=1,\ldots,n$, let $T_i$ denote the actual failure time of the $i$th individual, let $\bar Z_i = \{Z_i(s), s \ge 0\} = \{(Z_{i1}(s),\ldots,Z_{ip}(s))^\top, s \ge 0\}$ be the $p$-dimensional multivariate random covariate process corresponding to this person, and let $\beta_0$ be the $p$-dimensional vector of coefficients. Let $\varepsilon_i = h_i(T_i;\beta_0)$ be a monotone transform of $T_i$, where
\[
h_i(T_i;\beta) = \int_0^{T_i} \exp\{-Z_i(s)^\top \beta\}\, ds.
\]

Note that $h_i(T_i;\beta_0)$ reduces to $T_i \exp\{-Z_i^\top \beta_0\}$ when $Z_i$ is time-independent, that is, we have $\log \varepsilon_i = \log T_i - Z_i^\top \beta_0$, which recovers the accelerated failure time model with time-independent covariates. In this regard, the definition of $\varepsilon_i$ here generalizes the error term to the setting of time-dependent covariates, and it can be interpreted as a rescaled failure time of the $i$th individual such that these transformed time stamps are identically distributed under the accelerated failure time model.
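To make the transform concrete, the following sketch evaluates $h_i(t;\beta)$ and its inverse when the covariate process is piecewise constant, the structure used in the simulations of Section 5. It is written in MATLAB to match our numerical studies, but the function names hfun and hinv and the data layout (change points W, covariate values X) are illustrative conventions of this sketch rather than part of the supplementary code.

```matlab
function h = hfun(t, W, X, beta)
% h(t; beta) = int_0^t exp(-Z(s)' * beta) ds for a piecewise-constant process:
% Z(s) = X(:, s) on [edges(s), edges(s+1)), where edges = [0, W, Inf] and the
% row vector W holds the d-1 change points; X is p x d.
edges = [0, W(:)', Inf];
h = 0;
for s = 1:size(X, 2)
    len = max(0, min(t, edges(s + 1)) - edges(s));   % overlap of [0, t] with piece s
    h = h + exp(-X(:, s)' * beta) * len;
end
end

function t = hinv(v, W, X, beta)
% Inverse of hfun in its first argument: h is piecewise linear and strictly
% increasing, so walk through the pieces until the running value reaches v.
edges = [0, W(:)', Inf];
acc = 0;
for s = 1:size(X, 2)
    rate = exp(-X(:, s)' * beta);                    % slope of h on piece s
    len  = edges(s + 1) - edges(s);
    if acc + rate * len >= v
        t = edges(s) + (v - acc) / rate;             % v is attained inside piece s
        return
    end
    acc = acc + rate * len;
end
end
```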

In our setup, instead of assuming a homogeneous error term, that is, global homogeneity across all quantile levels, we hypothesize only a local homogeneity over a designated range of quantiles. More precisely, our model assumes that for all $u \in [\tau_L, \tau_U] \subseteq [0,1]$, the $u$th quantile of $\varepsilon_i$ conditional on the covariate process $\bar Z_i$ is homogeneous for all $i=1,\ldots,n$. Mathematically, it stipulates that
\[
Q_{h_i(T_i;\beta_0)}(u \mid \bar Z_i) = q(u), \qquad u \in [\tau_L,\tau_U];\ i=1,\ldots,n, \tag{2.1}
\]
where $q(u)$ is a function independent of $\bar Z_i$ representing the common conditional quantile, and $Q_{h_i(T_i;\beta_0)}(\cdot \mid \bar Z_i)$ denotes the conditional quantile function of $h_i(T_i;\beta_0)$ given $\bar Z_i$. Our goal is to estimate $\beta_0 \equiv \beta_{0,[\tau_L,\tau_U]}$ under model (2.1), where $\beta_0$ quantifies the covariate effect over the homogeneous region. Here, we suppress the dependence on $\tau_L$ and $\tau_U$ for brevity of notation. Note that the choice of $[\tau_L,\tau_U]$ depends on the range of quantiles of primary interest. As an example, if the user is interested in the covariate effects on the bottom 25% subgroup of the population after covariate adjustment, then $[\tau_L,\tau_U]$ could be chosen as $[0, 0.25]$.

Model (2.1) can be viewed as a generalization of the accelerated failure time model with time-dependent covariates (Robins and Tsiatis 1992; Lin and Ying 1995). Recall that their model assumes that $h_i(T_i;\beta_0)$ shares a common baseline failure time distribution conditional on $\bar Z_i$, which is equivalent to saying $Q_{h_i(T_i;\beta_0)}(u \mid \bar Z_i) = q(u)$ for all $u \in [0,1]$, that is, (2.1) holds with $\tau_L = 0$ and $\tau_U = 1$. In contrast, we relax this requirement so that (2.1) is only satisfied on a subinterval of quantiles $[\tau_L,\tau_U]$. Recall that the accelerated failure time model can be regarded as a special case of the censored quantile regression model, which does not assume any error homogeneity. In this regard, our model can be viewed as a hybrid that connects these two extremes. While the parameter in the accelerated failure time model gives the transformation of the failure time under which the rescaled time stamps $h_i(T_i;\beta_0)$ are homogeneous, our model parameter can be regarded as an analogue that gives the correct transformation over $[\tau_L,\tau_U]$. In other words, one can interpret $\beta_0$ as the uniform covariate effect on the failure time over the subgroup of the population that lies within this quantile bracket. As an example, suppose homogeneity within the middle 50% of quantile levels is of interest, that is, $[\tau_L,\tau_U] = [0.25, 0.75]$; then our model hypothesizes that a unit uniform increment in the covariate process $Z_i(s)$ results in an $\exp\{\mathbf{1}^\top \beta_0\}$-fold prolongation of the failure time for this middle subgroup of the population, where $\mathbf{1} = (1,\ldots,1)^\top$.

Suppose our data consist of iid copies of $\{\tilde T_i, \Delta_i, \bar Z_i(\tilde T_i)\}$ for $i=1,\ldots,n$, where $\tilde T_i = T_i \wedge C_i$ is the observed event time, $\Delta_i = I(T_i \le C_i)$ is the censoring indicator, and $\bar Z_i(\tilde T_i) = \{Z_i(s), 0 \le s \le \tilde T_i\}$ is the covariate history up to the observed survival time. Here, $C_i$ is the censoring time, which is conditionally independent of $T_i$ given $\bar Z_i$, and $I(\cdot)$ denotes the indicator function. Define the conditional cumulative hazard function, counting process, and at-risk process associated with $h_i(T_i;\beta_0)$ to be $\Lambda_i(t;\beta_0)$, $N_i(t;\beta_0) = \Delta_i I(h_i(\tilde T_i;\beta_0) \le t)$, and $Y_i(t;\beta_0) = I(h_i(\tilde T_i;\beta_0) \ge t)$, respectively. Note that model (2.1) implies that $h_i(T_i;\beta_0)$ has a homogeneous conditional distribution function for $t \in [q(\tau_L), q(\tau_U)]$, so we can write $\Lambda_i(t;\beta_0) = \Lambda(t;\beta_0)$ for such $t$. Now, based on classical counting process theory (Fleming and Harrington 1991), we have
\[
E[\,dN_i(t;\beta_0) - Y_i(t;\beta_0)\, d\Lambda(t;\beta_0) \mid \bar Z_i\,] = 0, \tag{2.2}
\]
and also
\[
E[\,Z_i(h_i^{-1}(t;\beta_0))\{dN_i(t;\beta_0) - Y_i(t;\beta_0)\, d\Lambda(t;\beta_0)\} \mid \bar Z_i\,] = 0, \tag{2.3}
\]
for $t \in [q(\tau_L), q(\tau_U)]$, where $h_i^{-1}(\cdot;\beta)$ is the inverse function of $h_i(\cdot;\beta)$. By solving the empirical counterpart of (2.2), we obtain a Nelson-Aalen type estimator of $d\Lambda(t;\beta_0)$, namely $\sum_{i=1}^n dN_i(t;\beta_0) / \sum_{i=1}^n Y_i(t;\beta_0)$ for $t \in [q(\tau_L), q(\tau_U)]$. Meanwhile, with a slight abuse of notation, we denote the $\tau$th unconditional quantile of $h_i(T_i;\beta)$ by $q(\tau;\beta)$, that is, $q(\tau;\beta)$ is the inverse function of $F(t;\beta)$, where $F(t;\beta)$ is the unconditional, or marginal, distribution function of $h_i(T_i;\beta)$. Model (2.1) implies that the homogeneous conditional quantile $q(\tau)$ coincides with $q(\tau;\beta_0)$ for $\tau \in [\tau_L,\tau_U]$, so for $\tau = \tau_L$ and $\tau = \tau_U$, $q(\tau)$ can be estimated by $\hat q(\tau;\beta_0)$, the inverse function of $\hat F(t;\beta_0)$, where $\hat F(t;\beta)$ is an estimator of $F(t;\beta)$. In particular, we take $\hat F(t;\beta)$ to be the Kaplan-Meier estimator based on $\{h_i(\tilde T_i;\beta), \Delta_i\}$. As mentioned, this estimation works because the conditional quantile and the unconditional quantile are equivalent for $\tau \in [\tau_L,\tau_U]$ when $\beta = \beta_0$.
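As an illustration of this last step, the sketch below computes the Kaplan-Meier estimate $\hat F(t;\beta)$ from the transformed times $\{h_i(\tilde T_i;\beta), \Delta_i\}$ and inverts it at a level $\tau$; it assumes no tied event times, and the name kmq is ours.

```matlab
function q = kmq(e, Delta, tau)
% tau-th quantile of the Kaplan-Meier estimate Fhat based on the transformed
% times e(i) = h_i(Ttilde_i; beta) and censoring indicators Delta(i).
% Returns 0 at tau = 0 and Inf at tau = 1; assumes no tied event times.
if tau <= 0, q = 0; return; end
if tau >= 1, q = Inf; return; end
[es, idx] = sort(e(:));
ds = Delta(idx);
n  = numel(es);
S  = cumprod(1 - ds(:) ./ (n:-1:1)');   % KM survival just after each ordered time
F  = 1 - S;                             % Fhat evaluated at the ordered e's
k  = find(F >= tau, 1, 'first');
if isempty(k), q = Inf; else, q = es(k); end
end
```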

Incorporating the two estimators of $d\Lambda(t;\beta_0)$ and $q(\tau)$ into the empirical counterpart of (2.3) and integrating over the range of $t$, it is natural to consider the weighted estimating equation $U_n(\beta;\tau_L,\tau_U) = 0$, where
\[
U_n(\beta;\tau_L,\tau_U) = \frac{1}{n} \sum_{i=1}^n \int_{\hat q(\tau_L;\beta)}^{\hat q(\tau_U;\beta)} \Psi(t,\beta) \left\{ Z_i(h_i^{-1}(t;\beta)) - \frac{S^{(1)}(t;\beta)}{S^{(0)}(t;\beta)} \right\} dN_i(t;\beta).
\]

Here, $S^{(0)}(t;\beta) = n^{-1} \sum_{i=1}^n Y_i(t;\beta)$, $S^{(1)}(t;\beta) = n^{-1} \sum_{i=1}^n Z_i(h_i^{-1}(t;\beta)) Y_i(t;\beta)$, and $\Psi(t,\beta)$ is a weight function that possibly depends on the data $\{\tilde T_i, \Delta_i, \bar Z_i(\tilde T_i), i=1,\ldots,n\}$. Our proposed estimator of $\beta_0$, denoted by $\hat\beta$, is obtained as the root of $U_n(\beta;\tau_L,\tau_U) = 0$. To this end, we simplify $U_n(\beta;\tau_L,\tau_U)$ into a more familiar form by recalling that $N_i(t;\beta) = \Delta_i I(h_i(\tilde T_i;\beta) \le t)$. The above display can thus be rewritten as
\[
U_n(\beta;\tau_L,\tau_U) = \frac{1}{n} \sum_{i=1}^n \Psi(h_i(\tilde T_i;\beta);\beta) \left\{ Z_i(\tilde T_i) - \frac{S^{(1)}(h_i(\tilde T_i;\beta);\beta)}{S^{(0)}(h_i(\tilde T_i;\beta);\beta)} \right\} \left\{ N_i(\hat q(\tau_U;\beta);\beta) - N_i(\hat q(\tau_L;\beta);\beta) \right\}. \tag{2.4}
\]

Here, $\hat q(\tau_L;\beta)$ reduces to 0 if $\tau_L = 0$, and $\hat q(\tau_U;\beta)$ reduces to $\infty$ if $\tau_U = 1$. Observe that if $h_i(\tilde T_i;\beta) > h_j(\tilde T_j;\beta)$, then $Y_j(h_i(\tilde T_i;\beta);\beta) = 0$; otherwise, if $h_i(\tilde T_i;\beta) \le h_j(\tilde T_j;\beta)$, then $h_j^{-1}(h_i(\tilde T_i;\beta);\beta) \le \tilde T_j$. Therefore, both $S^{(1)}(h_i(\tilde T_i;\beta);\beta)$ and $S^{(0)}(h_i(\tilde T_i;\beta);\beta)$ are computable for any $\beta$ given the data, regardless of the censoring status of the subject. As a result, although we condition on the entire covariate process $\bar Z_i$ in the derivation, the estimating equation (2.4) remains computable provided only that $\bar Z_i(\tilde T_i)$ is known, that is, the covariate values up to the observed survival time instead of the entire process. We emphasize that, as in the accelerated failure time model and quantile regression models for time-dependent covariates such as Lin and Ying (1995) and Gorfine, Goldberg, and Ritov (2017), although internal covariates could possibly be fit into (2.4), one has to be cautious in interpreting the model parameters as opposed to external covariates; see chap. 6 of Kalbfleisch and Prentice (2002).
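Putting these pieces together, a minimal sketch that evaluates (2.4) with the unit weight $\Psi \equiv 1$ under the piecewise-constant covariate structure reads as follows; it reuses the hypothetical helpers hfun, hinv, and kmq from the earlier sketches, and Zat, Wc, and Xc are likewise illustrative names.

```matlab
function U = Un(beta, Ttilde, Delta, Wc, Xc, tauL, tauU)
% Estimating function (2.4) with Psi = 1. Wc{i} and Xc{i} hold the change
% points and covariate values of subject i; hfun, hinv, kmq as sketched above.
n = numel(Ttilde);
p = size(Xc{1}, 1);
e = zeros(n, 1);
for i = 1:n, e(i) = hfun(Ttilde(i), Wc{i}, Xc{i}, beta); end
qL = kmq(e, Delta, tauL);                    % qhat(tauL; beta)
qU = kmq(e, Delta, tauU);                    % qhat(tauU; beta)
U = zeros(p, 1);
for i = 1:n
    % N_i(qU) - N_i(qL) = Delta_i * I(qL < e_i <= qU): only such terms survive
    if ~(Delta(i) && e(i) > qL && e(i) <= qU), continue; end
    atRisk = find(e >= e(i));                % {j : Y_j(e_i; beta) = 1}
    S0 = numel(atRisk) / n;
    S1 = zeros(p, 1);
    for j = atRisk(:)'
        tj = hinv(e(i), Wc{j}, Xc{j}, beta); % h_j^{-1}(e_i; beta) <= Ttilde_j
        S1 = S1 + Zat(tj, Wc{j}, Xc{j}) / n;
    end
    U = U + (Zat(Ttilde(i), Wc{i}, Xc{i}) - S1 / S0) / n;
end
end

function z = Zat(t, W, X)
% Value Z(t) of a piecewise-constant covariate process at time t.
s = find(t < [W(:)', Inf], 1, 'first');
z = X(:, s);
end
```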

Given that $U_n(\beta;\tau_L,\tau_U)$ is neither continuous nor monotone in general, we solve for its root by minimizing its $L_1$ norm; a schematic driver is sketched after this paragraph. In particular, we adopt the differential evolution algorithm of Rainer Storn for root solving. The corresponding MATLAB code devec3.m is available on the author's website at http://www.icsi.berkeley.edu/∼storn/code.html. According to our numerical experience, such a stochastic algorithm is generally more robust to local minima than, for example, the deterministic simplex search method. Nevertheless, when the covariate is high-dimensional, the computational burden increases substantially due to the curse of dimensionality. It would be interesting to explore the corresponding variable selection issue for this problem in the future, as in, for example, Zheng, Peng, and He (2018).
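A minimal driver for this step might look as follows. The paper's studies use Storn's devec3.m; the built-in fminsearch shown here is only a stand-in for illustration and, being a simplex method, may stall at local minima of this nonsmooth, piecewise-constant objective, so multiple starting values are advisable.

```matlab
% Solve Un(beta) = 0 by minimizing its L1 norm (sketch; names as above).
obj = @(b) norm(Un(b, Ttilde, Delta, Wc, Xc, tauL, tauU), 1);
betaInit = [0.5; 0.5];     % e.g., from Wang and Wang (2009) on time-averaged proxies
betaHat  = fminsearch(obj, betaInit, optimset('MaxFunEvals', 2e4));
```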

To shed light on the connection between our formulation and existing estimators, one can view $\Psi(h_i(\tilde T_i;\beta);\beta)\{N_i(\hat q(\tau_U;\beta);\beta) - N_i(\hat q(\tau_L;\beta);\beta)\}$ as a single subject-specific weight; then $U_n(\beta;\tau_L,\tau_U)$ falls into the class of weighted log-rank statistics for the time-dependent covariate setting, with this product playing the role of the modified weight. Indeed, when $\Psi(h_i(\tilde T_i;\beta);\beta) = 1$, $\tau_L = 0$, and $\tau_U = 1$, the weight reduces to $\Delta_i$ and thus (2.4) recovers the estimating equation of Lin and Ying (1995).

Typical weight functions include $\Psi(t;\beta) = 1$ and $\Psi(t;\beta) = S^{(0)}(t;\beta)$, the latter generally known as the Gehan weight. Due to the extra complications introduced by the quantile-specific inference and the time-dependent covariates, these two choices do not allow for the simpler numerical implementation available in Jin et al. (2003). Based on our numerical experience as reported in Section 5, the performance under the two weight functions does not differ considerably. Unless otherwise specified, we take the estimating equation to be unweighted when we adopt (2.4) in the numerical studies.

Before concluding this section, we stress that although model (2.1) resembles the formulation considered in Gorfine, Goldberg, and Ritov (2017), the two are fundamentally different in the following two senses. First, Gorfine, Goldberg, and Ritov (2017) developed their inference procedure based on an inverse probability weighting approach because the event time and the censoring time are assumed to be independent. In particular, they made use of the observation that $P(h_i(T_i;\beta_\tau) \le 1 \mid \bar Z_i) = \tau$ to derive their estimating equation, namely
\[
\frac{1}{n} \sum_{i=1}^n \frac{\Delta_i Z_i(0)}{\hat G(\tilde T_i)} \{ I(h_i(T_i;\beta_\tau) \le 1) - \tau \} = 0,
\]
where $\hat G(\cdot)$ denotes the Kaplan-Meier estimate of the survival function of the censoring distribution. In contrast, in this article we relax this assumption and adopt the martingale formulation in (2.2), through which we establish our inference procedure. Second, in Gorfine, Goldberg, and Ritov (2017), the quantile parameters at specific levels are estimated independently with no pooling of information across adjoining quantile levels. As we witness in our numerical studies, overlooking such homogeneity results in volatile estimates; see Example 4 in Section 5. This observation indeed motivated the development of our methodology.

Another important difference between our formulation and that of Gorfine, Goldberg, and Ritov (2017) is the exclusion of an intercept term in the covariates. Note that their intercept parameter represents the quantile of the error term, which is identifiable because of their additional model assumption requiring this quantity to be one; see eq. (2.4) of Gorfine, Goldberg, and Ritov (2017). In our setup, the distribution of the error term is left entirely unspecified without any assumption on its quantiles. In other words, we combine the intercept and the error into one quantity. Hence, we skip the intercept estimate in our discussion, as our main focus remains primarily on the covariate effects.

3 Asymptotic Results

In this section, we state the asymptotic results of the proposed estimator and the corresponding regularity conditions, while the proofs are deferred to the appendix. The required conditions are listed as follows:

  • Condition 1. $|Z_{ik}(0)| + \int_0^\infty |dZ_{ik}(t)| < \infty$ for each $i=1,\ldots,n$ and $k=1,\ldots,p$.

  • Condition 2. $\beta_0 \in \mathcal{B}$, where $\mathcal{B}$ is a bounded convex region.

  • Condition 3. $P(\tilde T > L \mid \bar Z) > 0$ almost surely for all $\bar Z$, where $L = \sup_{\beta \in \mathcal{B}} h^{-1}(t;\beta)$.

  • Condition 4. $\inf_{t \ge 0,\, \beta \in \mathcal{B}} f(t;\beta) > 0$, where $f(t;\beta)$ denotes the density function of the unconditional distribution of $h(T;\beta)$.

  • Condition 5. $\sup_{\beta \in \mathcal{B}} |\Psi(0;\beta)| + \int_0^\infty |d\Psi(t;\beta)| < \infty$, and there exists a function $\psi(t;\beta)$ such that $\sup_{t \in [q(\tau_L;\beta),\, q(\tau_U;\beta)],\, \beta \in \mathcal{B}} |\Psi(t;\beta) - \psi(t;\beta)| \to 0$ almost surely.

  • Condition 6. $A_n(\beta_0;\tau_L,\tau_U)$ is invertible, where $A_n(\beta;\tau_L,\tau_U)$ is the gradient of $\tilde U_n(\beta;\tau_L,\tau_U)$ with respect to $\beta$, and $\tilde U_n(\beta;\tau_L,\tau_U)$ is a smoothed analogue of $U_n(\beta;\tau_L,\tau_U)$ as defined in the appendix.

Condition 1 is a mild smoothness requirement on the covariate processes. In particular, if $Z_{ik}(t)$ has a uniformly bounded derivative, then it is absolutely continuous and hence of bounded total variation. More generally, $Z_{ik}(t)$ is allowed to have jumps with sojourn times having uniformly bounded densities. Conditions 2, 3, and 4 are standard assumptions in survival analysis. In particular, using the idea of the functional delta method (see, e.g., sec. 12 of Kosorok 2008), Conditions 3 and 4 imply the uniform consistency of the Kaplan-Meier estimator $\hat F(t;\beta)$, and hence of the quantile estimator $\hat q(\tau;\beta)$, over $\beta \in \mathcal{B}$. Condition 5 imposes restrictions on the weight function so as to guarantee that it is sufficiently well behaved for the consistency and weak convergence of the proposed estimator. This condition is satisfied by a broad class of weight functions including, but not limited to, the constant and Gehan weights discussed previously. Condition 6 is essential to ensure the uniqueness of $\beta_0$ and to recover the asymptotic normality of the estimator from that of the estimating equation.

With the above conditions, we now establish the consistency and weak convergence of the proposed estimator through the following theorems.

Theorem 1.

If Conditions 1–6 hold, then there exists a fixed neighborhood of $\beta_0$, denoted by $N(\beta_0)$, such that for any root $\hat\beta \in N(\beta_0)$, we have $\hat\beta - \beta_0 \to 0$ almost surely.

Theorem 2.

If Conditions 1–6 hold, then $n^{1/2}(\hat\beta - \beta_0)$ converges weakly to a mean-zero multivariate normal distribution with covariance matrix $A(\beta_0;\tau_L,\tau_U)^{-1} V(\beta_0;\tau_L,\tau_U) A(\beta_0;\tau_L,\tau_U)^{-1}$, where $A(\beta_0;\tau_L,\tau_U)$ is the limit of $A_n(\beta_0;\tau_L,\tau_U)$ and $V(\beta_0;\tau_L,\tau_U)$ is defined in the appendix.

In the appendix, we prove Theorems 1 and 2 by approximating $U_n(\beta;\tau_L,\tau_U)$ with its smoothed analogue $\tilde U_n(\beta;\tau_L,\tau_U)$ and thereby showing its local asymptotic linearity in $\beta$. In the spirit of Ying (1993) and Lin and Ying (1995), the lack of monotonicity in rank-based estimating equations yields large sample properties only in a local sense. Similar conclusions can also be seen in pointwise censored quantile regression models such as Peng and Huang (2008). Global consistency might be justified in particular special cases; see, for example, Jin et al. (2003) for a scenario where the accelerated failure time model with time-independent covariates and the Gehan weight is considered.

4 Inference

4.1 Resampling Scheme

As shown in the proof of Theorem 2, the covariance matrix of the limiting distribution of $n^{1/2}(\hat\beta - \beta_0)$ involves the unknown $\Lambda(t;\beta_0)$. To estimate the variances of the model parameters, we apply the resampling technique proposed in Jin, Ying, and Wei (2001) by perturbing the minimand of the estimating equation. Accordingly, we let $\{\zeta_1,\ldots,\zeta_n\}$ be iid nonnegative random variables with mean 1 and variance 1, such as exponential random variables with mean 1. We fix the data $\{\tilde T_i, \Delta_i, \bar Z_i(\tilde T_i), i=1,\ldots,n\}$ and generate $\{\zeta_1,\ldots,\zeta_n\}$, then consider the following perturbation of $U_n(\beta;\tau_L,\tau_U)$:
\[
U_n^*(\beta;\tau_L,\tau_U) = \frac{1}{n} \sum_{i=1}^n \zeta_i\, \Psi(h_i(\tilde T_i;\beta);\beta) \left\{ Z_i(\tilde T_i) - \frac{S^{(1)}(h_i(\tilde T_i;\beta);\beta)}{S^{(0)}(h_i(\tilde T_i;\beta);\beta)} \right\} \left\{ N_i(\hat q(\tau_U;\beta);\beta) - N_i(\hat q(\tau_L;\beta);\beta) \right\}.
\]

Again, we find the root of $U_n^*(\beta;\tau_L,\tau_U)$, denoted by $\beta^*$, by minimizing its $L_1$ norm. Note that $\hat q(\tau_L;\beta)$ and $\hat q(\tau_U;\beta)$ are functions of $\beta$ and hence also depend on the perturbation, as they are recomputed from the Kaplan-Meier estimator at each candidate value of $\beta$ during the search for the perturbed estimate $\beta^*$. While keeping the data fixed, we repeat the above procedure $B$ times, that is, for each $r=1,\ldots,B$, we generate a set of $\{\zeta_1,\ldots,\zeta_n\}$ and obtain a corresponding realization of $\beta^*$, denoted by $\beta_r^*$. Now, we state the following theorem, which is proven in the appendix by invoking arguments similar to those in Jin, Ying, and Wei (2001).

Theorem 3.

If Conditions 1–6 hold, then the conditional distribution of $n^{1/2}(\beta^* - \hat\beta)$ given the observed data converges weakly to the limiting distribution of $n^{1/2}(\hat\beta - \beta_0)$.

As a consequence of Theorem 3, we may use $\{\beta_r^*, r=1,\ldots,B\}$ to approximate the limiting distribution of the parameter estimates; that is, the variance of $\hat\beta$ can be approximated by the sample variance of $\{\beta_r^*, r=1,\ldots,B\}$, and confidence intervals for $\beta_0$ can be constructed using the empirical quantiles of $\{\beta_r^*, r=1,\ldots,B\}$ or by normal approximation.
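The resampling loop can be sketched as below; it assumes a perturbed variant UnStar of the earlier Un sketch that multiplies the $i$th summand by zeta(i), with the $\hat q$'s recomputed at each candidate $\beta$, and all other names follow the earlier sketches.

```matlab
% Perturbation resampling for the variance of betaHat (sketch).
B = 200;
p = numel(betaHat);
betaStar = zeros(p, B);
for r = 1:B
    zeta = -log(rand(n, 1));                  % iid Exp(1): mean 1, variance 1
    objr = @(b) norm(UnStar(b, zeta, Ttilde, Delta, Wc, Xc, tauL, tauU), 1);
    betaStar(:, r) = fminsearch(objr, betaHat);
end
SE = std(betaStar, 0, 2);                         % resampling standard errors
CI = [betaHat - 1.96 * SE, betaHat + 1.96 * SE];  % 95% CI by normal approximation
```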

4.2 Two-Stage Testing Procedure for Homogeneity

In this section, we present a procedure for testing homogeneity across quantile regions. Based on the derivation of (2.4), the estimating equation remains valid as long as the homogeneity condition (2.1) holds over $[\tau_L,\tau_U]$. In such cases, the covariate effects represented by the regression coefficients are identical over the range of quantiles in which (2.1) holds. Hence, one may be interested in assessing the subregions of quantiles over $[0,1]$, or more generally, any two quantile regions, over which the covariate effects are constant. This is equivalent to testing whether the regression coefficients over the subregions are identical.

For a specific interval $[\tau_L,\tau_U] \subseteq [0,1]$, suppose we want to test the constancy of $\beta_0$ within the range. A straightforward approach is to divide the interval into two halves, namely $[\tau_L,\tau_M]$ and $[\tau_M,\tau_U]$ where $\tau_M = (\tau_L + \tau_U)/2$, and test whether or not the corresponding parameters in the two nonoverlapping regions are equal. More precisely, let $\beta_A$ denote the model parameter when the homogeneity condition (2.1) is assumed over a set $A$, and let $\hat\beta_A$ be the corresponding estimate using (2.4); it then suffices to test whether $\beta_{[\tau_L,\tau_M]} = \beta_{[\tau_M,\tau_U]}$. One possible test is to reject the null hypothesis when the absolute deviation statistic $T = \sum_{k=1}^p |\hat\beta_{[\tau_L,\tau_M]}^{(k)} - \hat\beta_{[\tau_M,\tau_U]}^{(k)}|$ is large. As a result of Theorem 3, the rejection cutoff can be determined by the empirical quantile of the resampling-based $T^* = \sum_{k=1}^p |(\beta_{[\tau_L,\tau_M]}^{*(k)} - \beta_{[\tau_M,\tau_U]}^{*(k)}) - (\hat\beta_{[\tau_L,\tau_M]}^{(k)} - \hat\beta_{[\tau_M,\tau_U]}^{(k)})|$. Theoretically, the test works for any interval provided the sample size is large enough. However, because of the practical concern about the sample size in our data analysis, we propose a two-stage testing procedure below that identifies the constant regions with reasonable statistical power.

In particular, if one is concerned with the lower quantiles $[0, 0.25]$ of the failure time, one may ask whether the covariate effect is in fact homogeneous over $[0,1]$, and if not, whether it is the same as that over $[0.25, 0.5]$. Denote $A_1 = [0, 0.25]$, $A_2 = [0.25, 0.5]$, $A_3 = [0.5, 0.75]$, and $A_4 = [0.75, 1]$; then the two questions translate to testing the two null hypotheses $H_0^{(1)}: \beta_{A_1} = \beta_{A_2} = \beta_{A_3} = \beta_{A_4}$ and $H_0^{(2)}: \beta_{A_1} = \beta_{A_2}$, respectively. To this end, we first test $H_0^{(1)}$ using the test statistic $T_1 = \sum_{k=1}^p \sum_{i \ne j} |\hat\beta_{A_i}^{(k)} - \hat\beta_{A_j}^{(k)}|$, where the rejection cutoff is taken to be the empirical quantile of $T_1^* = \sum_{k=1}^p \sum_{i \ne j} |(\beta_{A_i}^{*(k)} - \beta_{A_j}^{*(k)}) - (\hat\beta_{A_i}^{(k)} - \hat\beta_{A_j}^{(k)})|$. If $H_0^{(1)}$ is not rejected, we conclude that the covariate effects are identical across the quantile subregions. On the other hand, if $H_0^{(1)}$ is rejected, we proceed to test the second hypothesis $H_0^{(2)}$; see the sketch after this paragraph. Similarly, we consider the test statistic $T_2 = \sum_{k=1}^p |\hat\beta_{A_1}^{(k)} - \hat\beta_{A_2}^{(k)}|$ and use the empirical quantiles of $T_2^* = \sum_{k=1}^p |(\beta_{A_1}^{*(k)} - \beta_{A_2}^{*(k)}) - (\hat\beta_{A_1}^{(k)} - \hat\beta_{A_2}^{(k)})|$ as the cutoff. Rejection of $H_0^{(2)}$ means the covariate effects differ on $A_1$ and $A_2$.
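As a sketch, the first-stage statistic and its resampling cutoff can be computed as follows; bHat{m} and bStar{m} are hypothetical names for the estimate and the resampled draws obtained on region $A_m$ by the routines sketched earlier.

```matlab
% First-stage test of H0(1): bHat{m} is the p x 1 estimate on region A_m and
% bStar{m} the p x B matrix of its resampled draws (illustrative names).
T1 = 0;
T1star = zeros(B, 1);
for ii = 1:4
    for jj = 1:4
        if ii == jj, continue; end
        T1 = T1 + sum(abs(bHat{ii} - bHat{jj}));
        dev = (bStar{ii} - bStar{jj}) - (bHat{ii} - bHat{jj});  % p x B
        T1star = T1star + sum(abs(dev), 1)';
    end
end
pval1 = mean(T1star >= T1);     % proceed to the second stage only if pval1 < alpha
```

The second-stage statistic $T_2$ and its cutoff are computed in the same way with the single pair $(A_1, A_2)$.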

The choices of $A_i$ depend on the quantile range of interest. If the upper quantiles $[0.75, 1]$ are of concern, one may set $A_1 = [0.75, 1]$, $A_2 = [0.5, 0.75]$, $A_3 = [0.25, 0.5]$, and $A_4 = [0, 0.25]$. Meanwhile, $A_1 = [0.375, 0.625]$, $A_2 = [0.25, 0.375] \cup [0.625, 0.75]$, $A_3 = [0, 0.25]$, and $A_4 = [0.75, 1]$ can be chosen if the median covariate effect is of interest. Despite the two-stage testing scheme, there is no need to adjust the significance level for multiple hypotheses because of the nested structure of our hypotheses and procedure. Note that $H_0^{(1)}: \beta_{A_1} = \beta_{A_2} = \beta_{A_3} = \beta_{A_4}$ is a subset of $H_0^{(2)}: \beta_{A_1} = \beta_{A_2}$. Also, since we only continue to test $H_0^{(2)}$ after rejecting $H_0^{(1)}$, rejecting $H_0^{(2)}$ is a subset of rejecting $H_0^{(1)}$. As a result, the probability of rejecting at least one hypothesis when both are true, also known as the familywise error rate, that is, $P(\text{reject } H_0^{(1)} \cup \text{reject } H_0^{(2)} \mid H_0^{(1)} \cap H_0^{(2)})$, reduces to $P(\text{reject } H_0^{(1)} \mid H_0^{(1)})$. Recall that the familywise error rate is exactly the quantity that must be controlled below the nominal significance level $\alpha$ to adjust for multiple tests (Simes 1986). Hence, we can set both $P(\text{reject } H_0^{(1)} \mid H_0^{(1)})$ and $P(\text{reject } H_0^{(2)} \mid H_0^{(2)})$ to $\alpha$; that is, the significance levels of the two stages need not be modified, yet the Type I error of the procedure is controlled at $\alpha$. One may also generalize the two-stage test by slicing $[0,1]$ into finer subregions and considering a multi-stage test so as to locate the region of homogeneity more precisely, provided reasonable power can be achieved with the given sample size. Alternatively, the test can serve as an a priori assessment of a suitable quantile interval when a pilot study is available: users may employ it to validate their initial choice of quantile intervals for the full-scale study.

5 Simulations

In the following simulation examples, we consider a covariate process that changes value exactly $d-1$ times, at the time nodes $\{W_{i1},\ldots,W_{i,d-1}\}$, where $0 = W_{i0} < W_{i1} < \cdots < W_{i,d-1} < W_{id} = \infty$. As a result, we can write $h_i(t;\beta)$ as a piecewise linear function of $t$. Since $h_i(t;\beta)$ is monotone in $t$, its inverse function $h_i^{-1}(t;\beta)$ is also piecewise linear in $t$. Note that by the equivariance property of the quantile function, model (2.1) can be written as $Q_{T_i}(u \mid \bar Z_i) = h_i^{-1}(q(u);\beta_0)$ for $u \in [\tau_L,\tau_U]$, which can be exploited to generate lifetimes with a designated conditional quantile function via the inverse transformation method.

In particular, let $Z_i(t) = (Z_{i1}(t), Z_{i2}(t))^\top$ be a two-dimensional covariate process with $d = 3$ subintervals. Specifically, let
\[
Z_{i1}(t) = \sum_{s=1}^d X_{i1s} I(W_{i,s-1} \le t < W_{is}), \qquad Z_{i2}(t) = \sum_{s=1}^d X_{i2s} I(W_{i,s-1} \le t < W_{is}),
\]
where $X_{i1s}, X_{i2s} \sim \mathrm{Gamma}(2, 1/2)$ independently for each $i=1,\ldots,n$ and $s=1,\ldots,d$. The true regression coefficient is $\beta_0 = (1,1)^\top$, and the common conditional quantile function satisfies $\log q(\tau) = Q_\eta(\tau)$, where $\eta \sim N(0,1)$. For $i=1,\ldots,n$ and $s=1,\ldots,d-1$, the time nodes are recovered recursively through $W_{is} = W_{i,s-1} + S_{is}$, where the $S_{is}$ (the sojourn times between the time nodes) are independently distributed as $U(0,1)$. Recall that the covariate processes $\{Z_i(s), 0 \le s \le \tilde T_i\}$ can only be observed up to the survival times, so we generate the sojourn times in such a way that the simulated failure times $\{T_i, i=1,\ldots,n\}$ spread through the subintervals derived from $\{W_{i0},\ldots,W_{id}\}$. Although we consider two time-dependent covariates in the simulation studies, time-independent covariates can easily be incorporated by treating them as constant processes. One can also verify that the resulting conditional quantile function of the failure time remains monotone increasing for any realization of the covariate process.
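A minimal sketch of this data-generating mechanism is given below, reusing the hypothetical hinv from the sketch in Section 2. We read Gamma(2, 1/2) as shape 2 and scale 1/2 (an assumption on the intended parameterization) and draw such variates as sums of two exponentials to avoid toolbox dependence; censoring is example-specific and therefore omitted here.

```matlab
% Generate the covariate processes and failure times of one dataset (sketch).
n = 200;  d = 3;  p = 2;  beta0 = [1; 1];
qfun = @(u) exp(sqrt(2) * erfinv(2 * u - 1));        % q(tau) = exp(Phi^{-1}(tau))
Wc = cell(n, 1);  Xc = cell(n, 1);  T = zeros(n, 1);
for i = 1:n
    Wc{i} = cumsum(rand(1, d - 1));                  % sojourn times ~ U(0, 1)
    Xc{i} = -0.5 * (log(rand(p, d)) + log(rand(p, d)));  % Gamma(2, 1/2), assumed scale
    T(i)  = hinv(qfun(rand), Wc{i}, Xc{i}, beta0);   % inverse-transform sampling
end
```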

Example 1:

Homogeneous Error

Given the above setup, the first numerical example demonstrates that the proposed method resembles the classic accelerated failure time model when the homogeneous conditional quantile assumption (2.1) is satisfied for $[\tau_L,\tau_U] = [0,1]$. To this end, we generate the censoring times $\{C_i, i=1,\ldots,n\}$ from independent exponential distributions with mean $\exp\{c + 100 Z_{i1}(0) + 100 Z_{i2}(0)\}$, so that the censoring times depend on the covariate processes, where we set $c = -150$ or $c = -110$ to achieve a censoring rate of 40% or 20%, respectively. With $\beta_0 = (1,1)^\top$ and $\log q(\tau) = Q_\eta(\tau)$, (2.1) holds for $[\tau_L,\tau_U] = [0,1]$, and thus for the subinterval $[\tau_L,\tau_U] = [0, 0.5]$. We use (2.4) with $[\tau_L,\tau_U] = [0, 0.5]$ to obtain our parameter estimates $\hat\beta$. We generate 500 Monte Carlo datasets with $n = 200$ or $n = 400$ sample points each, and adopt the aforementioned resampling procedure with $B = 200$ and $\zeta_i$ exponentially distributed with mean 1 to approximate the asymptotic distribution of the parameter estimates. The numerical optimization is carried out via the differential evolution algorithm in MATLAB. The choice of differential evolution parameters follows the rules of thumb suggested by Storn (1996); further discussion on parameter selection can be found in Price, Storn, and Lampinen (2006). We adopt existing quantile regression techniques for time-independent covariates, such as Wang and Wang's (2009) approach, to obtain preliminary initial estimates, taking the time-averaged value of each observed covariate process as a time-independent proxy. Our numerical experience suggests that the monotonicity of Wang and Wang's (2009) estimating equation with respect to the parameter of interest serves as a reasonable guideline for initial value selection.

Remark 1.

It is noteworthy that differential evolution is not the only viable algorithm for the optimization problem concerned. Indeed, as suggested by an anonymous referee, we may update $\hat\beta$ and $\hat q$ in (2.4) in an alternating manner, that is, update $\hat q$ given the current $\hat\beta$ and solve for $\hat\beta$ with $\hat q$ fixed, iterating until convergence. Simulation results obtained by this iterative method are presented in the appendix for reference. We observe that this method achieves numerical results similar to those of the differential evolution algorithm, with slightly larger empirical biases under some circumstances. We trust that the iterative method can be a valuable and effective alternative, especially in scenarios where each update of $\hat\beta$ involves a convex and smooth optimization problem. Unfortunately, this simplification may not bring significant improvements for the problem discussed in this article or for nonconvex optimization problems in general.

Table 1 reports the simulation results for our proposal in terms of empirical bias (Bias), empirical standard deviation (SD), average standard error (SE) based on the resampling method, and coverage rate (CP) of the 95% confidence intervals constructed by normal approximation. The results obtained by our proposal using the different weight functions are quite similar, so the estimation is not sensitive to the specific choice of weight function. The empirical performance is compared against the proposals of Lin and Ying (1995) and Gorfine, Goldberg, and Ritov (2017). Under the current setting, where the accelerated failure time model holds, the proposed method produces competitive results in empirical biases and standard deviations when compared to Lin and Ying (1995). Moreover, the standard errors are close to the empirical standard deviations, and the coverage probabilities are close to the nominal level of 95%, validating the aforementioned resampling method. Our proposal loses some efficiency relative to Lin and Ying's (1995) method, as reflected in slightly larger empirical standard deviations. This is reasonable because we only consider (2.4) over the subinterval $[\tau_L,\tau_U] = [0, 0.5]$ in which (2.1) holds to obtain $\hat\beta$, while they consider the whole region $[\tau_L,\tau_U] = [0,1]$. On the other hand, although Gorfine, Goldberg, and Ritov's (2017) method only requires (2.1) to hold at a single value of $\tau$, their estimates demonstrate larger empirical biases and standard deviations, especially when the censoring rate is high. One reason is that their methodology only works under independent censoring, which obviously does not hold in this example, as the censoring distribution depends on the covariate processes.

Table 1 Simulation results with homogeneous error.

We also employ the augmentation-based estimator discussed in Gorfine, Goldberg, and Ritov (2017) for a more thorough comparison, denoted Augmented-Gorfine, Goldberg, and Ritov (2017) in the tables. The augmentation component exploits the information from the censored observations to yield a doubly robust estimator. To extend its scope so that it can handle conditionally independent censoring, we incorporate a kernel-based local Kaplan-Meier estimator (see Wang and Wang 2009) into the augmentation-based estimator for estimating the conditional censoring distribution given the covariates. Equipped with a more refined estimate of the censoring distribution, the augmented estimator manages to outperform its vanilla counterpart in terms of empirical bias. However, as observed in Gorfine, Goldberg, and Ritov (2017), the augmented version does not necessarily improve the empirical standard deviations, and our proposed method still maintains a reasonable edge, particularly in empirical standard deviation. It is noteworthy that the local Kaplan-Meier estimator is kernel-based and requires the use of instrumental variables for the time-dependent covariates; however, its performance under an arbitrary choice of instrumental variables is fairly sensitive to the censoring mechanism. In practice, as the underlying censoring mechanism can hardly be determined, this raises concerns about the actual implementation. Another drawback of the augmentation-based estimator, as described by the authors, is the long computing time compared to its vanilla version and our proposal. In our numerical experience with an E5-2650v4 CPU and 8GB of memory, the augmented method requires about 21 min to run for a single quantile, against about 1 sec for its vanilla counterpart and 20 sec for the proposed estimator.

Table 2 Simulation results with homogeneity assumed for τ0.25.

Table 3 Simulation results with homogeneity assumed for 0.25τ0.75.

The composite quantile regression introduced in Zou and Yuan (2008) also shares the pooling effect that our methodology provides, although the two ideas have different modeling motivations. We include its possible extension to censored data for comparison, which we refer to as Composite-Gorfine, Goldberg, and Ritov (2017). The modification composites the most relevant existing quantile regression model for time-dependent covariates, namely Gorfine, Goldberg, and Ritov's (2017) methodology, over a grid of quantiles. We use a grid spacing of 0.1 over the range of quantiles of interest, $[0, 0.5]$, for illustration here. While the composite feature could possibly improve the numerical performance of ordinary quantile regression, it involves a higher-dimensional optimization procedure, and hence the numerical problem becomes more challenging. The results show that the Composite-Gorfine, Goldberg, and Ritov (2017) method shares a similar empirical bias with the vanilla Gorfine, Goldberg, and Ritov (2017) estimator, while its standard deviations are slightly inferior; neither is comparable to the proposed method. The main reason is that the estimates being composited require an independent censoring assumption, which does not hold in our examples.

Example 2:

Heterogeneity in Upper Tail Quantiles

The second example illustrates that our proposed method generalizes the accelerated failure time model of Lin and Ying (1995), in the sense that it relaxes the homogeneous conditional quantile assumption (2.1) for all quantile levels. With the same covariate processes described above, we now generate the failure times based on $Q_{T_i}(\tau \mid \bar Z_i) = h_i^{-1}(q(\tau); \beta_0 + \mathbf{1}\, I(\tau > 0.5))$. Here, $\beta_0 + \mathbf{1}\, I(\tau > 0.5)$ reduces to $\beta_0$ for $\tau \le 0.5$, which implies that model (2.1) holds for $[\tau_L,\tau_U] = [0, 0.5]$. But the expression becomes $\beta_0 + \mathbf{1}$, not the same constant $\beta_0$, for $[\tau_L,\tau_U] = [0.5, 1]$; therefore, model (2.1) is not valid over the entire interval $[0,1]$. Owing to the heterogeneity across the two subregions of quantiles, the accelerated failure time model assumed by Lin and Ying (1995) does not hold here. Nevertheless, suppose we are only interested in the covariate effect on the lower quantiles $[0, 0.25]$; then our proposal of using (2.4) with $[\tau_L,\tau_U] = [0, 0.25]$ to compute $\hat\beta$ remains a justified methodology. Under the same simulation setting as in the first example, Table 2 reports the simulation results in the same format as Table 1. The proposed estimator clearly produces reasonable results, as in the homogeneous setting of the first example, because our model (2.1) holds for $[\tau_L,\tau_U] = [0, 0.25]$ here. Meanwhile, Lin and Ying's (1995) method understandably produces substantially larger empirical biases and standard deviations than it does in the first example because of the violation of their model assumption. The performance of the vanilla, augmented, and composite Gorfine, Goldberg, and Ritov (2017) methods is worse than that of our proposal, as in the first example, for the same reason: the violation of the independent censoring assumption.

Example 3:

Heterogeneity in Both Lower and Upper Tail Quantiles

The third example considers failure times with quantile function
\[
Q_{T_i}(\tau \mid \bar Z_i) = h_i^{-1}\big(q(\tau);\ \beta_0 - Q_\eta(\tau)\mathbf{1}\, I(\tau < 0.25) + Q_\eta(\tau)\mathbf{1}\, I(\tau > 0.75)\big).
\]

The second argument in the last display reduces to $\beta_0$ for $\tau \in [0.25, 0.75]$. Therefore, model (2.1) holds for $[\tau_L,\tau_U] = [0.25, 0.75]$, whereas this equivalence fails beyond that range. Suppose we want to examine the covariate effect on the failure time around the median; we may then consider a subinterval of quantiles around the median, say $[\tau_L,\tau_U] = [0.25, 0.75]$, for the estimating equation (2.4). As shown in Table 3, the empirical biases and standard deviations of our method mostly demonstrate an advantage over the competitors. As in the previous scenario, Lin and Ying's (1995) method on the one hand, and Gorfine, Goldberg, and Ritov's (2017) vanilla proposal, augmented method, and composite modification on the other, yield significantly larger biases because of the violation of the homogeneous conditional quantile assumption over all quantiles and of the independent censoring assumption, respectively.

With a significance level of 5%, Table 4 reports the empirical rejection rate of the two-stage test described in Section 4 for the previous three simulation examples with the unweighted estimating equation. We take $A_1 = [0, 0.25]$, $A_2 = [0.25, 0.5]$, $A_3 = [0.5, 0.75]$, and $A_4 = [0.75, 1]$ for the first two examples, and $A_1 = [0.375, 0.625]$, $A_2 = [0.25, 0.375] \cup [0.625, 0.75]$, $A_3 = [0, 0.25]$, and $A_4 = [0.75, 1]$ for the third example. The test exhibits well-controlled empirical Type I errors with reasonable power under the alternative hypothesis.

Table 4 Empirical rejection rate of the test for homogeneity.

Example 4:

Independent Censoring

Although the proposed method assumes a local homogeneity around the quantile level of interest, in contrast to the quantile regression method described by Gorfine, Goldberg, and Ritov (2017), it effectively exploits more information than the latter and thus allows a more numerically stable estimation, provided this assumption holds. Here, we present a simulation example to demonstrate that the stability of the proposed method offers more reliable inference across a range of quantiles even under an independent censoring setup. We consider covariate processes as described in the previous examples, while the true regression coefficients are now $\beta_0 = (0.8, 0.8)^\top$. The failure times are generated based on the conditional quantile function $Q_{T_i}(\tau \mid \bar Z_i) = h_i^{-1}(q(\tau);\beta_0)$, while the censoring times are simulated from an iid exponential distribution with mean 3.6 to yield a censoring rate of 20%. The failure times and the censoring times are then marginally independent, in which case the condition assumed in Gorfine, Goldberg, and Ritov (2017) holds. Their method is implemented for every quantile level from 0.25 to 0.75 in increments of 0.05, while the proposed method is adopted with $[\tau_L,\tau_U] = [0.25, 0.75]$ for comparison.

Table 5 shows that when the independent censoring assumption holds, Gorfine, Goldberg, and Ritov's (2017) proposal provides reasonable estimates over the quantile levels considered. Nevertheless, their empirical confidence intervals for both parameter estimates are volatile and fluctuate across neighboring quantiles. This phenomenon is undesirable, as conflicting conclusions may be drawn on whether or not a covariate is significant over the interquantile range concerned. As a hypothetical example, if the user is interested in the failure time distribution around the median, looking at $\tau = 0.5$ would lead to the conclusion that the first covariate is insignificant while the second is significant; but if one performs the analysis at the neighboring quantile $\tau = 0.45$ or $\tau = 0.55$, a different conclusion is drawn, which can be confusing. Here, we imagine analyzing each designated quantile level separately, so no multiple comparison is involved. On the other hand, provided that the homogeneity assumption holds over $[\tau_L,\tau_U]$, our proposal pools information over this range to produce a more numerically stable estimate, so that conclusions can be drawn more consistently. We also study the performance of the Composite-Gorfine, Goldberg, and Ritov (2017) variant discussed above for reference. This composite method produces estimates that are averages of the vanilla Gorfine, Goldberg, and Ritov (2017) estimates; compared to the proposed method, they have larger empirical biases and standard deviations.

Table 5 Simulation results with homogeneous error and independent censoring.

6 Data Analysis

In this section, we demonstrate the proposed method on the Stanford heart transplant data. The study is described in detail in Clark et al. (1971) and Crowley and Hu (1977); see also Aitkin, Laird, and Francis (1983) for a brief review of statistical analyses of these data. Lin and Ying (1995) analyzed the effect of heart transplantation on the lifetime using the accelerated failure time model with time-dependent covariates. Their results suggest that younger patients with lower mismatch scores are more likely to benefit from transplantation, while transplantation is less beneficial for older patients with higher mismatch scores. Here, in particular, we are interested in examining the effect of heart transplantation in a neighborhood of the median lifetime, as well as in the lower and upper quantiles.

As in Lin and Ying (1995), we consider a three-dimensional time-dependent covariate process in our analysis, namely transplant status, age at transplant, and mismatch score, where the last two covariates are taken to be zero before transplant. Mathematically, let $W_i$ denote the waiting time to transplant of the $i$th patient, that is, the duration from the date of acceptance into the program to the date of transplant; we take $W_i$ to be $\infty$ if the $i$th patient never received a transplant. Denoting by $Z_i(t) = \{Z_{i1}(t), Z_{i2}(t), Z_{i3}(t)\}$ the three-dimensional covariate process of the $i$th patient, we set
\[
Z_{i1}(t) = I(t \ge W_i), \quad Z_{i2}(t) = (\text{age at transplant} - 35)\, I(t \ge W_i), \quad Z_{i3}(t) = (\text{mismatch score} - 0.5)\, I(t \ge W_i).
\]

Here, we center $Z_{i2}(t)$ and $Z_{i3}(t)$ so that the regression coefficient associated with $Z_{i1}(t)$ can be interpreted as the effect of transplantation for a patient who is 35 years old at the time of transplant and has a mismatch score of 0.5.
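For concreteness, the covariate process of a single patient can be coded as below; Wi, ageTx, and mm are hypothetical names for the waiting time, age at transplant, and mismatch score.

```matlab
% Covariate process of one patient (sketch); Wi = Inf if never transplanted.
Zi = @(t) [ double(t >= Wi); ...
            (ageTx - 35) * (t >= Wi); ...
            (mm - 0.5) * (t >= Wi) ];
```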

Suppose we are interested in the covariate effects on the bottom quantiles, a neighborhood of the median, and the top quantiles. We may set $[\tau_L,\tau_U]$ to be $[0, 0.25]$, $[0.375, 0.625]$, and $[0.5, 1]$, respectively. Here, $[0.5, 1]$ is considered instead of $[0.75, 1]$ to avoid possible identifiability issues at the extreme quantiles. Table 6 reports the parameter estimates and their corresponding 95% confidence intervals, constructed by normal approximation under the aforementioned resampling scheme with $B = 500$. Estimation results based on the accelerated failure time model, the Cox model, and the quantile model (at $\tau = 0.25, 0.5, 0.75$), each under the framework of time-dependent covariates, are also provided, as in Gorfine, Goldberg, and Ritov (2017). At a significance level of 5%, both the accelerated failure time model and the Cox model conclude that transplant status and age at transplant are significant, whereas the mismatch score is not. Based on Gorfine, Goldberg, and Ritov's (2017) approach, the quantile regression models at the 25th, 50th, and 75th percentiles reveal that the covariate effects are not significant, with the exception of transplant status at the 25th and 50th percentiles; this might be due to the higher variance associated with their estimator, as shown in the previous simulation examples. While the composite variant of Gorfine, Goldberg, and Ritov (2017) generally produces smaller standard errors than the vanilla version, its estimates for the bottom quantiles are relatively unstable. Possible causes are the higher-dimensional optimization problem involved in the composite method and the fact that Gorfine, Goldberg, and Ritov's (2017) procedure is intrinsically volatile across extreme tail quantiles. On the other hand, the parameter estimates of the proposed method largely share consistent signs with those given by Lin and Ying (1995) and the Cox model. The estimated coefficients for transplant status are larger on the middle and top quantile ranges than on the bottom quantile range, suggesting that transplantation can be more beneficial to patients who live longer than their counterparts with similar covariate backgrounds. Meanwhile, age at transplant has a negative influence on the failure time over all three quantile regions. For the mismatch score, the estimated coefficients flip from positive on the bottom quantile range to negative on the higher quantile ranges. This can be interpreted as a transplant mismatch being more unfavorable to patients who survive longer after covariate adjustment, which again suggests that transplantation is more influential for that cohort. Such observations are plausible because it takes time for a transplantation to become effective. Comparing the estimated coefficients for the mismatch score across the three regions, the sign change may explain why the mismatch score has an insignificant overall effect over all quantile levels, as estimated by Lin and Ying (1995).

Table 6 Parameter estimates of the Stanford heart transplant data.

We also implement the two-stage testing procedure described in Section 4 to assess the quantile regions of homogeneous covariate effects at a 5% level of significance. With $[\tau_L,\tau_U] = [0, 0.25]$, the empirical p-value of the first stage is 0.01, so $H_0^{(1)}: \beta_{A_1} = \beta_{A_2} = \beta_{A_3} = \beta_{A_4}$ is rejected, suggesting that the error is not homogeneous over $[0,1]$. Furthermore, the second stage produces an empirical p-value of 0.01; rejecting $H_0^{(2)}: \beta_{A_1} = \beta_{A_2}$ thus implies that the covariate effects over $[0, 0.25]$ and $[0.25, 0.5]$ are different. For $[\tau_L,\tau_U] = [0.375, 0.625]$, the empirical p-values of the two stages are both 0, that is, both $H_0^{(1)}$ and $H_0^{(2)}$ are rejected; hence, the homogeneous quantile region concluded would be $[0.375, 0.625]$ itself. Finally, for $[\tau_L,\tau_U] = [0.75, 1]$, the empirical p-values are 0.01 and 0.224, so $H_0^{(2)}$ is not rejected in the second stage, that is, the covariate effects over $[0.75, 1]$ and $[0.5, 0.75]$ are not distinguishable. If a further numerical study is to be carried out on a full-scale dataset, this result strengthens the idea of using $[0.5, 1]$ to assess the covariate effects on the top quantiles.

7 Discussion

In view of this new perspective in survival analysis, further studies may be of interest to generalize the proposed methodology to accommodate different variations of failure time data encountered in practice. In particular, modifications can be made without disrupting the framework of the rank-based estimation procedure so as to handle length-biased data and case-cohort studies. On the other hand, following the exploration of quantile-localized covariate effects, one may abandon the assumption of universal covariate effects and instead assume a hierarchical structure for the regression parameters. Viewed as a hybrid between our setup and the censored quantile regression model, the current proposal can be generalized when the regression parameters are presumed to have a certain structure, especially when prior knowledge on the covariate effects is available. Investigation of these new directions shall be handled in separate papers.

Supplementary Materials

The supplementary materials contain additional details about the asymptotic results and the simulation studies. Sections A to C of the appendix present, respectively, the proofs of Theorems 1–3 discussed in Sections 3 and 4. Section D summarizes the numerical results for the alternative iterative algorithm discussed in Remark 1 of Section 5.


Acknowledgments

The authors would like to thank the editor, the associate editor, and the anonymous referees for their constructive comments and valuable suggestions, with which the manuscript has been substantially improved.


Funding

Sit’s work was partially supported by Hong Kong Research Grant Council RGC-14301618, RGC-14301920, and RGC-14307221.

References

  • Aitkin, M., Laird, N., and Francis, B. (1983), “A Reanalysis of the Stanford Heart Transplant Data,” Journal of the American Statistical Association, 78, 264–274. DOI: 10.1080/01621459.1983.10477959.
  • Andersen, P. K., and Gill, R. D. (1982), “Cox’s Regression Model for Counting Processes: A Large Sample Study,” The Annals of Statistics, 10, 1100–1120. DOI: 10.1214/aos/1176345976.
  • Clark, D., Stinson, E., Griepp, R., Schroeder, J., Shumway, N., and Harrison, D. (1971), “Cardiac Transplantation in Man: VI. Prognosis of Patients Selected for Cardiac Transplantation,” Annals of Internal Medicine, 75, 15–21. DOI: 10.7326/0003-4819-75-1-15.
  • Cox, D. R. (1972), “Regression Models and Life-Tables,” Journal of the Royal Statistical Society, Series B, 34, 187–220. DOI: 10.1111/j.2517-6161.1972.tb00899.x.
  • Cox, D. R. (1975), “Partial Likelihood,” Biometrika, 62, 269–276.
  • Crowley, J., and Hu, M. (1977), “Covariance Analysis of Heart Transplant Survival Data,” Journal of the American Statistical Association, 72, 27–36. DOI: 10.1080/01621459.1977.10479903.
  • Fleming, T. R., and Harrington, D. P. (1991), Counting Processes and Survival Analysis, New York: Wiley.
  • Gorfine, M., Goldberg, Y., and Ritov, Y. (2017), “A Quantile Regression Model for Failure-Time Data with Time-Dependent Covariates,” Biostatistics, 18, 132–146. DOI: 10.1093/biostatistics/kxw036.
  • Jin, Z., Lin, D. Y., Wei, L. J., and Ying, Z. (2003), “Rank-based Inference for the Accelerated Failure Time Model,” Biometrika, 90, 341–353. DOI: 10.1093/biomet/90.2.341.
  • Jin, Z., Ying, Z., and Wei, L. J. (2001), “A Simple Resampling Method by Perturbing the Minimand,” Biometrika, 88, 381–390. DOI: 10.1093/biomet/88.2.381.
  • Kai, B., Li, R., and Zou, H. (2010), “Local Composite Quantile Regression Smoothing: An Efficient and Safe Alternative to Local Polynomial Regression,” Journal of the Royal Statistical Society, Series B, 72, 49–69. DOI: 10.1111/j.1467-9868.2009.00725.x.
  • Kalbfleisch, J., and Prentice, R. (2002), The Statistical Analysis of Failure Time Data (2nd ed.), New York: Wiley.
  • Kosorok, M. (2008), Introduction to Empirical Processes and Semiparametric Inference, New York: Springer-Verlag.
  • Lin, D., and Ying, Z. (1995), “Semiparametric Inference for the Accelerated Life Model with Time-Dependent Covariates,” Journal of Statistical Planning and Inference, 44, 47–63. DOI: 10.1016/0378-3758(94)00039-X.
  • Martinussen, T., and Scheike, T. H. (2006), Dynamic Regression Models for Survival Data, New York: Springer.
  • Peng, L., and Huang, Y. (2008), “Survival Analysis with Quantile Regression Models,” Journal of the American Statistical Association, 103, 637–649. DOI: 10.1198/016214508000000355.
  • Portnoy, S. (2003), “Censored Regression Quantiles,” Journal of the American Statistical Association, 98, 1001–1012. DOI: 10.1198/016214503000000954.
  • Powell, J. L. (1984), “Least Absolute Deviations Estimation for the Censored Regression Model,” Journal of Econometrics, 25, 303–325. DOI: 10.1016/0304-4076(84)90004-6.
  • Powell, J. L. (1986), “Censored Regression Quantiles,” Journal of Econometrics, 32, 143–155.
  • Price, K., Storn, R. M., and Lampinen, J. A. (2006), Differential Evolution: A Practical Approach to Global Optimization, Berlin: Springer.
  • Robins, J., and Tsiatis, A. (1992), “Semiparametric Estimation of an Accelerated Failure Time Model with Time-Dependent Covariates,” Biometrika, 79, 311–319. DOI: 10.2307/2336842.
  • Simes, R. J. (1986), “An Improved Bonferroni Procedure for Multiple Tests of Significance,” Biometrika, 73, 751–754. DOI: 10.1093/biomet/73.3.751.
  • Storn, R. (1996), “On the Usage of Differential Evolution for Function Optimization,” in Proceedings of North American Fuzzy Information Processing, IEEE, pp. 519–523. DOI: 10.1109/NAFIPS.1996.534789.
  • Tseng, Y.-K., Hsieh, F., and Wang, J.-L. (2005), “Joint Modelling of Accelerated Failure Time and Longitudinal Data,” Biometrika, 92, 587–603. DOI: 10.1093/biomet/92.3.587.
  • Wang, H. J., and Wang, L. (2009), “Locally Weighted Censored Quantile Regression,” Journal of the American Statistical Association, 104, 1117–1128. DOI: 10.1198/jasa.2009.tm08230.
  • Ying, Z. (1993), “A Large Sample Study of Rank Estimation for Censored Regression Data,” The Annals of Statistics, 21, 76–99. DOI: 10.1214/aos/1176349016.
  • Ying, Z., Jung, S. H., and Wei, L. J. (1995), “Survival Analysis with Median Regression Models,” Journal of the American Statistical Association, 90, 178–184. DOI: 10.1080/01621459.1995.10476500.
  • Zeng, D., and Lin, D. Y. (2007), “Efficient Estimation for the Accelerated Failure Time Model,” Journal of the American Statistical Association, 102, 1387–1396. DOI: 10.1198/016214507000001085.
  • Zheng, Q., Peng, L., and He, X. (2015), “Globally Adaptive Quantile Regression with Ultra-High Dimensional Data,” The Annals of Statistics, 43, 2225–2258. DOI: 10.1214/15-AOS1340.
  • Zheng, Q., Peng, L., and He, X. (2018), “High Dimensional Censored Quantile Regression,” The Annals of Statistics, 46, 308–343. DOI: 10.1214/17-AOS1551.
  • Zou, H., and Yuan, M. (2008), “Composite Quantile Regression and the Oracle Model Selection Theory,” The Annals of Statistics, 36, 1108–1126. DOI: 10.1214/07-AOS507.