Search in:

Statistical Theory and Related Fields Volume 5, 2021 - Issue 3

Submit an article Journal homepage

Free access

321

Views

CrossRef citations to date

Altmetric

Listen

Short Communications

Rejoinder on ‘Inference after covariate-adaptive randomization: aspects of methodology and theory’

Jun Shaoa KLATASDS-MOE, School of Statistics, East China Normal University, Shanghai, People's Republic of China;b Department of Statistics, University of Wisconsin-Madison, Madison, WI, USACorrespondence[email protected]
View further author information

Pages 196-199 | Received 11 Mar 2021, Published online: 24 Mar 2021

Cite this article
https://doi.org/10.1080/24754269.2021.1905357
CrossMark

In this article

1. The discussion by Drs. Ma, Zhang and Hu
2. The discussion by Drs. Wang, Susukida, Mojtabai, Amin-Esmaeili and Rosenblum
3. The discussion by Dr. Liu
4. The discussion by Drs. Ye and Yi
Disclosure statement
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

I would like to thank all discussants for their insightful discussions on the topic of statistical inference after covariate-adaptive randomisation, especially for including reviews of some new results and references that are not in my review written more than a year ago. I hope these discussions together with my review will stimulate further studies in this important area having many applications particularly in clinical trials.

My rejoinder focuses on some main points from four separate groups of discussants.

1. The discussion by Drs. Ma, Zhang and Hu

Drs. Ma, Zhang, and Hu's discussion brings some new results or interesting directions for new studies. These include, but are not limited to, robust inference with the randomisation scheme considered in Hu and Hu (Citation2012), inference after covariate-adaptive randomisation with covariate misclassification, unobserved covariates, missing data or non-compliance, and high dimensional covariates. Covariate-adaptive randomisation can also be combined with other adaptive designs such as sequential monitoring (Zhu & Hu, Citation2019), sample size re-estimation (CitationLi et al., Citationin press), and seamless phase II/III clinical trials (CitationMa et al., Citationin press).

As I discussed in Section 7 of my review, the theoretical study of covariate-adaptive randomisation schemes such as Pocock and Simon's minimisation is not completed and still important, although some asymptotically valid inference procedures after these randomisation schemes have been derived. Let $Z$ be the discrete covariate used in covariate-adaptive randomisation, $z_{1}, \dots, z_{c}$ be all possible categories of $Z$ , $π_{1}, \dots, π_{k}$ be the target assignment proportions in the trial, n be the total sample size of the trial, $n_{a} (l)$ be the number of patients in treatment a with $Z = z_{l}$ , $a = 1, \dots, k$ , $l = 1, \dots, c$ , and $n (l) = n_{1} (l) + \dots + n_{k} (l)$ . A key result for studying the asymptotic validity of inference procedures after covariate-adaptive randomisation is

(R1)

$\sqrt{n} (\frac{n_{a} (l)}{n (l)} - π_{a}, a = 1, \dots, k, l = 1, \dots, c) | Z_{1}, \dots, Z_{n} \to N (0, D) in distribution,$

i.e. conditioned on n observed values of $Z$ , $Z_{1}, \dots, Z_{n}$ , the $k \times c$ dimensional vector whose $(a, l)$ th component is $\frac{n_{a} (l)}{n (l)} - π_{a}$ converges in distribution to a multivariate normal with mean 0 and some covariance matrix D. Result (R1) holds for type 1 or 2 covariate-adaptive randomisation schemes as described in Section 3 of my review. For Pocock and Simon's minimisation, however, it has not been rigorously shown in general whether or not (R1) holds, although (Ma et al., Citation2015) showed (R1) under restrictive conditions (a correct linear model between the response and $Z$ whose components are independent). Hu and Zhang (Citation2020) derived the asymptotic normality for $\frac{n_{a} (l)}{n (l)} - π_{a}, a = 1, \dots, k$ with a single fixed l, but (R1) requires joint asymptotic normality of the entire vector over all $l = 1, \dots, c$ .

Without (R1), asymptotic validity of inference procedures for most covariate-adaptive randomisation schemes including Pocock and Simon's minimisation can still be established under some conditions, which will be further explained in Section 4.

To construct asymptotic valid inference procedures, sometimes the explicit form of the covariance matrix D in (R1) is required.

I am less excited about balancing continuous covariates with covariate-adaptive randomisation. The reason is that covariate-adaptive randomisation is mainly use to enhance the credibility of the results of the trial (EMA, Citation2015) in terms of the balancedness of treatments across levels of some common discrete baseline prognostic factors such as institution, disease stage, prior treatment, gender, and age group. In fact, balancedness of marginal levels of these discrete prognostic factors is the main concern of agencies such as the EMA or FDA, and that is why Pocock and Simon's minimisation is popular. I do not see a clear motivation of balancing a continuous baseline covariate at the design stage. If efficiency is the concern, we may simply adjust for this continuous covariate in the inference procedure. It is much easier to construct a valid and efficient inference procedure, compared with to derive a valid inference procedure after balancing a continuous baseline covariate. So far, valid inference procedures after balancing a continuous baseline covariate are mostly model-based.

2. The discussion by Drs. Wang, Susukida, Mojtabai, Amin-Esmaeili and Rosenblum

My review mostly focuses on differences of sample means (or quantiles) and tests in survival analysis. The discussion by Drs. Wang, Susukida, Mojtabai, Amin-Esmaeili, and Rosenblum and their article (Wang et al., Citation2020) open up a wide range of robust inference methods to handle nonlinearity, various outcome types, repeated measures, missing outcomes, etc. Wang et al. (Citation2020) also contain three examples of analyses of trial data, illustrating that the gain due to stratified permuted block randomisation and covariate adjustment could be as high as 36%.

In Section 5.3 of my review, empirical distribution estimators and related quantile estimators valid under covariate-adaptive randomisation are considered. Wang et al. (Citation2020) established the asymptotic normality of Kaplan–Meier estimator under stratified permuted block or biased coin randomisation, which is important for survival analysis. Specifically, they showed that, in a trial with two arms (k = 2), ${\sqrt{n} ({\hat{S}}_{n}^{(a)} - S^{(a)}), t \in [0, τ]}$ , where ${\hat{S}}_{n}^{(a)}$ is the Kaplan–Meier estimator of the survival function $S^{(a)}$ , a is a fixed treatment, and $τ > 0$ is a fixed constant, converges weakly to a mean 0, tight Gaussian process with a covariance function $V^{(a)} (t, t^{'})$ explicitly given in the Supplementary Material of Wang et al. (Citation2020). They also showed that $V^{(a)} (t, t) = {\tilde{V}}^{(a)} (t, t) - \frac{U (t)}{π_{1} π_{2}},$ where ${\tilde{V}}^{(a)} (t, t^{'})$ is the covariance function under simple randomisation and $U (t) > 0$ under stratified permuted block or biased coin randomisation. Again, we see the common phenomenon of reducing variance by applying covariate-adaptive randomisation compared with simple randomisation.

As commented by Wang et al. (Citation2020), the result can be extended to the estimation of survival function with adjusted baseline covariates. Alternatively, one may consider a stratified version of Kaplan–Meier estimator along with the idea in formula (9) of my review.

3. The discussion by Dr. Liu

Dr. Liu's discussion provides useful details and references about asymptotic validity and efficiency of model-assisted inference procedures after covariate-adaptive randomisation and adjustment for covariates. The discussion about the efficiency gain in using ANOVA versus ANCOVA or ANCOVA with treatment-by-covariate interactions started as early as Yang and Tsiatis (Citation2001), continued later by Freedman (Citation2008), Lin (Citation2013) and Wang et al. (Citation2019), and studied under covariate-adaptive randomisation recently by Bugni et al. (Citation2018), Bugni et al. (Citation2019), Liu and Yang (Citation2020), Ma et al. (Citation2020b), Wang et al. (Citation2020) and CitationYe et al. (Citationin press).

I would like to emphasise two points here. The first one is, as pointed out by Dr. Liu, when there are only two treatment arms and equal allocation is used (k = 2 and $π_{1} = π_{2} = 1 / 2$ ), the use of ANCOVA with or without treatment-by-covariate interaction has the same asymptotic efficiency and is guaranteed to be more efficient than the use of ANOVA without adjusting for covariates. However, this phenomenon no longer exists once there are more than two treatment arms even if equal allocation is applied.

The second point is that, asymptotically, the most efficient estimator of the treatment difference $θ = E (Y^{(a)} - Y^{(b)})$ defined in the beginning of Section 5.2 of my review is the ${\hat{θ}}_{A}$ defined in Section 6.1 of my review, which adjusts for covariates by using a working linear model and the ordinary least squares estimator $\begin{aligned} {\hat{β}}_{a} (z) & = {[\sum_{i \in L_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)}^{T}]}^{- 1} \\ \times \sum_{i \in L_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)} Y_{i} \end{aligned}$ of covariate effect within each stratum $L_{a} (z)$ under treatment a and $Z = z$ . As Dr. Liu pointed out, however, when there are small strata formed by levels of $Z$ , the stratum-specific least squares estimator ${\hat{β}}_{a} (z)$ might lead to inferior performance due to over-fitting (Liu & Yang, Citation2020). One modification is to combine ${\hat{β}}_{a} (z)$ and ${\hat{β}}_{b} (z)$ within each stratum level z, although they may estimate different quantities. Alternatively, utilising the fact that baseline covariates $U_{i}$ 's have the same distribution over all treatment arms, CitationYe et al. (Citationin press) recommended to replace the matrix inverse in ${\hat{β}}_{a} (z)$ by an average over all treatment arms to remedy the issue of small strata, which leads to replace ${\hat{β}}_{a} (z)$ by $\begin{aligned} {\tilde{β}}_{a} (z) \\ = {[\frac{1}{n (z)} \sum_{a = 1}^{k} \sum_{i \in L_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)}^{T}]}^{- 1} \\ \times \frac{1}{n_{a} (z)} \sum_{i \in L_{a} (z)} {U_{i} - {\bar{U}}_{a} (z)} Y_{i}, \end{aligned}$ where $n_{a} (z)$ is the number of units in $L_{a} (z)$ and $n (z) = n_{1} (z) + \dots + n_{k} (z)$ . Note that stability issues related with dimensionality for not very large data sets are mainly in the inverses of estimated covariance matrices. Hence, using the inverse of an average may largely remedy the issue of small strata. Some simulation results in CitationYe et al. (Citationin press) show that using ${\tilde{β}}_{a} (z)$ in the estimation of θ leads to better finite-sample performance compared with using ${\hat{β}}_{a} (z)$ or combining ${\hat{β}}_{a} (z)$ and ${\hat{β}}_{b} (z)$ when they actually estimate different quantities.

Finally, another way to handle many covariates is to apply high-dimensional technique as Dr. Liu commented (Ma et al., Citation2020a), or to use variable selection.

4. The discussion by Drs. Ye and Yi

In their discussion, Drs. Ye and Yi clearly described the working models behind estimators ${\hat{θ}}_{S}$ , ${\hat{θ}}_{A}$ and ${\hat{θ}}_{B}$ in Sections 5.2 and 6.1 of my review. This not only provides explanations about the asymptotic relative efficiencies among ${\hat{θ}}_{S}$ , ${\hat{θ}}_{A}$ and ${\hat{θ}}_{B}$ , but also leads to a general working model (formula (1) in the discussion) that produces a class of model-assisted estimators of θ (formula (2) in the discussion) including ${\hat{θ}}_{S}$ , ${\hat{θ}}_{A}$ and ${\hat{θ}}_{B}$ as special cases.

In the beginning of Section 6.1 of my review, $X$ is considered to be the vector of all available baseline covariates, $Z$ is the discrete baseline covariate vector (part of $X$ ) used in covariate-adaptive randomisation, and $U$ is the vector of covariates not in $Z$ but in $X$ to be adjusted for efficiency in the analysis stage. I would like to point out that $U$ may contain some components which are interactions between $Z$ and covariates not in $Z$ . Drs. Ye and Yi's discussion classifies $(Z, U)$ into two categories or vectors, $W$ and $V$ , where $W$ contains covariates having treatment-by-covariate interaction in working model (1) in their discussion and $V$ has no treatment-by-covariate interaction. Note that either $W$ or $V$ could be empty. For example, for ANOVA without using any covariate, both $W$ and $V$ are empty; for classical ANCOVA without considering any treatment-by-covariate interaction, $W$ is empty but $V$ is not; as discussed by Drs. Ye and Yi, ${\hat{θ}}_{S}$ in Section 5.2 of my review corresponds to $W = Z$ and empty $V$ , ${\hat{θ}}_{A}$ in Section 6.1 corresponds to $W = (Z, U)$ and empty $V$ , and ${\hat{θ}}_{B}$ in Section 6.1 corresponds to $W = Z$ and $V = U$ .

In applications, a crucial question is, what is the minimum set of covariates to be included in $W$ or $V$ to ensure that the resulting model-assisted estimator of θ is asymptotically normal with mean θ and variance invariant to the covariate-adaptive randomisation schemes (including Pocock and Simon's minimisation)? As pointed out by Drs. Ye and Yi, a simple answer is that $W$ should contain the dummy variables for all joint levels of $Z$ , and there is no requirement on $V$ . In fact, $V$ is used to not let the dimension of $W$ become too high. Asymptotically, the estimator with $V$ being empty is most efficient, unless some components of $W$ are actually not related with the response. We must balance between adjusting for covariates and over-fitting, for which variable selection may be a useful solution.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Jun Shao

Dr. Jun Shao holds a PhD in statistics from the University of Wisconsin-Madison. He is a Professor of Statistics at the University of Wisconsin-Madison. His research interests include variable selection and inference with high dimensional data, sample surveys, and missing data problems.

References

Bugni, F. A., Canay, I. A., & Shaikh, A. M. (2018). Inference under covariate-adaptive randomization. Journal of the American Statistical Association, 113, 1784–1796. https://doi.org/https://doi.org/10.1080/01621459.2017.1375934
PubMed Web of Science ®Google Scholar
Bugni, F. A., Canay, I. A., & Shaikh, A. M. (2019). Inference under covariate-adaptive randomization with multiple treatments. Quantitative Economics, 10, 1747–1785. https://doi.org/https://doi.org/10.3982/QE1150
Web of Science ®Google Scholar
EMA (2015). Guideline on adjustment for baseline covariates in clinical trials. Committee for Medicinal Products for Human Use, European Medicines Agency.
Google Scholar
Freedman, D. A. (2008). On regression adjustments in experiments with several treatments. Annals of Applied Statistics, 2, 176–196. https://doi.org/https://doi.org/10.1214/07-AOAS143
Web of Science ®Google Scholar
Hu, F., & Zhang, L.-X. (2020). On the theory of covariate-adaptive designs. arXiv:2004.02994.
Google Scholar
Hu, Y., & Hu, F. (2012). Asymptotic properties of covariate-adaptive randomization. Annals of Statistics, 40, 1794–1815. https://doi.org/https://doi.org/10.1214/12-AOS983
Web of Science ®Google Scholar
Li, X., Ma, W., & Hu, F. (in press). Sample size re-estimation for covariate-adaptive randomized clinical trials. Statistics in Medicine, 28.
Google Scholar
Lin, W. (2013). Agnostic notes on regression adjustments to experimental data: Reexamining freedman's critique. Annals of Applied Statistics, 7, 295–318. https://doi.org/https://doi.org/10.1214/12-AOAS583
Web of Science ®Google Scholar
Liu, H., & Yang, Y. (2020). Regression-adjusted average treatment effect estimators in stratified randomized experiments. Biometrika, 107, 935–948. https://doi.org/https://doi.org/10.1093/biomet/asaa038
Web of Science ®Google Scholar
Ma, W., Hu, F., & Zhang, L. (2015). Testing hypotheses of covariate-adaptive randomized clinical trials. Journal of the American Statistical Association, 110, 669–680. https://doi.org/https://doi.org/10.1080/01621459.2014.922469
Web of Science ®Google Scholar
Ma, W., Tu, F., & Liu, H. (2020a). A general theory of regression adjustment for covariate-adaptive randomization: Ols, lasso, and beyond. arXiv:2011.09734.
Google Scholar
Ma, W., Tu, F., & Liu, H. (2020b). Regression analysis for covariate-adaptive randomization: A robust and efficient inference perspective. arXiv:2009.02287.
Google Scholar
Ma, W., Wang, M., & Zhu, H. (in press). Regression analysis for covariate-adaptive randomization: A robust and efficient inference perspective. Statistica Sinica, 30.
Google Scholar
Wang, B., Ogburn, E., & Rosenblum, M. (2019). Analysis of covariance in randomized trials: More precision and valid confidence intervals, without model assumptions. Biometrics, 75, 1391–1400. https://doi.org/https://doi.org/10.1111/biom.v75.4
Web of Science ®Google Scholar
Wang, B., Susukida, R., Mojtabai, R., Amin-Esmaeili, M., & Rosenblum, M. (2020). Model-robust inference for clinical trials that improve precision by stratified randomization and adjustment for additional baseline variables. arXiv:1910.13954v3.
Google Scholar
Yang, L., & Tsiatis, A. (2001). Efficiency study of estimators for a treatment effect in a pretest–posttest trial. The American Statistician, 55, 314–321. https://doi.org/https://doi.org/10.1198/000313001753272466
Web of Science ®Google Scholar
Ye, T., Yi, Y., & Shao, J. (in press). Inference on average treatment effect under minimization and other covariate-adaptive randomization methods. Biometrika, 108.
Google Scholar
Zhu, H., & Hu, F. (2019). Sequential monitoring of covariate-adaptive randomized clinical trials. Statistica Sinica, 29, 265–282.
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Rejoinder on ‘Inference after covariate-adaptive randomization: aspects of methodology and theory’

1. The discussion by Drs. Ma, Zhang and Hu

2. The discussion by Drs. Wang, Susukida, Mojtabai, Amin-Esmaeili and Rosenblum

3. The discussion by Dr. Liu

4. The discussion by Drs. Ye and Yi

Disclosure statement

Notes on contributors

Jun Shao

References

Information for

Open access

Opportunities

Help and information

Rejoinder on ‘Inference after covariate-adaptive randomization: aspects of methodology and theory’

1. The discussion by Drs. Ma, Zhang and Hu

2. The discussion by Drs. Wang, Susukida, Mojtabai, Amin-Esmaeili and Rosenblum

3. The discussion by Dr. Liu

4. The discussion by Drs. Ye and Yi

Disclosure statement

Additional information

Notes on contributors

Jun Shao

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date