323
Views
0
CrossRef citations to date
0
Altmetric
Short Communications

Comment on ‘Inference after covariate-adaptive randomisation: aspects of methodology and theory’

ORCID Icon
Pages 192-193 | Received 11 Mar 2021, Accepted 16 Mar 2021, Published online: 26 Mar 2021

We congratulate Professor Shao on his exciting and thought-provoking paper and appreciate the Editor's invitation to discuss it. This paper provided a comprehensive review of the methodology and theory for statistical inference under covariate-adaptive randomisation. Covariate-adaptive randomisation is widely used in the design stage of clinical trials to balance baseline covariates that are most relevant to the outcomes. Researchers often use linear regression or analysis of covariance (ANCOVA) to analyse the experimental results in the analysis stage. However, the validity of the resulting inferences is not crystal clear because the usual modelling assumptions might not be justified by covariate-adaptive randomisation. It is essential to develop a model-assisted methodology and theory for statistical inference under covariate-adaptive randomisation, allowing the working model to be arbitrarily misspecified. Professor Shao's paper discussed recent developments in this aspect and made recommendations on using valid and efficient inference procedures under covariate-adaptive randomisation.

As pointed out by Professor Shao, Ye, Yi, et al. (Citation2020) proposed a model-assisted regression approach and showed that the resulting regression-adjusted average treatment effect estimator is more efficient than (as least as efficient as) the difference-in-means estimator, without any modelling assumptions on the potential outcomes and covariates. In other words, the model-assisted inference is efficient and robust to model misspecification.

The efficiency gain and robustness of regression adjustment have been widely investigated under simple randomisation. When there are two treatment arms (treatment and control), Yang and Tsiatis (Citation2001) examined three commonly used regression models for estimating the average treatment effect: YiτAi,YiτAi+XiTβ,YiτAi+XiTβ+Ai(XiX¯)Tγ, where Yi is the observed outcome, Ai is the treatment assignment indicator, Xi is the vector of covariates, and X¯=(1/n)i=1nXi is the vector of covariate means. They showed that these three ordinary-least-squares (OLS) estimators of τ are consistent for the average treatment effect and the third OLS estimator is the most efficient. In fact, as shown by Wang, Ogburn, et al. (Citation2019), when P(Ai=1)=1/2 (equal allocation), the second OLS estimator is as efficient as the third one and the usual OLS variance estimator derived from the second regression is also consistent. Thus, we can use standard statistical software, such as lm in R, to construct confidence intervals or tests.

Recently, the efficiency and robustness of regression adjustment under covariate-adaptive randomisation have witnessed significant advances from different perspectives (Bugni et al., Citation2018Citation2019; Ma et al., Citation2020b; Wang, Susukida, et al., Citation2019; Ye & Shao, Citation2018; Ye, Shao, et al., Citation2020). Under stratified randomisation and a finite population framework, Liu and Yang (Citation2020) also proposed a regression-adjusted average treatment effect estimator and showed that it is more efficient than (as least as efficient as) the stratified difference-in-means estimator, τˆ=zn(z)n{Y¯1(z)Y¯0(z)}. In practice when there exist small strata, the stratum-specific regression-adjusted vectors βˆa(z) proposed by Ye, Yi, et al. (Citation2020) might lead to inferior performance due to overfitting (Liu & Yang, Citation2020). To solve this problem, two independent works Ma et al. (Citation2020b) (for two treatment arms) and Ye, Shao, et al. (Citation2020) (for multiple treatment arms) suggested to use the stratum-common regression-adjusted vectors: βˆa=argminβzizAi[YiY¯a(z){XiX¯a(z)}Tβ]2,a=0,1, where iz denotes sample in stratum z, and X¯a(z) is the sample mean of the covariates under treatment arm a within stratum z. The resulting regression-adjusted average treatment effect estimator is τˆols=zn(z)n([Y¯1(z){X¯1(z)X¯(z)}Tβˆ1][Y¯0(z){X¯0(z)X¯(z)}Tβˆ0]), where X¯(z) is the sample mean of the covariates within stratum z. Ma et al. (Citation2020b) and Ye, Shao, et al. (Citation2020) answered a question raised in Wang, Ogburn, et al. (Citation2019),

It is an open question, to the best of our knowledge, as to what happens when more variables than the stratification indicators are included in the ANCOVA model under such randomisation schemes, in terms of consistency of the ANCOVA estimator and how to compute its asymptotic variance under arbitrary model misspecification.

As shown by Ma et al. (Citation2020b) and Ye, Shao, et al. (Citation2020), τˆols is the OLS estimator of τ in the regression with treatment-by-covariates interactions, (1) YiτAi+UiTβ+Ai{UiU¯(z)}Tγ,(1) where Ui includes all stratum indicators and additional covariates Xi. From a practical point of view, researchers often use a simpler regression (often called ANCOVA): (2) YiτAi+UiTβ.(2) Ma et al. (Citation2020b) showed that, for equal allocation, i.e., P(Ai=1)=1/2, the OLS estimator derived from regression (Equation2) is as efficient as (asymptotically) that obtained from regression (Equation1). More importantly, the OLS variance estimator derived from regression (Equation2) is also consistent. These properties hold for almost all covariate-adaptive randomisation including minimisation. Therefore, it is valid and efficient to use ANCOVA under covariate-adaptive randomisation with two treatment arms and equal allocation, even if the linear model is misspecified. However, for clinical trials with more than two treatment arms, the equivalence of the point estimators obtained from regressions (Equation1) and (Equation2) does not hold anymore (Ye, Shao, et al., Citation2020). As shown by Ye, Shao, et al. (Citation2020), asymptotically, regression (Equation1) is more efficient than (at least as efficient as) regression (Equation2), and a nonparametric variance estimator should be used for valid inferences. Another contribution of Ye, Shao, et al. (Citation2020) is that they established the joint asymptotic normality of (Y¯1(z),,Y¯K(z)), where K is the number of treatment arms. The joint asymptotic normality is useful for testing more general treatment effects.

At the end of the discussion, we would like to point out that regression adjustment for covariate-adaptive randomisation has been extended to high-dimensional settings where the number of covariates is comparable to or even larger than the sample size, see Ma et al. (Citation2020a) for more details.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by National Natural Science Foundation of China [Grant Number 12071242].

Notes on contributors

Hanzhong Liu

Hanzhong Liu, Associate Professor, Center for Statistical Science, Department of Industrial Engineering, Tsinghua University, Beijing, China.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.