Views

CrossRef citations to date

Altmetric

Special Issue on COVID-19

Machine learning for clinical trials in the era of COVID-19

William R. ZameDepartment of Economics and Mathematics, UCLA, Los Angeles, CA, USA;;

Ioana BicaUniversity of Oxford, Oxford, UK;; ;The Alan Turing Institute, London, UK;

Cong ShenDepartment of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, USA;;

Alicia CurthUniversity of Oxford, Oxford, UK;;

Hyun-Suk LeeDepartment of Applied Mathematics and Theoretical Physics, University of Cambridge, UK;;

Stuart BaileyNovartis Pharmaceuticals, Cambridge, MA, USA;

James WeatherallAstraZeneca, Cambridge, UK;;

David WrightAstraZeneca, Cambridge, UK;;

Frank BretzNovartis Pharma AG, Basel, Switzerland;; ;Section for Medical Statistics, Medical University of Vienna, Vienna, Austria;

Mihaela van der SchaarThe Alan Turing Institute, London, UK; ;Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK;; ;Department of Electrical and Computer Engineering, UCLA, Los Angeles, CA, USACorrespondence[email protected]

show all

Figures & data

Table 1. Summary and guide to the more detailed discussion in this paper.

Download CSV Display Table

Figure 1. The observed data contains information about patient characteristics $x$ , assigned treatments and observed (factual outcomes) outcomes. The observed outcomes for the control (blue) and treated (red) patients can be used to train machine learning methods to estimate the response surfaces $g_{0} (x) and g_{1} (x)$ for each treatment option. Using these response functions we can estimate individualized treatment effects and thus identify patients who would benefit most and patients who would benefit least most from receiving the treatment. This would not be possible if we only estimated the average treatment effect.

Figure 2. Given an observational dataset with patient features $X_{i}$ , assigned treatments $T_{i}$ and factual outcomes $Y_{i}$ jointly sampled from the distribution $P_{θ}$ , validation is needed (e.g. Alaa and van der Schaar 2019) to select the causal inference method, out of the large number available (e.g. Causal Forests (Athey and Imbens 2016), NSGP (Alaa and van der Schaar 2019), and GANITE (Yoon et al. 2018)) that will achieve the best estimate of the individualized treatment effects.

Figure 3. Distribution of treatment effects for subgroups identified by R2P and four benchmark methods using simulated data. The vertical axis is the estimated treatment effect; the horizontal axis indexes the subgroups identified by each method. R2P, CCT and CT-A each identify 5 subgroups, CT-H identifies 4 subgroups and CT-L identifies 3. (See the text for the description of the four benchmark methods.) Each box represents the range between the 25th and 75th percentiles of the treatment effects of the test samples; each whisker represents the range between the 5th and 95th percentiles.

Figure 4. Static vs online-learning-based clinical trial design.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Machine learning for clinical trials in the era of COVID-19

Table 1. Summary and guide to the more detailed discussion in this paper.

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Machine learning for clinical trials in the era of COVID-19

Figures & data

Table 1. Summary and guide to the more detailed discussion in this paper.

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date