1,459
Views
7
CrossRef citations to date
0
Altmetric
Theory and Methods

Proper Inference for Value Function in High-Dimensional Q-Learning for Dynamic Treatment Regimes

, &
Pages 1404-1417 | Received 01 Apr 2017, Accepted 01 Jun 2018, Published online: 29 Oct 2018
 

ABSTRACT

Dynamic treatment regimes are a set of decision rules and each treatment decision is tailored over time according to patients’ responses to previous treatments as well as covariate history. There is a growing interest in development of correct statistical inference for optimal dynamic treatment regimes to handle the challenges of nonregularity problems in the presence of nonrespondents who have zero-treatment effects, especially when the dimension of the tailoring variables is high. In this article, we propose a high-dimensional Q-learning (HQ-learning) to facilitate the inference of optimal values and parameters. The proposed method allows us to simultaneously estimate the optimal dynamic treatment regimes and select the important variables that truly contribute to the individual reward. At the same time, hard thresholding is introduced in the method to eliminate the effects of the nonrespondents. The asymptotic properties for the parameter estimators as well as the estimated optimal value function are then established by adjusting the bias due to thresholding. Both simulation studies and real data analysis demonstrate satisfactory performance for obtaining the proper inference for the value function for the optimal dynamic treatment regimes. Supplementary materials for this article are available online.

Supplementary Material

The online supplement contains additional simulation results, and the proofs for the theorems discussed in the article.

Acknowledgment

The authors thank the associate editor and two referees for their constructive comments that led to a significantly improved article.

Additional information

Funding

This work is supported in part by the National Natural Science Foundation of China (11771072 and 11371083), Foundation from China Scholarship Council (No. 201406625026), National Institute of Health and National Science Foundation (2R01NS073671-05A1, 1R01GM124104-01A1, NCI-P01-CA142538, NSF-DMS-1555244).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.