References
- Bai, X., A. A. Tsiatis, and S. M. O'Brien. 2013. Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling. Biometrics 69 (4):830–39. doi:https://doi.org/10.1111/biom.12076.
- Bai, X., A. A. Tsiatis, W. Lu, and R. Song. 2017. Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective. Lifetime Data Analysis 23 (4):585–604. doi:https://doi.org/10.1007/s10985-016-9376-x.
- Chakraborty, B., S. Murphy, and V. Strecher. 2010. Inference for non-regular parameters in optimal dynamic treatment regimes. Statistical Methods in Medical Research 19 (3):317–43. doi:https://doi.org/10.1177/0962280209105013.
- Goldberg, Y., and M. R. Kosorok. 2012. Q-learning with censored data. The Annals of Statistics 40 (1):529–60. doi:https://doi.org/10.1214/12-AOS968.
- Jiang, R., W. Lu, R. Song, and M. Davidian. 2016. On estimation of optimal treatment regimes for maximizing t-year survival probability. Journal of the Royal Statistical Society: Series B 88:381–90.
- Lu, W., H. Zhang, and D. Zeng. 2013. Variable selection for optimal treatment decision. Statistical Methods in Medical Research 22 (5):493–504. doi:https://doi.org/10.1177/0962280211428383.
- Kang, S., W. Lu, and J. Zhang. 2018. On estimation of the optimal treatment regime with additive hazards model. Statistica Sinica 28 (3):1539–60. doi:https://doi.org/10.5705/ss.202016.0543.
- Moodie, E., N. Dean, and Y. Sun. 2014. Q-learning: Flexible learning about useful utilities. Statistics in Biosciences 6 (2):223–43. doi:https://doi.org/10.1007/s12561-013-9103-z.
- Murphy, S. A. 2005a. An experimental design for the development of adaptive treatment strategies. Statistics in Medicine 24 (10):1455–81. doi:https://doi.org/10.1002/sim.2022.
- Murphy, S. A. 2005b. A generalization error for q-learning. Journal of Machine Learning Research: JMLR 6:1073–97.
- Robins, J., M. Hernan, and B. Brumback. 2000. Marginal structural models and causal inference in epidemiology. Epidemiology (Cambridge, Mass.) 11 (5):550–60. doi:https://doi.org/10.1097/00001648-200009000-00011.
- Robins, J., L. Orellana, and A. Rotnitzky. 2008. Estimation and extrapolation of optimal treatment and testing strategies. Statistics in Medicine 27 (23):4678–721. doi:https://doi.org/10.1002/sim.3301.
- Robins, J. M. 2004. Optimal structural nested models for optimal sequential decisions. In Proceedings of the second Seattle Symposium in Biostatistics. Lecture Notes in Statistics, 179, 189–326. New York: Springer.
- Watkins, C., and P. Dayan. 1992. Q-learning. Machine Learning 8 (3-4):279–92. doi:https://doi.org/10.1007/BF00992698.
- Zeng, D., and D. Y. Lin. 2008. Efficient resampling methods for nonsmooth estimating functions. Biostatistics 9 (2):355–63. doi:https://doi.org/10.1093/biostatistics/kxm034.
- Zhang, B., A. A. Tsiatis, M. Davidian, M. Zhang, and E. B. Laber. 2012a. Estimating optimal treatment regimes from a classification perspective. Stat 1 (1):103–14. doi:https://doi.org/10.1002/sta.411.
- Zhang, B., A. A. Tsiatis, E. B. Laber, and M. Davidian. 2012b. A robust method for estimating optimal treatment regimes. Biometrics 68 (4):1010–18. doi:https://doi.org/10.1111/j.1541-0420.2012.01763.x.
- Zhao, Y., D. Zeng, E. Laber, R. Song, M. Yuan, and M. Kosorok. 2015. Doubly robust learning for estimating individualized treatment with censored data. Biometrika 102 (1):151–68. doi:https://doi.org/10.1093/biomet/asu050.
- Zhao, Y., D. Zeng, A. J. Rush, and M. R. Kosorok. 2012. Estimating individualized treatment rules using outcome weighted learning. Journal of the American Statistical Association 107 (449):1106–18. doi:https://doi.org/10.1080/01621459.2012.695674.