432
Views
0
CrossRef citations to date
0
Altmetric
Efficient Computing

A Simple Divide-and-Conquer-based Distributed Method for the Accelerated Failure Time Model

, , & ORCID Icon
Pages 681-698 | Received 30 Nov 2021, Accepted 02 Aug 2023, Published online: 09 Oct 2023

References

  • Battey, H., Fan, J., Liu, H., Lu, J., and Zhu, Z. (2015), “Distributed Estimation and Inference with Statistical Guarantees,” arXiv preprint arXiv:1509.05457.
  • ———(2018), “Distributed Testing and Estimation under Sparse High Dimensional Models,” Annals of Statistics, 46, 1352–1382.
  • Brown, B., and Wang, Y.-G. (2007), “Induced Smoothing for Rank Regression with Censored Survival Times,” Statistics in Medicine, 26, 828–836. DOI: 10.1002/sim.2576.
  • Cai, T., Liu, M., and Xia, Y. (2019), “Privacy Preserving Integrative Regression Analysis of High-Dimensional Heterogeneous Data,” arXiv preprint arXiv:1902.06115.
  • Chen, L., and Zhou, Y. (2019), “Quantile Regression in Big Data: A Divide and Conquer based Strategy,” Computational Statistics & Data Analysis, 144, 106892. DOI: 10.1016/j.csda.2019.106892.
  • Chen, S. X., and Peng, L. (2021), “Distributed Statistical Inference for Massive Data,” The Annals of Statistics, 49, 2851–2869. DOI: 10.1214/21-AOS2062.
  • Chen, X., Liu, W., and Zhang, Y. (2019), “Quantile Regression Under Memory Constraint,” The Annals of Statistics, 47, 3244–3273. DOI: 10.1214/18-AOS1777.
  • Chen, X., and Xie, M. (2014), “A Split-and-Conquer Approach for Analysis of Extraordinarily Large Data,” Statistica Sinica, 24, 1655–1684. DOI: 10.5705/ss.2013.088.
  • Christensen, R., and Johnson, W. (1988), “Modelling Accelerated Failure Time with a Dirichlet Process,” Biometrika, 75, 693–704. DOI: 10.1093/biomet/75.4.693.
  • Chung, M., Long, Q., and Johnson, B. A. (2013), “A Tutorial on Rank-based Coefficient Estimation for Censored Data in Small-and Large-Scale Problems,” Statistics & Computing, 23, 601–614. DOI: 10.1007/s11222-012-9333-9.
  • Cox, D. R. (1972), “Regression Models and Life-Tables,” Journal of the Royal Statistical Society, Series B, 34, 187–202. DOI: 10.1111/j.2517-6161.1972.tb00899.x.
  • Fan, J., Wang, D., Wang, K., and Zhu, Z. (2019), “Distributed Estimation of Principal Eigenspaces,” Annals of Statistics, 47, 3009–3031.
  • Fygenson, M., and Ritov, Y. (1994), “Monotone Estimating Equations for Censored Data,” Annals of Statistics, 22, 732–746.
  • Gao, Y., Liu, W., Wang, H., Wang, X., Yan, Y., and Zhang, R. (2022), “A Review of Distributed Statistical Inference,” Statistical Theory and Related Fields, 6, 89–99. DOI: 10.1080/24754269.2021.1974158.
  • Garber, D., Shamir, O., and Srebro, N. (2017), “Communication-Efficient Algorithms for Distributed Stochastic Principal Component Analysis,” in Proceedings of the 34th International Conference on Machine Learning, pp. 1203–1212.
  • Hajirahimova, M. (2017), “The Big Data Era in Healthcare: Promises and Challenges,” Problems in Information Technology, 8, 64–72. DOI: 10.25045/jpit.v08.i1.08.
  • Hanson, T., and Johnson, W. O. (2002), “Modeling Regression Error with a Mixture of Polya Trees,” Journal of the American Statistical Association, 97, 1020–1033. DOI: 10.1198/016214502388618843.
  • ———(2004), “A Bayesian Semiparametric aft Model for Interval-Censored Data,” Journal of Computational and Graphical Statistics, 13, 341–361.
  • Heller, G. (2007), “Smoothed Rank Regression with Censored Data,” Journal of the American Statistical Association, 102, 552–559. DOI: 10.1198/016214506000001257.
  • Huang, C., and Huo, X. (2015), “A Distributed One-Step Estimator,” Mathematical Programming, 174, 41–76. DOI: 10.1007/s10107-019-01369-0.
  • Huang, J., and Ma, S. (2010), “Variable Selection in the Accelerated Failure Time Model via the Bridge Method,” Lifetime Data Analysis, 16, 176–195. DOI: 10.1007/s10985-009-9144-2.
  • Huang, J., Ma, S., and Xie, H. (2006), “Regularized Estimation in the Accelerated Failure Time Model with High-Dimensional Covariates,” Biometrics, 62, 813–820. DOI: 10.1111/j.1541-0420.2006.00562.x.
  • ———(2007), “Least Absolute Deviations Estimation for the Accelerated Failure Time Model,” Statistica Sinica, pp. 1533–1548.
  • Jin, Z., Lin, D., Wei, L., and Ying, Z. (2003), “Rank-based Inference for the Accelerated Failure Time Model,” Biometrika, 90, 341–353. DOI: 10.1093/biomet/90.2.341.
  • Jin, Z., Lin, D., and Ying, Z. (2006), “On Least-Squares Regression with Censored Data,” Biometrika, 93, 147–161. DOI: 10.1093/biomet/93.1.147.
  • Johnson, L. M., and Strawderman, R. L. (2009), “Induced Smoothing for the Semiparametric Accelerated Failure Time Model: Asymptotics and Extensions to Clustered Data,” Biometrika, 96, 577–590. DOI: 10.1093/biomet/asp025.
  • Jordan, M. I., Lee, J. D., and Yang, Y. (2019), “Communication-Efficient Distributed Statistical Inference,” Journal of the American Statistical Association, 114, 668–681. DOI: 10.1080/01621459.2018.1429274.
  • Kalbfleisch, J. D., and Prentice, R. L. (2011), The Statistical Analysis of Failure Time Data, Hoboken, NJ: Wiley.
  • Komárek, A., Lesaffre, E., and Hilton, J. F. (2005), “Accelerated Failure Time Model for Arbitrarily Censored Data with Smoothed Error Distribution,” Journal of Computational and Graphical Statistics, 14, 726–745. DOI: 10.1198/106186005X63734.
  • Kuo, L., and Mallick, B. (1997), “Bayesian Semiparametric Inference for the Accelerated Failure-Time Model,” Canadian Journal of Statistics, 25, 457–472. DOI: 10.2307/3315341.
  • Lee, J. D., Liu, Q., Sun, Y., and Taylor, J. E. (2017), “Communication-Efficient Sparse Regression,” Journal of Machine Learning Research, 18, 115–144.
  • Li, J., and Jin, B. (2018), “Multi-Threshold Accelerated Failure Time Model,” The Annals of Statistics, 46, 2657–2682. DOI: 10.1214/17-AOS1632.
  • Lian, H., and Fan, Z. (2017), “Divide-and-Conquer for Debiased l1-norm Support Vector Machine in Ultra-High Dimensions,” Journal of Machine Learning Research, 18, 6691–6716.
  • Lin, N., and Xi, R. (2010), “Fast Surrogates of u-statistics,” Computational Statistics & Data Analysis, 54, 16–24. DOI: 10.1016/j.csda.2009.08.009.
  • Lu, C. L., Wang, S., Ji, Z., Wu, Y., Xiong, L., Jiang, X., and Ohno-Machado, L. (2015), “Webdisco: A Web Service for Distributed Cox Model Learning Without Patient-Level Data Sharing,” Journal of the American Medical Informatics Association, 22, 1212–1219. DOI: 10.1093/jamia/ocv083.
  • Pang, M., Platt, R. W., Schuster, T., and Abrahamowicz, M. (2021), “Spline-based Accelerated Failure Time Model,” Statistics in Medicine, 40 481–497. DOI: 10.1002/sim.8786.
  • Peng, L., and Fine, J. P. (2006), “Rank Estimation of Accelerated Lifetime Models with Dependent Censoring,” Journal of the American Statistical Association, 101, 1085–1093. DOI: 10.1198/016214506000000131.
  • Prentice, R. L. (1978), “Linear Rank Tests with Right Censored Data,” Biometrika, 65, 167–179. DOI: 10.1093/biomet/65.1.167.
  • Qiu, Z., Wan, A. T. K., Zhou, Y., and Gilbert, P. B. (2017), “Smoothed Rank Regression for the Accelerated Failure Time Competing Risks Model with Missing Cause of Failure,” Statistica Sinica, 29, 23–46.
  • Shi, C., Lu, W., and Song, R. (2018), “A Massive Data Framework for m-estimators with Cubic-Rate,” Journal of the American Statistical Association, 113, 1698–1709. DOI: 10.1080/01621459.2017.1360779.
  • Tang, L., Zhou, L., and Song, P. X. K. (2020), “Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution,” Journal of Multivariate Analysis, 176, 104567. DOI: 10.1016/j.jmva.2019.104567.
  • Tsiatis, A. A. (1990), “Estimating Regression Parameters Using Linear Rank Tests for Censored Data,” Annals of Statistics, 18, 354–372.
  • van der Vaart, A. W. (1998), Asymptotic Statistics, New York: Cambridge University Press.
  • Volgushev, S., Chao, S. K., and Cheng, G. (2019), “Distributed Inference for Quantile Regression Processes,” Annals of Statistics, 47, 1634–1662.
  • Wang, J., Kolar, M., Srebro, N., and Zhang, T. (2017), “Efficient Distributed Learning with Sparsity,” in Proceedings of the 34th International Conference on Machine Learning, pp. 3636–3645, JMLR. org.
  • Wang, X., Yang, Z., Chen, X., and Liu, W. (2019), “Distributed Inference for Linear Support Vector Machine,” Journal of Machine Learning Research, 20, 1–41.
  • Wang, Y., Palmer, N., Di, Q., Schwartz, J., Kohane, I., and Cai, T. (2021), “A Fast Divide-and-Conquer Sparse Cox Regression,” Biostatistics, 22, 381–401. DOI: 10.1093/biostatistics/kxz036.
  • Wei, L. J. (1992), “The Accelerated Failure Time Model: A Useful Alternative to the Cox Regression Model in Survival Analysis,” Statistics in Medicine, 11, 1871–1879. DOI: 10.1002/sim.4780111409.
  • Wei, L. J., Ying, Z., and Lin, D. (1990), “Linear Regression Analysis of Censored Survival Data based on Rank Tests,” Biometrika, 77, 845–851. DOI: 10.1093/biomet/77.4.845.
  • Xu, G., Shang, Z., and Cheng, G. (2018), “Optimal Tuning for Divide-and-Conquer Kernel Ridge Regression with Massive Data,” in International Conference on Machine Learning, pp. 5483–5491, PMLR.
  • ———(2019), “Distributed Generalized Cross-Validation for Divide-and-Conquer Kernel Ridge Regression and its Asymptotic Optimality,” Journal of Computational and Graphical Statistics, 28, 891–908.
  • Yang, S. (1999), “Censored Median Regression Using Weighted Empirical Survival and Hazard Functions,” Journal of the American Statistical Association, 94, 137–145. DOI: 10.1080/01621459.1999.10473830.
  • Ying, Z. (1993), “A Large Sample Study of Rank Estimation for Censored Regression Data,” Annals of Statistics, 21, 76–99.
  • Ying, Z., Jung, S.-H., and Wei, L.-J. (1995), “Survival Analysis with Median Regression Models,” Journal of the American Statistical Association, 90, 178–184. DOI: 10.1080/01621459.1995.10476500.
  • Zeng, D., and Lin, D. (2007), “Efficient Estimation for the Accelerated Failure Time Model,” Journal of the American Statistical Association, 102, 1387–1396. DOI: 10.1198/016214507000001085.
  • Zhang, Q., Zhang, S., Liu, J., Huang, J., and Ma, S. (2016), “Penalized Integrative Analysis Under the Accelerated Failure Time Model,” Statistica Sinica, 26, 493–508. DOI: 10.5705/ss.2014.194.
  • Zhang, Y., Duchi, J. C., and Wainwright, M. J. (2013), “Communication-Efficient Algorithms for Statistical Optimization,” Journal of Machine Learning Research, 14, 3321–3363.
  • ———(2015), “Divide and Conquer Kernel Ridge Regression: A Distributed Algorithm with Minimax Optimal Rates,” Journal of Machine Learning Research, 16, 3299–3340.
  • Zhao, T., Cheng, G., and Liu, H. (2016), “A Partially Linear Framework for Massive Heterogeneous Data,” Annals of Statistics, 44, 1400–1437.
  • Zhou, M. (2005), “Empirical Likelihood Analysis of the Rank Estimator for the Censored Accelerated Failure Time Model,” Biometrika, 92, 492–498. DOI: 10.1093/biomet/92.2.492.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.