1,866
Views
1
CrossRef citations to date
0
Altmetric
Machine Learning

Finite-Sample Two-Group Composite Hypothesis Testing via Machine Learning

ORCID Icon & ORCID Icon
Pages 856-865 | Received 10 Nov 2020, Accepted 14 Dec 2021, Published online: 18 Feb 2022

References

  • Allaire, J., and Chollet, F. (2020), “keras: R Interface to ‘Keras’,” R package version 2.2.5.0. Available at https://keras.rstudio.com.
  • Bach, F. (2017). “Breaking the Curse of Dimensionality with Convex Neural Networks,” The Journal of Machine Learning Research, 18, 629–681.
  • Bauer, P., and Kohne, K. (1994), “Evaluation of Experiments with Adaptive Interim Analyses,” Biometrics, 50, 1029–1041. DOI: 10.2307/2533441.
  • Berry, S. M., Carlin, B. P., Lee, J. J., and Muller, P. (2010), Bayesian Adaptive Methods for Clinical Trials, Boca Raton, FL: CRC Press.
  • Boser, B. E., Guyon, I. M., and Vapnik, V. N. (1992), “A Training Algorithm for Optimal Margin Classifiers,” in Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152. DOI: 10.1145/130385.130401.
  • Bretz, F., Koenig, F., Brannath, W., Glimm, E., and Posch, M. (2009), “Adaptive Designs for Confirmatory Clinical Trials,” Statistics in Medicine, 28, 1181–1217. DOI: 10.1002/sim.3538.
  • Chen, M. H., Dey, D. K., Müller, P., Sun, D., and Ye, K. (2010),” Bayesian Clinical Trials,” in Frontiers of Statistical Decision Making and Bayesian Analysis, eds. M.-H. Chen, P. Müller, D. Sun, K. Ye, and D. K. Dey, pp. 257–284, New York: Springer.
  • Chen, M. H., Ibrahim, J. G., Zeng, D., Hu, K., and Jia, C. (2014), “Bayesian Design of Superiority Clinical Trials for Recurrent Events Data with Applications to Bleeding and Transfusion Events in Myelodyplastic Syndrome,” Biometrics, 70, 1003–1013. DOI: 10.1111/biom.12215.
  • Chen, M., Jiang, H., Liao, W., and Zhao, T. (2019), “Nonparametric Regression on Low-Dimensional Manifolds using Deep ReLU Networks,” arXiv preprint, arXiv:1908.01842.
  • Chen, Y., and Liang, K. Y. (2010), “On the Asymptotic Behaviour of the Pseudolikelihood Ratio Test Statistic with Boundary Problems,” Biometrika, 97, 603–620. DOI: 10.1093/biomet/asq031.
  • Cheng, X., and Cloninger, A. (2019), “Classification Logit Two-sample Testing by Neural Networks,” arXiv preprint, arXiv:1909.11298.
  • Chollet, F., and Allaire, J. J. (2018), Deep Learning with R, Shelter Island, NY: Manning Publications.
  • Cui, L., Hung, H. J., and Wang, S. J. (1999), “Modification of Sample Size in Group Sequential Clinical Trials,” Biometrics, 55, 853–857. DOI: 10.1111/j.0006-341x.1999.00853.x.
  • Efron, B., and Tibshirani, R. J. (1994), An Introduction to the Bootstrap, Boca Raton, FL: CRC Press.
  • Elsäßer, A., Regnstrom, J., Vetter, T., Koenig, F., Hemmings, R. J., Greco, M., Papaluca-Amati, M., and Posch, M. (2014), “Adaptive Clinical Trial Designs for European Marketing Authorization: A Survey of Scientific Advice Letters from the European Medicines Agency,” Trials, 15, 383. DOI: 10.1186/1745-6215-15-383.
  • EMA (2007), Reflection Paper on Methodological Issues in Confirmatory Clinical Trials Planned with an Adaptive Design, London: EMEA.
  • FDA (2019), “Adaptive Design Clinical Trials for Drugs and Biologics Guidance for Industry.” Available at https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adaptive-design-clinical-trials-drugs-and-biologics-guidance-industry.
  • Fligner, M. A., and Killeen, T. J. (1976), Distribution-Free Two-Sample Tests for Scale,” Journal of the American Statistical Association, 71, 210–213. DOI: 10.1080/01621459.1976.10481517.
  • Galili, T., and Meilijson, I. (2016), “An Example of an Improvable Rao–Blackwell Improvement, Inefficient Maximum Likelihood Estimator, and Unbiased Generalized Bayes Estimator,” The American Statistician, 70, 108–113. DOI: 10.1080/00031305.2015.1100683.
  • Gilead Inc. (2020), “Gilead Announces Results From Phase 3 Trial of Investigational Antiviral Remdesivir in Patients With Severe COVID-19.” Available at https://www.gilead.com/news-and-press/press-room/press-releases/2020/4/gilead-announces-results-from-phase-3-trial-of-investigational-antiviral-remdesivir-in-patients-with-severe-covid-19.
  • Goodfellow, I., Bengio, Y., and Courville, A. (2016), Deep Learning, Cambridge, MA: MIT Press.
  • Hinton, G., Srivastava, N., and Swersky, K. (2012), “Neural Networks for Machine Learning,” Coursera, video lectures, 264, 1.
  • Ho, T. K. (1995), “Random Decision Forests,” in IEEE Proceedings of 3rd International Conference on Document Analysis and Recognition, 1, 278–282.
  • Pinheiro, J. C., Liu, C., and Wu, Y. N. (2001), “Efficient Algorithms for Robust Estimation in Linear Mixed-Effects Models Using the Multivariate t Distribution,” Journal of Computational and Graphical Statistics, 10, 249–276. DOI: 10.1198/10618600152628059.
  • Kirchler, M., Khorasani, S., Kloft, M., and Lippert, C. (2020), “Two-Sample Testing Using Deep Learning,” in International Conference on Artificial Intelligence and Statistics, pp. 1387–1398.
  • Kübler, J. M., Jitkrittum, W., Schölkopf, B., and Muandet, K. (2020), “Learning Kernel Tests Without Data Splitting,” arXiv preprint, arXiv:2006.02286.
  • Lange, K. L., Little, R. J., and Taylor, J. M. (1989), “Robust Statistical Modeling Using the t Distribution,” Journal of the American Statistical Association, 84, 881–896. DOI: 10.2307/2290063.
  • Lehmann, E. L., and Romano, J. P. (2006), Testing Statistical Hypotheses, New York: Springer.
  • Levene, H. (1961), “Robust Tests for Equality of Variances,” Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling, eds. I. Olkin, S. G. Ghurye, W. Hoeffding, W. G. Madow, and H. B. Mann, pp. 279–292, Palo Alto: Stanford University Press.
  • Lin, M., Lee, S., Zhen, B., Scott, J., Horne, A., Solomon, G., and Russek-Cohen, E. (2016), “CBER’s Experience with Adaptive Design Clinical Trials,” Therapeutic Innovation & Regulatory Science, 50, 195–203.
  • Liu, J., and Coull, B. (2017), “Robust Hypothesis Test for Nonlinear Effect with Gaussian Processes,” in Advances in Neural Information Processing Systems, pp. 795–803.
  • Liu, F., Xu, W., Lu, J., Zhang, G., Gretton, A., and Sutherland, D. J. (2020), “Learning Deep Kernels for Non-Parametric Two-Sample Tests,” arXiv preprint, arXiv:2002.09116.
  • Mehta, C. R., and Pocock, S. J. (2011), “Adaptive Increase in Sample Size When Interim Results are Promising: A Practical Guide with Examples,” Statistics in Medicine, 30, 3267–3284. DOI: 10.1002/sim.4102.
  • Mhaskar, H. N. (1996), “Neural Networks for Optimal Approximation of Smooth and Analytic Functions,” Neural Computation, 8, 164–177. DOI: 10.1162/neco.1996.8.1.164.
  • National Institutes of Health. (2020a), “Adaptive COVID-19 Treatment Trial (ACTT).” Available at https://clinicaltrials.gov/ct2/show/NCT04280705.
  • National Institutes of Health (2020b), “NIH Clinical Trial Shows Remdesivir Accelerates Recovery from Advanced COVID-19.” Available at https://www.niaid.nih.gov/news-events/nih-clinical-trial-shows-remdesivir-accelerates-recovery-advanced-covid-19.
  • National Institutes of Health (2020c), “Study to Evaluate the Safety and Antiviral Activity of Remdesivir (GS-5734TM ) in Participants With Severe Coronavirus Disease (COVID-19).” Available at https://clinicaltrials.gov/ct2/show/NCT04292899.
  • Neuenschwander, B., Capkun-Niggli, G., Branson, M., and Spiegelhalter, D. J. (2010), “Summarizing Historical Information on Controls in Clinical Trials,” Clinical Trials, 7, 5–18. DOI: 10.1177/1740774509356002.
  • Neyman, J., and Pearson, E. S. (1933), “IX. On the Problem of the Most Efficient Tests of Statistical Hypotheses,” Philosophical Transactions of the Royal Society of London, Series A, Containing Papers of a Mathematical or Physical Character, 231, 289–337.
  • Shen, Z., Yang, H., and Zhang, S. (2019), “Nonlinear Approximation via Compositions,” Neural Networks, 119, 74–84. DOI: 10.1016/j.neunet.2019.07.011.
  • Vogel, C. R. (2002), Computational Methods for Inverse Problems, Philadelphia: Society for Industrial and Applied Mathematics.
  • Wanke, P. F. (2008), “The Uniform Distribution as a First Practical Approach to New Product Inventory Management,” International Journal of Production Economics, 114, 811–819. DOI: 10.1016/j.ijpe.2008.04.004.
  • Welch, B. L. (1951), “On the Comparison of Several Mean Values: An Alternative Approach,” Biometrika, 38, 330–336. DOI: 10.1093/biomet/38.3-4.330.
  • Yarotsky, D. (2017), “Error Bounds for Approximations with Deep ReLU Networks,” Neural Networks, 94, 103–114. DOI: 10.1016/j.neunet.2017.07.002.
  • Zajicek, J. P., Hobart, J. C., Slade, A., Barnes, D., Mattison, P. G., and MUSEC Research Group. (2012), “Multiple Sclerosis and Extract of Cannabis: Results of the MUSEC Trial,” Journal of Neurology, Neurosurgery & Psychiatryiometrics, 83, 1125–1132.