325
Views
2
CrossRef citations to date
0
Altmetric
General

Hypothesis Testing for Matched Pairs with Missing Data by Maximum Mean Discrepancy: An Application to Continuous Glucose Monitoring

, , , &
Pages 357-369 | Received 23 Jun 2022, Accepted 02 Apr 2023, Published online: 30 May 2023

References

  • Ahlqvist, E., Tuomi, T., and Groop, L. (2019), “Clusters Provide a Better Holistic View of Type 2 Diabetes than Simple Clinical Features,” The Lancet Diabetes & Endocrinology, 7, 668–669. DOI: 10.1016/S2213-8587(19)30257-8.
  • Akritas, M. G., Antoniou, E. S., and Kuha, J. (2006), “Nonparametric Analysis of Factorial Designs with Random Missingness: Bivariate Data,” Journal of the American Statistical Association, 101, 1513–1526. DOI: 10.1198/016214506000000537.
  • Akritas, M. G., Kuha, J., and Osgood, D. W. (2002), “A Nonparametric Approach to Matched Pairs with Missing Data,” Sociological Methods & Research, 30, 425–454. DOI: 10.1177/0049124102030003006.
  • Amro, L., Konietschke, F., and Pauly, M. (2019), “Multiplication-Combination Tests for Incomplete Paired Data,” Statistics in Medicine, 38, 3243–3255. DOI: 10.1002/sim.8178.
  • Amro, L., and Pauly, M. (2017), “Permuting Incomplete Paired Data: A Novel Exact and Asymptotic Correct Randomization Test,” Journal of Statistical Computation and Simulation, 87, 1148–1159. DOI: 10.1080/00949655.2016.1249871.
  • Amro, L., Pauly, M., and Ramosaj, B. (2021), “Asymptotic-based Bootstrap Approach for Matched Pairs with Missingness in a Single Arm,” Biometrical Journal, 63, 1389–1405. DOI: 10.1002/bimj.202000051.
  • Aronszajn, N. (1950), “Theory of Reproducing Kernels,” Transactions of the American Mathematical Society, 68, 337–404. DOI: 10.1090/S0002-9947-1950-0051437-7.
  • Bacigál, T., Jágr, V., and Mesiar, R. (2011), “Non-Exchangeable Random Variables, Archimax Copulas and their Fitting to Real Data,” Kybernetika, 47, 519–531.
  • Battelino, T., Danne, T., Bergenstal, R. M., Amiel, S. A., Beck, R., Biester, T., Bosi, E., Buckingham, B. A., Cefalu, W. T., Close, K. L. et al. (2019), ‘Clinical Targets for Continuous Glucose Monitoring Data Interpretation: Recommendations from the International Consensus on Time in Range,” Diabetes Care, 42, 1593–1603. DOI: 10.2337/dci19-0028.
  • Beck, R. W., Bergenstal, R. M., Riddlesworth, T. D., Kollman, C., Li, Z., Brown, A. S., and Close, K. L. (2019), “Validation of Time in Range as an Outcome Measure for Diabetes Clinical Trials,” Diabetes Care, 42, 400–405. DOI: 10.2337/dc18-1444.
  • Berlinet, A., and Thomas-Agnan, C. (2011), Reproducing Kernel Hilbert Spaces in Probability and Statistics, New York: Springer.
  • Bigot, J., Gouet, R., Klein, T., and López, A. (2017), “Geodesic PCA in the wasserstein Space by Convex PCA,” Annales de l’Institut Henri Poincaré, 53, 1–26.
  • Chia, C. W., Egan, J. M., and Ferrucci, L. (2018), “Age-Related Changes in Glucose Metabolism, Hyperglycemia, and Cardiovascular Risk,” Circulation Research, 123, 886–904. DOI: 10.1161/CIRCRESAHA.118.312806.
  • Dennis, J. (2020), “Precision Medicine in Type 2 Diabetes: Using Individualized Prediction Models to Optimize Selection of Treatment,” Diabetes, 69, 2075–2085. DOI: 10.2337/dbi20-0002.
  • Derrick, B., Ruck, A., Toher, D., and White, P. (2018), “Tests for Equality of Variances between Two Samples which Contain both Paired Observations and Independent Observations,” Journal of Applied Quantitative Methods, 13, 36–47.
  • Dubey, P., and Müller, H.-G. (2020), “Functional Models for Time-Varying Random Objects,” Journal of the Royal Statistical Society, Series B, 82, 275–327. DOI: 10.1111/rssb.12337.
  • Durante, F., and Mesiar, R. (2010), “L ∞ Measure of Non-exchangeability for Bivariate Extreme Value and Archimax Copulas,” Journal of Mathematical Analysis and Applications, 369, 610–615.
  • Ekbohm, G. (1976), “Comparing Means in the Paired Case with Missing Data on one Response,” Biometrika, 63, 169–172. DOI: 10.1093/biomet/63.1.169.
  • Fernandez, T., and Gretton, A. (2019), “A Maximum-Mean-Discrepancy Goodness-of-Fit Test for Censored Data,” in The 22nd International Conference on Artificial Intelligence and Statistics, pp. 2966–2975, PMLR.
  • Ferraty, F., and Vieu, P. (2006), Nonparametric Functional Data Analysis: Theory and Practice (Vol. 76), New York: Springer.
  • Fong, Y., Huang, Y., Lemos, M. P., and Mcelrath, M. J. (2018), “Rank-based Two-Sample Tests for Paired Data with Missing Values,” Biostatistics, 19, 281–294. DOI: 10.1093/biostatistics/kxx039.
  • França, G., Rizzo, M. L., and Vogelstein, J. T. (2021), “Kernel k-groups via Hartigan’s Method,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 4411–4425. DOI: 10.1109/TPAMI.2020.2998120.
  • Gaigall, D. (2020), “Testing Marginal Homogeneity of a Continuous Bivariate Distribution with Possibly Incomplete Paired Data,” Metrika, 83, 437–465. DOI: 10.1007/s00184-019-00742-5.
  • Garreau, D., Jitkrittum, W., and Kanagawa, M. (2017), “Large Sample Analysis of the Median Heuristic,” arXiv preprint:1707.07269.
  • Ghosal, R., Varma, V. R., Volfson, D., Hillel, I., Urbanek, J., Hausdorff, J. M., Watts, A., and Zipunnikov, V. (2021), “Distributional Data Analysis via Quantile Functions and its Application to Modeling Digital Biomarkers of Gait in Alzheimer’s Disease,” Biostatistics, kxab041. DOI: 10.1093/biostatistics/kxab041.
  • Gretton, A., Borgwardt, K. M., Rasch, M. J., Schölkopf, B., and Smola, A. (2012), “A Kernel Two-Sample Test,” The Journal of Machine Learning Research, 13, 723–773.
  • Gude, F., Díaz-Vidal, P., Rúa-Pérez, C., Alonso-Sampedro, M., Fernández-Merino, C., Rey-García, J., Cadarso-Suárez, C., Pazos-Couselo, M., García-López, J. M., and Gonzalez-Quintela, A. (2017), “Glycemic Variability and its Association with Demographics and Lifestyles in a General Adult Population,” Journal of Diabetes Science and Technology, 11, 780–790. DOI: 10.1177/1932296816682031.
  • Guo, B., and Yuan, Y. (2017), “A Comparative Review of Methods for Comparing Means Using Partially Paired Data,” Statistical Methods in Medical Research, 26, 1323–1340. DOI: 10.1177/0962280215577111.
  • Herder, C., and Roden, M. (2022), “A Novel Diabetes Typology: Towards Precision Diabetology from Pathogenesis to Treatment,” Diabetology, 65, 1770–1781. DOI: 10.1007/s00125-021-05625-x.
  • Hofmann, T., Schölkopf, B., and Smola, A. J. (2008), “Kernel Methods in Machine Learning,” The Annals of Statistics, 36, 1171–1220. DOI: 10.1214/009053607000000677.
  • Janssen, A., and Pauls, T. (2003), “How Do Bootstrap and Permutation Tests Work?” The Annals of statistics, 31, 768–806. DOI: 10.1214/aos/1056562462.
  • Konietschke, F., Harrar, S. W., Lange, K., and Brunner, E. (2012), “Ranking Procedures for Matched Pairs with Missing Data - Asymptotic Theory and a Small Sample Approximation,” Computational Statistics & Data Analysis, 56, 1090–1102. DOI: 10.1016/j.csda.2011.03.022.
  • Kosorok, M. R. (2008), Introduction to Empirical Processes and Semiparametric Inference, New York: Springer.
  • Kosorok, M. R., and Laber, E. B. (2019), “Precision Medicine,” Annual Review of Statistics and its Application, 6, 263–286. DOI: 10.1146/annurev-statistics-030718-105251.
  • Kuan, P. F., and Huang, B. (2013), “A Simple and Robust Method for Partially Matched Samples using the p-values Pooling Approach,” Statistics in Medicine, 32, 3247–3259. DOI: 10.1002/sim.5758.
  • Leong, K. S., and Wilding, J. P. (1999), “Obesity and Diabetes,” Best Practice & Research Clinical Endocrinology & Metabolism, 13, 221–237. DOI: 10.1053/beem.1999.0017.
  • Leucht, A., and Neumann, M. H. (2013), “Dependent Wild Bootstrap for Degenerate U- and V-Statistics,” Journal of Multivariate Analysis, 117, 257–280. DOI: 10.1016/j.jmva.2013.03.003.
  • Liu, Y., and Fan, Y. (2023), “Biased-Sample Empirical Likelihood Weighting for Missing Data Problems: An Alternative to Inverse Probability Weighting,” Journal of the Royal Statistical Society, Series B, 85, 67–83. DOI: 10.1093/jrsssb/qkac006.
  • Lu, J., Ma, X., and Zhou, J. (2018), “Association of Time in Range, as Assessed by Continuous Glucose Monitoring, with Diabetic Retinopathy in Type 2 Diabetes,” Diabetes Care, 41, 2370–2376. DOI: 10.2337/dc18-1131.
  • Lu, J., Wang, C., Shen, Y., Chen, L., Zhang, L., Cai, J., Lu, W., Zhu, W., Hu, G., Xia, T., and Zhou, J. (2020), “Time in Range in Relation to All-Cause and Cardiovascular Mortality in Patients With Type 2 Diabetes: A Prospective Cohort Study,” Diabetes Care, 44, 549–555. DOI: 10.2337/dc20-1862.
  • Ma, X., and Wang, J. (2020), “Robust Inference Using Inverse Probability Weighting,” Journal of the American Statistical Association, 115, 1851–1860. DOI: 10.1080/01621459.2019.1660173.
  • Martínez-Camblor, P., Corral, N., and María de la Hera, J. (2013), “Hypothesis Test for Paired Samples in the Presence of Missing Data,” Journal of Applied Statistics, 40, 76–87. DOI: 10.1080/02664763.2012.734795.
  • Matabuena, M., Félix, P., García-Meixide, C., and Gude, F. (2022), “Kernel Machine Learning Methods to Handle Missing Responses with Complex Predictors. Application Modelling Five-year Glucose Changes using Distributional Representations,” Computer Methods and Programs in Biomedicine, 221, 106905. DOI: 10.1016/j.cmpb.2022.106905.
  • Matabuena, M., and Petersen, A. (2021), “Distributional Data Analysis of Accelerometer Data from the NHANES Database Using Nonparametric Survey Regression Models,” arXiv:2104.01165.
  • Matabuena, M., Petersen, A., Vidal, J. C., and Gude, F. (2021), “Glucodensities: a New Representation of Glucose Profiles Using Distributional Data Analysis,” Statistical Methods in Medical Research, 30, 1445–1464. DOI: 10.1177/0962280221998064.
  • Muandet, K., Fukumizu, K., Sriperumbudur, B., and Schölkopf, B. (2017), “Kernel Mean Embedding of Distributions: A Review and Beyond,” Foundations and Trends in Machine Learning, 10, 1–141. DOI: 10.1561/2200000060.
  • Neuhaus, G. (1977), “Functional Limit Theorems for u-statistics in the Degenerate Case,” Journal of Multivariate Analysis, 7, 424–439. DOI: 10.1016/0047-259X(77)90083-5.
  • Petersen, A., Liu, X., and Divani, A. (2021), “Wasserstein F-tests and Confidence Bands for the Fréchet Regression of Density Response Curves,” The Annals of Statistics, 49, 590–611. DOI: 10.1214/20-AOS1971.
  • Petersen, A., and Müller, H.-G. (2016), “Functional Data Analysis for Density Functions by Transformation to a Hilbert Space,” The Annals of Statistics, 44, 183–218. DOI: 10.1214/15-AOS1363.
  • Qi, Q., Yan, L., and Tian, L. (2019), “Testing Equality of Means in Partially Paired Data with Incompleteness in Single Response,” Statistical Methods in Medical Research, 28, 1508–1522. DOI: 10.1177/0962280218765007.
  • Rizzo, M. L., and Székely, G. J. (2010), “Disco Analysis: A Nonparametric Extension of Analysis of Variance,” The Annals of Applied Statistics, 4, 1034–1055. DOI: 10.1214/09-AOAS245.
  • Rubin, D. B. (1976), “Inference and Missing Data,” Biometrika, 63, 581–592. DOI: 10.1093/biomet/63.3.581.
  • Samawi, H. M., Helu, A., and Vogel, R. (2011), “A Nonparametric Test of Symmetry based on the Overlapping Coefficient,” Journal of Applied Statistics, 38, 885–898. DOI: 10.1080/02664761003692365.
  • Samawi, H. M., and Vogel, R. (2014), “Notes on Two Sample Tests for Partially Correlated (Paired) Data,” Journal of Applied Statistics, 41, 109–117. DOI: 10.1080/02664763.2013.830285.
  • Schölkopf, B., and Smola, A. (2002), Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond, Cambridge, MA: MIT Press.
  • Serfling, R. (1981), Approximation Theorems of Mathematical Statistics, Wiley Series in Probability and Statistics, New York: Wiley.
  • Shen, C., and Vogelstein, J. T. (2021), “The Exact Equivalence of Distance and Kernel Methods in Hypothesis Testing,” AStA Advances in Statistical Analysis, 105, 385–403. DOI: 10.1007/s10182-020-00378-1.
  • Smola, A., Gretton, A., Song, L., and Schölkopf, B. (2007), “A Hilbert Space Embedding for Distributions,” in Proceedings of the 18th International Conference on Algorithmic Learning Theory, pp. 13–31.
  • Sriperumbudur, B. K., Fukumizu, K., and Lanckriet, G. R. (2011), “Universality, Characteristic Kernels and RKHS Embedding of Measures,” Journal of Machine Learning Research, 12, 2389–2410.
  • Székely, G. J., and Rizzo, M. L. (2004), “Testing for Equal Distributions in High Dimension,” InterStat, 5, 1249–1272.
  • Székely, G. J., and Rizzo, M. L. (2013), “Energy Statistics: A Class of Statistics based on Distances,” Journal of Statistical Planning and Inference, 143, 1249–1272. DOI: 10.1016/j.jspi.2013.03.018.
  • Székely, G. J., Rizzo, M. L., and Bakirov, N. K. (2007), “Measuring and Testing Dependence by Correlation of Distances,” The Annals of Statistics, 35, 2769–2794. DOI: 10.1214/009053607000000505.
  • Takerkart, S., Auzias, G., Thirion, B., and Ralaivola, L. (2014), “Graph-based Inter-Subject Pattern Analysis of fMRI Data,” PLoS One, 9, e104586. DOI: 10.1371/journal.pone.0104586.
  • Tawn, J. (1988), “Bivariate Extreme Value Theory: Models and Estimation,” Biometrika, 75, 397–415. DOI: 10.1093/biomet/75.3.397.
  • Tenzer, Y., Mandel, M., and Zuk, O. (2022), “Testing Independence Under Biased Sampling,” Journal of the American Statistical Association, 117, 2194–2206. DOI: 10.1080/01621459.2021.1912758.
  • Tsiatis, A. (2007), Semiparametric Theory and Missing Data, New York: Springer.
  • Van Der Vaart, A. W., and Wellner, J. A. (1996), “Weak Convergence,” in Weak Convergence and Empirical Processes, pp. 16–28, New York: Springer.
  • Vapnik, V. (2000), The Nature of Statistical Learning Theory, New York: Springer.
  • Verbeke, G., and Molenberghs, G. (2009), Linear Mixed Models for Longitudinal Data, New York: Springer.
  • Xu, J., and Harrar, S. W. (2012), “Accurate Mean Comparisons for Paired Samples with Missing Data: An Application to a Smoking-Cessation Trial,” Biometrical Journal, 54, 281–295. DOI: 10.1002/bimj.201100053.
  • Yu, D., Lim, J., Liang, F., Kim, K., Kim, B. S., and Jang, W. (2012), “Permutation Test for Incomplete Paired Data with Application to cDNA Microarray Data,” Computational Statistics & Data Analysis, 56, 510–521. DOI: 10.1016/j.csda.2011.08.012.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.