The American Statistician Volume 77, 2023 - Issue 4

General

Hypothesis Testing for Matched Pairs with Missing Data by Maximum Mean Discrepancy: An Application to Continuous Glucose Monitoring

Marcos Matabuena (corresponding author), Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain

Paulo Félix, Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain

Marc Ditzhaus, Otto-von-Guericke-Universität Magdeburg, Fakultät für Mathematik (FMA), Institut für Mathematische Stochastik (IMST), Magdeburg, Germany

Juan Vidal, Departamento de Electrónica e Computación, Universidade de Santiago de Compostela, Santiago de Compostela, Spain

Francisco Gude, Unidad de Epidemiología Clínica, Hospital Clínico Universitario de Santiago, Santiago de Compostela, Spain

Pages 357-369 | Received 23 Jun 2022, Accepted 02 Apr 2023, Published online: 30 May 2023

https://doi.org/10.1080/00031305.2023.2200512

