1,127
Views
2
CrossRef citations to date
0
Altmetric
Theory and Methods

Asymptotic Distribution-Free Independence Test for High-Dimension Data

ORCID Icon, & ORCID Icon
Received 29 Apr 2022, Accepted 21 May 2023, Published online: 26 May 2023
 

Abstract

Test of independence is of fundamental importance in modern data analysis, with broad applications in variable selection, graphical models, and causal inference. When the data is high dimensional and the potential dependence signal is sparse, independence testing becomes very challenging without distributional or structural assumptions. In this article, we propose a general framework for independence testing by first fitting a classifier that distinguishes the joint and product distributions, and then testing the significance of the fitted classifier. This framework allows us to borrow the strength of the most advanced classification algorithms developed from the modern machine learning community, making it applicable to high dimensional, complex data. By combining a sample split and a fixed permutation, our test statistic has a universal, fixed Gaussian null distribution that is independent of the underlying data distribution. Extensive simulations demonstrate the advantages of the newly proposed test compared with existing methods. We further apply the new test to a single cell dataset to test the independence between two types of single cell sequencing measurements, whose high dimensionality and sparsity make existing methods hard to apply. Supplementary materials for this article are available online.

Supplementary Materials

The supplementary materials contain the technical proof and the code for replicating the numerical results.

Acknowledgments

This work used the Bridges-2 system, which is supported by NSF award OAC-1928147 at the Pittsburgh Supercomputing Center. This project was funded by National Institute of Mental Health (NIMH) grant R01MH123184 and NSF DMS-2015492.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 343.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.