Abstract
Testing a global hypothesis for a set of variables is a fundamental problem in statistics with a wide range of applications. A few well-known classical tests include the Hotelling’s T 2 test, the F-test, and the empirical Bayes based score test. These classical tests, however, are not robust to the signal strength and could have a substantial loss of power when signals are weak or moderate, a situation we commonly encounter in contemporary applications. In this article, we propose a minimax optimal ridge-type set test (MORST), a simple and generic method for testing a global hypothesis. The power of MORST is robust and considerably higher than that of the classical tests when the strength of signals is weak or moderate. In the meantime, MORST only requires a slight increase in computation compared to these existing tests, making it applicable to the analysis of massive genome-wide data. We also provide the generalizations of MORST that are parallel to the traditional Wald test and Rao’s score test in asymptotic settings. Extensive simulations demonstrated the robust power of MORST and that the Type I error of MORST was well controlled. We applied MORST to the analysis of the whole-genome sequencing data from the Atherosclerosis Risk in Communities study, where MORST detected 20%–250% more signal regions than the classical tests. Supplementary materials for this article are available online.
Supplementary Materials
Supplementary materials include the derivation of , the proofs of Theorems 1 and 2, Corollaries 1 and 2, and other technical lemmas, the algorithm to compute , additional simulation results, as well as the genomic landscapes of significant sliding windows in ARIC WGS data analysis.
Acknowledgments
The authors thank the referees for their constructive comments that have helped greatly improve the article. The authors also thank Dr. Eric Boerwinkle for providing the Atherosclerosis Risk in Communities whole genome sequencing data.