1,438
Views
27
CrossRef citations to date
0
Altmetric
Theory and Methods

Accumulation Tests for FDR Control in Ordered Hypothesis Testing

&
Pages 837-849 | Received 01 Jun 2015, Published online: 13 Apr 2017
 

ABSTRACT

Multiple testing problems arising in modern scientific applications can involve simultaneously testing thousands or even millions of hypotheses, with relatively few true signals. In this article, we consider the multiple testing problem where prior information is available (for instance, from an earlier study under different experimental conditions), that can allow us to test the hypotheses as a ranked list to increase the number of discoveries. Given an ordered list of n hypotheses, the aim is to select a data-dependent cutoff k and declare the first k hypotheses to be statistically significant while bounding the false discovery rate (FDR). Generalizing several existing methods, we develop a family of “accumulation tests” to choose a cutoff k that adapts to the amount of signal at the top of the ranked list. We introduce a new method in this family, the HingeExp method, which offers higher power to detect true signals compared to existing techniques. Our theoretical results prove that these methods control a modified FDR on finite samples, and characterize the power of the methods in the family. We apply the tests to simulated data, including a high-dimensional model selection problem for linear regression. We also compare accumulation tests to existing methods for multiple testing on a real data problem of identifying differential gene expression over a dosage gradient. Supplementary materials for this article are available online.

Supplementary Materials

In the supplementary materials we provide additional plots for further examination of the results of our simulated data experiments (Section A) and proofs for all the theoretical results presented in the paper (Section B).

Notes

2 There is one minor caveat here: if the p-values are conservative then it’s possible to have P{pi=1}>0; in this case, accumulation functions such as ForwardStop or HingeExp will fail, since h(1) = +∞. In practice, for any such choice of h, we would want to truncate h at some large but finite value, that is, replace h(t) with h(t)∧C for some large positive C. Since this will reduce the expected value only slightly, that is ∫1 t = 0h(t)∧C dt is only slightly smaller than 1, the FDR and FDP bounds would only be slightly worse.

3 Note that the second assumption in (Equation17) ensures that T is uniquely defined in each of these cases, that is in the middle case where f(1)<μ0-αμ0-μ1<f(0), the inverse f − 1 is well-defined at the value where it is applied.

5 Some additional plots exploring these results are in the supplementary materials.

6 Some additional plots exploring these results are in the supplementary materials.

8 While several existing methods for multiple testing yield guarantees for FDR control even in the case of dependent p-values, the methods we are aware of are quite conservative and yield nearly zero power in this experiment. Specifically, Benjamini and Yekutieli’s modification (Citation2001) of the BH method, the Holm-Bonferroni method (Holm Citation1979), and the Bonferroni correction each yielded, even at target FDR level α = 0.9, no more than two discoveries for this gene expression experiment; in contrast, the accumulation test methods examined here yield thousands of discoveries.

9 Data available at http://www.ncbi.nlm.nih.gov/sites/GDSbrowser?acc=GDS2324 or via the GEOquery package (Davis and Meltzer Citation2007) in R.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.