1,438
Views
27
CrossRef citations to date
0
Altmetric
Theory and Methods

Accumulation Tests for FDR Control in Ordered Hypothesis Testing

&
Pages 837-849 | Received 01 Jun 2015, Published online: 13 Apr 2017
 

ABSTRACT

Multiple testing problems arising in modern scientific applications can involve simultaneously testing thousands or even millions of hypotheses, with relatively few true signals. In this article, we consider the multiple testing problem where prior information is available (for instance, from an earlier study under different experimental conditions), that can allow us to test the hypotheses as a ranked list to increase the number of discoveries. Given an ordered list of n hypotheses, the aim is to select a data-dependent cutoff k and declare the first k hypotheses to be statistically significant while bounding the false discovery rate (FDR). Generalizing several existing methods, we develop a family of “accumulation tests” to choose a cutoff k that adapts to the amount of signal at the top of the ranked list. We introduce a new method in this family, the HingeExp method, which offers higher power to detect true signals compared to existing techniques. Our theoretical results prove that these methods control a modified FDR on finite samples, and characterize the power of the methods in the family. We apply the tests to simulated data, including a high-dimensional model selection problem for linear regression. We also compare accumulation tests to existing methods for multiple testing on a real data problem of identifying differential gene expression over a dosage gradient. Supplementary materials for this article are available online.

Supplementary Materials

In the supplementary materials we provide additional plots for further examination of the results of our simulated data experiments (Section A) and proofs for all the theoretical results presented in the paper (Section B).

Notes

2 There is one minor caveat here: if the p-values are conservative then it’s possible to have P{pi=1}>0; in this case, accumulation functions such as ForwardStop or HingeExp will fail, since h(1) = +∞. In practice, for any such choice of h, we would want to truncate h at some large but finite value, that is, replace h(t) with h(t)∧C for some large positive C. Since this will reduce the expected value only slightly, that is ∫1 t = 0h(t)∧C dt is only slightly smaller than 1, the FDR and FDP bounds would only be slightly worse.

3 Note that the second assumption in (Equation17) ensures that T is uniquely defined in each of these cases, that is in the middle case where f(1)<μ0-αμ0-μ1<f(0), the inverse f − 1 is well-defined at the value where it is applied.

5 Some additional plots exploring these results are in the supplementary materials.

6 Some additional plots exploring these results are in the supplementary materials.

8 While several existing methods for multiple testing yield guarantees for FDR control even in the case of dependent p-values, the methods we are aware of are quite conservative and yield nearly zero power in this experiment. Specifically, Benjamini and Yekutieli’s modification (Citation2001) of the BH method, the Holm-Bonferroni method (Holm Citation1979), and the Bonferroni correction each yielded, even at target FDR level α = 0.9, no more than two discoveries for this gene expression experiment; in contrast, the accumulation test methods examined here yield thousands of discoveries.

9 Data available at http://www.ncbi.nlm.nih.gov/sites/GDSbrowser?acc=GDS2324 or via the GEOquery package (Davis and Meltzer Citation2007) in R.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 343.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.