Supplementing or Replacing p

An Introduction to Second-Generation p-Values

References

  • Barnard, G. (1949), “Statistical Inference,” Journal of the Royal Statistical Society, Series B, 11, 115–149. DOI: 10.1111/j.2517-6161.1949.tb00028.x.
  • Bayarri, M. J., Benjamin, D. J., Berger, J. O., and Sellke, T. M. (2016), “Rejection Odds and Rejection Ratios: A Proposal for Statistical Practice in Testing Hypotheses,” Journal of Mathematical Psychology, 72, 90–103. DOI: 10.1016/j.jmp.2015.12.007.
  • Benjamin, D. J., Berger, J. O. et al. (2018), “Redefine Statistical Significance,” Nature Human Behaviour, 2, 6–10. DOI: 10.1038/s41562-017-0189-z.
  • Benjamini, Y., and Hochberg, Y. (1995), “Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing,” Journal of the Royal Statistical Society, Series B, 57, 289–300. DOI: 10.1111/j.2517-6161.1995.tb02031.x.
  • Berger, J. O. (2003), “Could Fisher, Jeffreys and Neyman Have Agreed on Testing?” Statistical Science, 18, 1–32. DOI: 10.1214/ss/1056397485.
  • Berger, J. O., and Sellke, T. (1987), “Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence,” Journal of the American Statistical Association, 82, 112–122. DOI: 10.2307/2289131.
  • Berkson, J. (1942), “Tests of Significance Considered as Evidence,” Journal of the American Statistical Association, 37, 325–335. DOI: 10.1080/01621459.1942.10501760.
  • Blume, J. D. (2002), “Likelihood Methods for Measuring Statistical Evidence,” Statistics in Medicine, 21, 2563–2599. DOI: 10.1002/sim.1216.
  • Blume, J. D. (2011), “Likelihood and Its Evidential Framework,” in Handbook of the Philosophy of Science: Philosophy of Statistics, eds. D. M. Gabbay and J. Woods, San Diego, CA: North Holland, pp. 493–511.
  • Blume, J. D., McGowan, L. D., Greevy, R. A., and Dupont, W. D. (2018), “Second-Generation p-Values: Improved Rigor, Reproducibility, & Transparency in Statistical Analyses,” PLoS One, 13, e0188299. DOI: 10.1371/journal.pone.0188299.
  • Blume, J. D., and Peipert, J. F. (2003), “What Your Statistician Never Told You About P-Values,” Journal of the American Association of Gynecologic Laparoscopists, 10, 439–444, PMID: 14738627. DOI: 10.1016/S1074-3804(05)60143-0.
  • Cohen, J. (1994), “The Earth Is Round (p < .05),” American Psychologist, 49, 997–1003.
  • Cornfield, J. (1966), “Sequential Trials, Sequential Analysis, and the Likelihood Principle,” The American Statistician, 20, 18–23. DOI: 10.2307/2682711.
  • Cristea, I. A., and Ioannidis, J. P. A. (2018), “P-Values in Display Items Are Ubiquitous and Almost Invariably Significant: A Survey of Top Science Journals,” PLoS One, 13, e0197440. DOI: 10.1371/journal.pone.0197440.
  • Dupont, W. D. (1983), “Sequential Stopping Rules and Sequentially Adjusted p-Values: Does One Require the Other?” (with discussion), Controlled Clinical Trials, 4, 3–35. DOI: 10.1016/S0197-2456(83)80003-8.
  • Edwards, W., Lindman, H., and Savage, L. J. (1963), “Bayesian Statistical Inference for Psychological Research,” Psychological Review, 70, 193–242. DOI: 10.1037/h0044139.
  • Fisher, R. A. (1959), Statistical Methods and Scientific Inference (2nd ed.), New York: Hafner.
  • Good, I. J. (2007), “C420. The Existence of Sharp Null Hypotheses,” Journal of Statistical Computation and Simulation, 49, 241–242. DOI: 10.1080/00949659408811587.
  • Good, I. J., and Osteyee, D. B. (1974), Information, Weight of Evidence, the Singularity Between Probability Measures and Signal Detection, Lecture Notes in Mathematics, New York: Springer-Verlag.
  • Goodman, S. N. (1993), “p-Values, Hypothesis Tests, and Likelihood: Implications for Epidemiology of a Neglected Historical Debate,” American Journal of Epidemiology, 137, 485–496. DOI: 10.1093/oxfordjournals.aje.a116700.
  • Greenland, S., Senn, S., Rothman, K. J., Carlin, J. B., Poole, C., Goodman, S. N., and Altman, D. G. (2016), “Statistical Tests, P Values, Confidence Intervals, and Power: A Guide to Misinterpretations,” European Journal of Epidemiology, 31, 337–350. DOI: 10.1007/s10654-016-0149-3.
  • ICPCG (2018), International Consortium for Prostate Cancer Genetics, Genome Wide Association Study of Familial Prostate Cancer, available at https://www.icpcg.org/; dbGaP Study Accession: phs000733.v1.p1, available at: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000733.v1.p1.
  • Ioannidis, J. P. A. (2018), “The Proposal to Lower P-Value Thresholds to 0.005,” JAMA, 319, 1429–1430. DOI: 10.1001/jama.2018.1536.
  • Johnson, V. E. (2013), “Revised Standards for Statistical Evidence,” Proceedings of the National Academy of Sciences, 110, 19313–19317. DOI: 10.1073/pnas.1313476110.
  • Kass, R. E., and Raftery, A. E. (1995), “Bayes Factors,” Journal of the American Statistical Association, 90, 773–795. DOI: 10.1080/01621459.1995.10476572.
  • Lakens, D., Adolfi, F. G., Albers, C. J., Anvari, F., Apps, M. A., Argamon, S. E., Baguley, T., Becker, R. B., Benning, S. D., Bradford, D. E., and Buchanan, E. M. (2018), “Justify Your Alpha,” Nature Human Behaviour, 2, 168–171. DOI: 10.1038/s41562-018-0311-x.
  • Lehmann, E. L. (1986), Testing Statistical Hypotheses (2nd ed.), New York: Wiley [1st ed. (1959)].
  • Morrison, D. E., and Henkel, R. E. (1970), The Significance Test Controversy, Chicago, IL: Aldine.
  • Royall, R. M. (1986), “The Effect of Sample Size on the Meaning of Significance Tests,” The American Statistician, 40, 313–315. DOI: 10.2307/2684616.
  • Royall, R. M. (1997), Statistical Evidence: A Likelihood Paradigm, London: Chapman and Hall.
  • Savage, L. J. (1962), “The Foundations of Statistics Reconsidered,” in Studies in Subjective Probability, eds. H. E. Kyburg Jr. and H. E. Smokler, New York: Wiley.
  • Schaid, D. J., and Chang, B. L. (2005), “Description of the International Consortium for Prostate Cancer Genetics, and Failure to Replicate Linkage of Hereditary Prostate Cancer to 20q13,” Prostate, 63, 276–290. DOI: 10.1002/pros.20198.
  • Spiegelhalter, D. J., Abrams, K. R., and Myles, J. P. (2004), Bayesian Approaches to Clinical Trials and Health-Care Evaluation, West Sussex, England: Wiley.
  • Storey, J. D. (2002), “A Direct Approach to False Discovery Rates,” Journal of the Royal Statistical Society, Series B, 64, 479–498. DOI: 10.1111/1467-9868.00346.
  • Storey, J. D. (2003), “The Positive False Discovery Rate: A Bayesian Interpretation and the q-Value,” Annals of Statistics, 31, 2013–2035. DOI: 10.1214/aos/1074290335.
  • Wacholder, S., Chanock, S., Garcia-Closas, M., El Ghormli, L., and Rothman, N. (2004), “Assessing the Probability That a Positive Report Is False: An Approach for Molecular Epidemiology Studies,” Journal of the National Cancer Institute, 96, 434–442. DOI: 10.1093/jnci/djh075.
  • Wasserstein, R. L., and Lazar, N. A. (2016), “The ASA’s Statement on p-Values: Context, Process, and Purpose,” The American Statistician, 70, 129–133. DOI: 10.1080/00031305.2016.1154108.