References
- Benjamin, D. J., Berger, J. O., Johannesson, M., Nosek, B. A., Wagenmakers, E.-J., Berk, R., Bollen, K. A., Brembs, B., Brown, L., Camerer, C., and Cesarini, D., et al. (2018), “Redefine Statistical Significance,” Nature Human Behaviour, 2, 6–10. DOI: 10.1038/s41562-017-0189-z.
- Chambers, C. (2019a), “The Registered Reports Revolution—Lessons in Cultural Reform,” Significance, 16, 23–27.
- Chambers, C. (2019b), “What’s Next for Registered Reports,” Nature, 573, 187–189.
- Copas, J. B. (1997), “Using Regression Models for Prediction: Shrinkage and Regression to the Mean,” Statistical Methods in Medical Research, 6, 167–183. DOI: 10.1177/096228029700600206.
- Deeks, J. J., Higgins, J. P., and Altman, D. G. (2019), “Analysing Data and Undertaking Meta-Analyses” (Chapter 10), in Cochrane Handbook for Systematic Reviews of Interventions, Chichester: Wiley, pp. 241–284.
- Gibson, E. W. (2020), “The Role of p-Values in Judging the Strength of Evidence and Realistic Replication Expectations,” Statistics in Biopharmaceutical Research, 1–13.
- Goodman, S. N. (1992), “A Comment on Replication, p-Values and Evidence,” Statistics in Medicine, 11, 875–879. DOI: 10.1002/sim.4780110705.
- Goodman, S. N. (2016), “Aligning Statistical and Scientific Reasoning,” Science, 352, 1180–1181.
- Goodman, S. N., Fanelli, D., and Ioannidis, J. P. A. (2016), “What Does Research Reproducibility Mean?,” Science Translational Medicine, 8, 341ps12. DOI: 10.1126/scitranslmed.aaf5027.
- Held, L. (2020a), “A New Standard for the Analysis and Design of Replication Studies” (with discussion), Journal of the Royal Statistical Society, Series A, 183, 431–469.
- Held, L. (2020b), “The Harmonic Mean χ2 Test to Substantiate Scientific Findings,” Journal of the Royal Statistical Society, Series C, 69, 697–708.
- Held, L., Micheloud, C., and Pawel, S. (2020), “The Assessment of Replication Success Based on Relative Effect Size,” Technical Report, arXiv no. 2009.07782.
- National Academies of Sciences, Engineering, and Medicine (2019), Reproducibility and Replicability in Science, Washington, DC: The National Academies Press.
- Neuenschwander, B., Roychoudhury, S., and Branson, M. (2018), “Predictive Evidence Threshold Scaling: Does the Evidence Meet a Confirmatory Standard?,” Statistics in Biopharmaceutical Research, 10, 76–84.
- Pawel, S. and Held, L. (2020), “Probabilistic Forecasting of Replication Studies,” PLOS ONE, 15, e0231416. DOI: 10.1371/journal.pone.0231416.
- Senn, S. (2007), Statistical Issues in Drug Development (2nd ed.), Chichester: Wiley.
- Simonsohn, U. (2015), “Small Telescopes: Detectability and the Evaluation of Replication Results,” Psychological Science, 26, 559–569. DOI: 10.1177/0956797614567341.