Search in:

The American Statistician Volume 73, 2019 - Issue sup1: Statistical Inference in the 21st Century: A World Beyond p < 0.05

Submit an article Journal homepage

Open access

5,884

Views

CrossRef citations to date

Altmetric

Reforming Institutions: Changing Publication Policies and Statistical Education

The World of Research Has Gone Berserk: Modeling the Consequences of Requiring “Greater Statistical Stringency” for Scientific Publication

Harlan CampbellDepartment of Statistics, University of British Columbia, Vancouver, CanadaCorrespondence[email protected]

Paul GustafsonDepartment of Statistics, University of British Columbia, Vancouver, Canada

Pages 358-373 | Received 15 Mar 2018, Accepted 23 Nov 2018, Published online: 20 Mar 2019

Cite this article
https://doi.org/10.1080/00031305.2018.1555101
CrossMark

Sample our Mathematics & Statistics journals, sign in here to start your FREE access for 14 days

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

ABSTRACT

In response to growing concern about the reliability and reproducibility of published science, researchers have proposed adopting measures of “greater statistical stringency,” including suggestions to require larger sample sizes and to lower the highly criticized “p < 0.05” significance threshold. While pros and cons are vigorously debated, there has been little to no modeling of how adopting these measures might affect what type of science is published. In this article, we develop a novel optimality model that, given current incentives to publish, predicts a researcher’s most rational use of resources in terms of the number of studies to undertake, the statistical power to devote to each study, and the desirable prestudy odds to pursue. We then develop a methodology that allows one to estimate the reliability of published research by considering a distribution of preferred research strategies. Using this approach, we investigate the merits of adopting measures of “greater statistical stringency” with the goal of informing the ongoing debate.

KEYWORDS:

Meta-research
Null hypothesis significance Testing
Publication
Reliability
Reproducibility
Statistical power

Acknowledgments

We wish to gratefully acknowledge Prof. Will Welch and Prof. John Petkau for their valuable suggestions and advice. Furthermore, we wish to acknowledge funding from the Natural Sciences and Engineering Research Council of Canada (NSERC grant number RGPIN 183772-13).

Related Research Data

Why most discovered true associations are inflated.

Source: Ovid Technologies (Wolters Kluwer Health)

THE SIMPLE ECONOMICS OF RESEARCH PORTFOLIOS

Source: Oxford University Press (OUP)

How Often Do They Really Occur?

Source: SAGE Publications

Negative results are disappearing from most disciplines and countries

Source: Springer Science and Business Media LLC

How should we rate research? Counting number of publications may be best research performance measure.

Source: BMJ

Is Most Published Research Really False

Source: Annual Reviews

Redefine statistical significance

Source: Springer Science and Business Media LLC

The tyranny of power: is there a better way to calculate sample size?

Source: BMJ

How should novelty be valued in science

Source: eLife Sciences Publications Ltd

Assessing Type S (Sign) and Type M (Magnitude) Errors

Source: SAGE Publications

Sample size calculations in randomised trials: mandatory and mystical

Source: Elsevier BV

A Missing Pillar of Scientific Culture

Source: Hogrefe Publishing Group

Why Selective Publication of Statistically Significant Results Can Be Effective

Source: Public Library of Science

Consequences of Prejudice Against the Null Hypothesis

Source: American Psychological Association (APA)

Drug development: Raise standards for preclinical cancer research

Source: Springer Science and Business Media LLC

Why most published research findings are false: problems in the analysis

Source: Public Library of Science (PLoS)

Measuring the effectiveness of scientific gatekeeping

Source: Proceedings of the National Academy of Sciences

Freewheelin’ scientists: citing Bob Dylan in the biomedical literature

Source: BMJ

Low Power and Striking Results — A Surprise but Not a Paradox

Source: Massachusetts Medical Society

Moving to a World Beyond “p < 0.05”

Source: Informa UK Limited

Confusion Over Measures of Evidence (p's) Versus Errors (α's) in Classical Statistical Testing

Source: Informa UK Limited

Justify your alpha

Source: Apollo - University of Cambridge Repository

The statistical power of abnormal-social psychological research: a review.

Source: American Psychological Association (APA)

Estimating effect size: Bias resulting from the significance criterion in editorial decisions

Source: Wiley

Publication metrics and success on the academic job market

Source: Elsevier Ltd.

Exploring Small, Confirming Big: An alternative system to The New Statistics for advancing cumulative and replicable psychological research

Source: Elsevier BV

Reply to Gelman, Gaudart, Pericchi: More reasons to revise standards for statistical evidence

Source: Proceedings of the National Academy of Sciences

Trust in numbers

Source: Wiley

Conditional equivalence testing: An alternative remedy for publication bias

Source: Public Library of Science (PLoS)

In pursuit of resistance: pragmatic recommendations for doing science within one’s means

Source: Springer Science and Business Media LLC

A comprehensive review of reporting practices in psychological journals: Are effect sizes really enough?:

Source: SAGE Publications

Explicación del tamaño muestral empleado: una exigencia irracional de las revistas biomédicas

Source: Sociedad Española de Salud Pública y Administración Sanitaria (SESPAS)

Improving the Standard for Basic and Preclinical Research

Source: Ovid Technologies (Wolters Kluwer Health)

A Powerful Nudge? Presenting Calculable Consequences of Underpowered Research Shifts Incentives Toward Adequately Powered Designs

Source: SAGE Publications

The Prior Odds of Testing a True Effect in Cognitive and Social Psychology

Source: SAGE Publications

The continuing unethical conduct of underpowered clinical trials.

Source: American Medical Association (AMA)

The Economics of Reproducibility in Preclinical Research

Source: Public Library of Science (PLoS)

What Constitutes Strong Psychological Science? The (Neglected) Role of Diagnosticity and A Priori Theorizing.

Source: SAGE Publications

The N-pact factor: evaluating the quality of empirical journals with respect to sample size and statistical power.

Source: Public Library of Science (PLoS)

Lowering the P Value Threshold.

Source: American Medical Association (AMA)

Registered Reports: Realigning incentives in scientific publishing

Source: Elsevier Masson

Most published research findings are false-but a little replication goes a long way.

Source: Public Library of Science (PLoS)

Misunderstanding publication bias: editors are not blameless after all.

Source: (:unav)

The earth is flat (p < 0.05): significance thresholds and the crisis of unreplicable research

Source: PeerJ

One hundred years of social psychology quantitatively described

Source: SAGE Publications

Why publishing everything is more effective than selective publishing of statistically significant results

Source: Public Library of Science (PLoS)

Justifying small-n research in scientifically amazing settings: challenging the notion that only "big-n" studies are worthwhile.

Source: American Physiological Society

Current incentives for scientists lead to underpowered studies with erroneous conclusions

Source: Public Library of Science (PLoS)

Biostatistics series module 5: Determining sample size

Source: Wolters Kluwer Medknow Publications

Evaluating replicability of laboratory experiments in economics

Source: American Association for the Advancement of Science

Publication decisions revisited: the effect of the outcome of statistical tests on the decision to publish and vice versa

Source: Informa UK Limited

Statistical reporting deficiencies in ecotoxicology

Source: Wiley

The Burden of the "False-Negatives" in Clinical Development: Analyses of Current and Alternative Scenarios and Corrective Measures.

Source: Wiley

Social Biases and Solutions for Procedural Objectivity

Source: Cambridge University Press (CUP)

Could It Be Better to Discard 90% of the Data? A Statistical Paradox

Source: Informa UK Limited

Deep impact: unintended consequences of journal rank

Source: Frontiers Media S.A.

Obtaining evidence by a single well-powered trial or several modestly powered trials:

Source: SAGE Publications

Empirical assessment of published effect sizes and power in the recent cognitive neuroscience and psychology literature

Source: Cold Spring Harbor Laboratory

Linking provided by

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

The World of Research Has Gone Berserk: Modeling the Consequences of Requiring “Greater Statistical Stringency” for Scientific Publication

Related Research Data

Information for

Open access

Opportunities

Help and information

The World of Research Has Gone Berserk: Modeling the Consequences of Requiring “Greater Statistical Stringency” for Scientific Publication

ABSTRACT

Acknowledgments

Related Research Data

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature