Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 May 15:9:699.
doi: 10.3389/fpsyg.2018.00699. eCollection 2018.

Manipulating the Alpha Level Cannot Cure Significance Testing

Affiliations
Review

Manipulating the Alpha Level Cannot Cure Significance Testing

David Trafimow et al. Front Psychol. .

Abstract

We argue that making accept/reject decisions on scientific hypotheses, including a recent call for changing the canonical alpha level from p = 0.05 to p = 0.005, is deleterious for the finding of new discoveries and the progress of science. Given that blanket and variable alpha levels both are problematic, it is sensible to dispense with significance testing altogether. There are alternatives that address study design and sample size much more directly than significance testing does; but none of the statistical tools should be taken as the new magic method giving clear-cut mechanical answers. Inference should not be based on single studies at all, but on cumulative evidence from multiple independent studies. When evaluating the strength of the evidence, we should consider, for example, auxiliary assumptions, the strength of the experimental design, and implications for applications. To boil all this down to a binary decision based on a p-value threshold of 0.05, 0.01, 0.005, or anything else, is not acceptable.

Keywords: decision making; null hypothesis testing; p-value; significance testing; statistical significance.

PubMed Disclaimer

References

    1. Amrhein V., Greenland S. (2018). Remove, rather than redefine, statistical significance. Nat. Hum. Behav. 2:4 10.1038/s41562-017-0224-0 - DOI - PubMed
    1. Amrhein V., Korner-Nievergelt F., Roth T. (2017). The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research. PeerJ. 5:e3544. 10.7717/peerj.3544 - DOI - PMC - PubMed
    1. Amrhein V., Trafimow D., Greenland S. (2018). Abandon statistical inference. PeerJ Preprints 6:e26857v1. 10.7287/peerj.preprints.26857v1 - DOI
    1. Balluerka N., Gómez J., Hidalgo D. (2005). The controversy over null hypothesis significance testing revisited. Methodology 1, 55–77. 10.1027/1614-1881.1.2.55 - DOI
    1. Benjamin D. J., Berger J. O., Johannesson M., Nosek B. A., Wagenmakers E.-J., Berk R., et al. (2018). Redefine statistical significance. Nat. Hum. Behav. 2, 6–10. 10.1038/s41562-017-0189-z - DOI - PubMed