Eur J Epidemiol. 2010 Apr;25(4):225-30.
doi: 10.1007/s10654-010-9440-x. Epub 2010 Mar 26.

The ongoing tyranny of statistical significance testing in biomedical research

Andreas Stang et al. Eur J Epidemiol. 2010 Apr.

Abstract

Since its introduction into the biomedical literature, statistical significance testing (SST) has caused much debate. The aim of this perspective article is to review frequent fallacies and misuses of SST in the biomedical field and to review a potential way out of them. Two frequentist schools of statistical inference merged to form SST as it is practised today: the Fisher school and the Neyman-Pearson school. The P-value is both reported quantitatively and checked against the alpha level to produce a qualitative, dichotomous measure (significant/nonsignificant). However, a P-value mixes the estimated effect size with its estimated precision, and a single number cannot measure both. Valid interpretation of SSTs requires that a variety of assumptions and requirements be met. We point here to four of them: study size, correct statistical model, correct causal model, and absence of bias and confounding. It has been stated that the P-value is perhaps the most misunderstood statistical concept in clinical research. As in the social sciences, the tyranny of SST is still highly prevalent in the biomedical literature, even after decades of warnings against it. The ubiquitous misuse and tyranny of SST threaten scientific discoveries and may even impede scientific progress. In the worst case, misuse of significance testing may harm patients who are ultimately treated incorrectly because of improper handling of P-values. Proper interpretation of study results requires both the estimated effect size and its estimated precision.
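
The point that a P-value mixes effect size with precision can be illustrated with a small numerical sketch. The two "studies" below are hypothetical numbers invented for illustration, not data from the article, and the calculation assumes a simple normal (Wald) approximation with a null hypothesis of zero effect. The function name `summarize` is likewise an illustrative choice.

```python
from math import erfc, sqrt
from statistics import NormalDist


def summarize(estimate, std_error, level=0.95):
    """Return (two-sided P-value, CI lower, CI upper).

    Assumes a normal approximation and a null hypothesis of zero effect.
    """
    z = estimate / std_error
    p_value = erfc(abs(z) / sqrt(2))  # two-sided tail area of the standard normal
    z_crit = NormalDist().inv_cdf(0.5 + level / 2)
    return p_value, estimate - z_crit * std_error, estimate + z_crit * std_error


# Hypothetical studies (illustrative numbers only):
# a small study with a large, imprecise estimate and
# a large study with a small, precise estimate.
small_study = summarize(estimate=10.0, std_error=5.0)
large_study = summarize(estimate=1.0, std_error=0.5)

for name, (p, lower, upper) in [("small study", small_study),
                                ("large study", large_study)]:
    print(f"{name}: P = {p:.4f}, 95% CI = ({lower:.2f}, {upper:.2f})")
```

Both hypothetical studies have the same ratio of estimate to standard error, so they yield the same P-value, yet their point estimates differ tenfold and their confidence intervals differ greatly in width. Reporting the estimate together with its interval keeps the two quantities apart.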
