Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Nov;36(11):4322-4331.
doi: 10.1111/jocs.15960. Epub 2021 Sep 3.

The frequent insignificance of a "significant" p-value

Affiliations
Review

The frequent insignificance of a "significant" p-value

David C McGiffin et al. J Card Surg. 2021 Nov.

Abstract

Null hypothesis significance testing (NHST) and p-values are widespread in the cardiac surgical literature but are frequently misunderstood and misused. The purpose of the review is to discuss major disadvantages of p-values and suggest alternatives. We describe diagnostic tests, the prosecutor's fallacy in the courtroom, and NHST, which involve inter-related conditional probabilities, to help clarify the meaning of p-values, and discuss the enormous sampling variability, or unreliability, of p-values. Finally, we use a cardiac surgical database and simulations to explore further issues involving p-values. In clinical studies, p-values provide a poor summary of the observed treatment effect, whereas the three-number summary provided by effect estimates and confidence intervals is more informative and minimizes over-interpretation of a "significant" result. p-values are an unreliable measure of the strength of evidence; if used at all they give only, at best, a very rough guide to decision making. Researchers should adopt Open Science practices to improve the trustworthiness of research and, where possible, use estimation (three-number summaries) or other better techniques.

Keywords: Bayes theorem; clinical review; null hypothesis; p values; replication; significance testing.

PubMed Disclaimer

References

REFERENCES

    1. Grunkemeier GL, Wu Y, Furnary AP. What is the value of a p value? Ann Thorac Surg. 2009;87:1337-1343.
    1. Baduashvili A, Evans AT, Cutler T. How to understand and teach P values: a diagnostic test framework. J Clin Epidemiol. 2020;122:49-55.
    1. Gigerenzer G. What are natural frequencies? BMJ. 2011;343:d6386. https://doi.org/10.1136/bmj.d6386
    1. Levitin D. A Field Guide to Lies and Statistics. Penguin Random House; 2016.
    1. Evett I, Weir B. Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. Sinaver Associates; 1998.

LinkOut - more resources