The frequent insignificance of a "significant" p-value
- PMID: 34477260
- DOI: 10.1111/jocs.15960
The frequent insignificance of a "significant" p-value
Abstract
Null hypothesis significance testing (NHST) and p-values are widespread in the cardiac surgical literature but are frequently misunderstood and misused. The purpose of the review is to discuss major disadvantages of p-values and suggest alternatives. We describe diagnostic tests, the prosecutor's fallacy in the courtroom, and NHST, which involve inter-related conditional probabilities, to help clarify the meaning of p-values, and discuss the enormous sampling variability, or unreliability, of p-values. Finally, we use a cardiac surgical database and simulations to explore further issues involving p-values. In clinical studies, p-values provide a poor summary of the observed treatment effect, whereas the three-number summary provided by effect estimates and confidence intervals is more informative and minimizes over-interpretation of a "significant" result. p-values are an unreliable measure of the strength of evidence; if used at all they give only, at best, a very rough guide to decision making. Researchers should adopt Open Science practices to improve the trustworthiness of research and, where possible, use estimation (three-number summaries) or other better techniques.
Keywords: Bayes theorem; clinical review; null hypothesis; p values; replication; significance testing.
© 2021 Wiley Periodicals LLC.
References
REFERENCES
-
- Grunkemeier GL, Wu Y, Furnary AP. What is the value of a p value? Ann Thorac Surg. 2009;87:1337-1343.
-
- Baduashvili A, Evans AT, Cutler T. How to understand and teach P values: a diagnostic test framework. J Clin Epidemiol. 2020;122:49-55.
-
- Gigerenzer G. What are natural frequencies? BMJ. 2011;343:d6386. https://doi.org/10.1136/bmj.d6386
-
- Levitin D. A Field Guide to Lies and Statistics. Penguin Random House; 2016.
-
- Evett I, Weir B. Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. Sinaver Associates; 1998.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources