Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations
- PMID: 27209009
- PMCID: PMC4877414
- DOI: 10.1007/s10654-016-0149-3
Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations
Abstract
Misinterpretation and abuse of statistical tests, confidence intervals, and statistical power have been decried for decades, yet remain rampant. A key problem is that there are no interpretations of these concepts that are at once simple, intuitive, correct, and foolproof. Instead, correct use and interpretation of these statistics requires an attention to detail which seems to tax the patience of working scientists. This high cognitive demand has led to an epidemic of shortcut definitions and interpretations that are simply wrong, sometimes disastrously so-and yet these misinterpretations dominate much of the scientific literature. In light of this problem, we provide definitions and a discussion of basic statistics that are more general and critical than typically found in traditional introductory expositions. Our goal is to provide a resource for instructors, researchers, and consumers of statistics whose knowledge of statistical theory and technique may be limited but who wish to avoid and spot misinterpretations. We emphasize how violation of often unstated analysis protocols (such as selecting analyses for presentation based on the P values they produce) can lead to small P values even if the declared test hypothesis is correct, and can lead to large P values even if that hypothesis is incorrect. We then provide an explanatory list of 25 misinterpretations of P values, confidence intervals, and power. We conclude with guidelines for improving statistical interpretation and reporting.
Keywords: Confidence intervals; Hypothesis testing; Null testing; P value; Power; Significance tests; Statistical testing.
Comment in
-
Disengaging from statistical significance.Eur J Epidemiol. 2016 May;31(5):443-4. doi: 10.1007/s10654-016-0158-2. Epub 2016 Jun 7. Eur J Epidemiol. 2016. PMID: 27272951 No abstract available.
-
Recommendations on the use and nonuse of the p value in biomedical research.Am J Health Syst Pharm. 2017 Aug 15;74(16):1262-1266. doi: 10.2146/ajhp160443. Am J Health Syst Pharm. 2017. PMID: 28790078 No abstract available.
References
-
- Trafimow D, Marks M. Editorial. Basic Appl Soc Psychol. 2015;37:1–2. doi: 10.1080/01973533.2015.1012991. - DOI
-
- Ashworth A. Veto on the use of null hypothesis testing and p intervals: right or wrong? Taylor & Francis Editor. 2015. Resources online, http://editorresources.taylorandfrancisgroup.com/veto-on-the-use-of-null.... Accessed 27 Feb 2016.
-
- Flanagan O. Journal’s ban on null hypothesis significance testing: reactions from the statistical arena. 2015. Stats Life online, https://www.statslife.org.uk/opinion/2114-journal-s-ban-on-null-hypothes.... Accessed 27 Feb 2016.
-
- Altman DG, Machin D, Bryant TN, Gardner MJ, editors. Statistics with confidence. 2. London: BMJ Books; 2000.