A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note
- PMID: 19063996
- DOI: 10.1016/j.cct.2008.11.005
Abstract
Background: p-values are ubiquitous in medical research but are often misunderstood. In addition to being misused, or perhaps even abused, at the post-analysis stage of scientific inference and interpretation, p-values can also be a source of confusion at the design stage.
Methods: Applying a standard test statistic to observed data may yield a small p-value, which in turn may give the impression that a new study with the same sample size as the observed data, or perhaps even a smaller one, would have adequate power. We used a re-sampling method to compute statistical power and illustrate the fallacy of this conclusion. We also calculated power using analytical formulae.
Results: We analyzed data from two-group comparisons with binary as well as continuous outcome variables. For the binary outcome, the event rates in the two groups of the illustrative data were 15/43 (35%) and 22/34 (65%), respectively (p-value=0.0093). Using these data, the bootstrap-based empirical power was estimated to be 75.4%. One random sample containing only two-thirds of the original data had a p-value of 0.0066 but an empirical power of only 57.4%. Similar results were observed for the continuous outcome.
Conclusion: Our results show that the number of zeros after the decimal point in a p-value from an observed sample cannot and should not be used to gauge the adequacy of the sample size for a future study that is expected to have sufficient power to detect an effect as large as the one observed.
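The re-sampling idea described in the Methods can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' actual procedure: the abstract does not name the test or the bootstrap scheme, so the sketch assumes a pooled two-proportion z-test (equivalent to Pearson's chi-square without continuity correction), resampling with replacement within each group, a two-sided significance level of 0.05, and hypothetical function names `two_proportion_p_value` and `bootstrap_power`. Under these assumptions the p-value for 15/43 versus 22/34 comes out close to the reported 0.0093; the estimated power depends on the seed and the number of replicates and need not match the reported 75.4% exactly.

```python
import math
import random


def two_proportion_p_value(x1, n1, x2, n2):
    """Two-sided p-value from the pooled two-proportion z-test.

    Assumption: this test stands in for the abstract's unspecified
    "standard test statistic"; it is equivalent to Pearson's chi-square
    test without continuity correction.
    """
    pooled = (x1 + x2) / (n1 + n2)
    if pooled in (0.0, 1.0):
        # No variation in the outcome: the test cannot reject.
        return 1.0
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (x1 / n1 - x2 / n2) / se
    # For a standard normal Z, P(|Z| > |z|) = erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))


def bootstrap_power(x1, n1, x2, n2, alpha=0.05, n_boot=5000, seed=0):
    """Share of bootstrap replicates (same group sizes) with p < alpha.

    Assumption: each group is resampled with replacement independently,
    keeping the original group sizes n1 and n2.
    """
    rng = random.Random(seed)
    group1 = [1] * x1 + [0] * (n1 - x1)
    group2 = [1] * x2 + [0] * (n2 - x2)
    rejections = 0
    for _ in range(n_boot):
        b1 = sum(rng.choices(group1, k=n1))  # events in resampled group 1
        b2 = sum(rng.choices(group2, k=n2))  # events in resampled group 2
        if two_proportion_p_value(b1, n1, b2, n2) < alpha:
            rejections += 1
    return rejections / n_boot


if __name__ == "__main__":
    p_obs = two_proportion_p_value(15, 43, 22, 34)
    power = bootstrap_power(15, 43, 22, 34)
    print(f"observed p-value: {p_obs:.4f}")
    print(f"bootstrap empirical power: {power:.3f}")
```

The point of the sketch is the gap between the two printed numbers: a p-value well below 0.01 coexists with an estimated power noticeably short of the conventional 80% or 90% targets for a repeat study of the same size.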