A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note

Eshetu G Atenafu¹, Jemila S Hamid, Derek Stephens, Teresa To, Joseph Beyene

Affiliations

PMID: 19063996
DOI: 10.1016/j.cct.2008.11.005

Comparative Study

A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note

Eshetu G Atenafu et al. Contemp Clin Trials. 2009 Mar.

. 2009 Mar;30(2):155-7.

doi: 10.1016/j.cct.2008.11.005. Epub 2008 Dec 3.

Authors

Eshetu G Atenafu¹, Jemila S Hamid, Derek Stephens, Teresa To, Joseph Beyene

Affiliation

¹ Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Ontario, Canada.

PMID: 19063996
DOI: 10.1016/j.cct.2008.11.005

Abstract

Background: p-values are ubiquitous in medical research, but are often misunderstood. In addition to being misused or perhaps even abused at post-statistical analysis stage of making scientific inference and interpretations, p-values can also be a source of confusion at the design stage.

Methods: Application of standard test statistic on observed data may result in a small p-value which in turn may give the impression that a new study that has the same sample size as the observed data, perhaps even smaller, would have adequate power. We used re-sampling method and computed statistical power to illustrate the fallacy of this conclusion. We have also calculated power using analytical formulae.

Results: We analyzed data consisting of two group comparisons with binary as well as continuous outcome variables. For the binary outcome, the event rates for the outcome of interest in the illustrative data were 15/43 (35%) and 22/34 (65%), respectively (p-value=0.0093). Using these data, a bootstrap-based empirical power was estimated to be 75.4%. One random sample with only two-third of the original data had a p-value of 0.0066, but only an empirical power of 57.4%. Similar results were observed for a continuous outcome.

Conclusion: Our results show that the number of zeros after the decimal point in a p-value from an observed sample cannot and should not be used to gauge the adequacy of sample size for a future study that is expected to have sufficient power to detect an effect as big as the observed.

PubMed Disclaimer

Cited by

Bayesian evaluation of informative hypotheses in cluster-randomized trials.
Moerbeek M. Moerbeek M. Behav Res Methods. 2019 Feb;51(1):126-137. doi: 10.3758/s13428-018-1149-x. Behav Res Methods. 2019. PMID: 30350025 Free PMC article.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note

Affiliation

A small p-value from an observed data is not evidence of adequate power for future similar-sized studies: a cautionary note

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical