Why is a small sample size not enough?

Ying Cao¹, Ronald C Chen¹, Aaron J Katz^{1

2}

Affiliations

¹ Department of Radiation Oncology, University of Kansas Medical Center, Kansas City, KS, United States.
² Department of Population Health, University of Kansas Medical Center, Kansas City, KS, United States.

PMID: 38934301
PMCID: PMC11379640
DOI: 10.1093/oncolo/oyae162

Why is a small sample size not enough?

Ying Cao et al. Oncologist. 2024.

. 2024 Sep 6;29(9):761-763.

doi: 10.1093/oncolo/oyae162.

Authors

Ying Cao¹, Ronald C Chen¹, Aaron J Katz^{1

2}

Affiliations

¹ Department of Radiation Oncology, University of Kansas Medical Center, Kansas City, KS, United States.
² Department of Population Health, University of Kansas Medical Center, Kansas City, KS, United States.

PMID: 38934301
PMCID: PMC11379640
DOI: 10.1093/oncolo/oyae162

Abstract

Background: Clinical studies are often limited by resources available, which results in constraints on sample size. We use simulated data to illustrate study implications when the sample size is too small.

Methods and results: Using 2 theoretical populations each with N = 1000, we randomly sample 10 from each population and conduct a statistical comparison, to help make a conclusion about whether the 2 populations are different. This exercise is repeated for a total of 4 studies: 2 concluded that the 2 populations are statistically significantly different, while 2 showed no statistically significant difference.

Conclusions: Our simulated examples demonstrate that sample sizes play important roles in clinical research. The results and conclusions, in terms of estimates of means, medians, Pearson correlations, chi-square test, and P values, are unreliable with small samples.

Keywords: data analysis; random sample; sample size; simulation; statistics.

PubMed Disclaimer

Conflict of interest statement

The authors indicated no financial relationships.

Figures

**Figure 1.**
(A) Two random samples of n = 10 each were drawn from 2 normally distributed populations each with N = 1000. Population 1 has means 0 and SD 1, and population 2 has mean 0.5 and SD 0.5. (B-D) Images illustrate new random samples using the same methodology as in panel (A).

See this image and copyright information in PMC

References

1. Hastie T, Tibshirani R, Friedman J.. Chapter 18. In: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer Series in Statistics. Springer; 2016.
1. Halsey L, Curran-Everett D, Bowler S, et al.. The fickle P value generates irreproducible results. Nat Methods. 2015;12(3):179-185. - PubMed
1. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2022. https://www.R-project.org/

MeSH terms

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- PubMed Central
- Silverchair Information Systems

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Why is a small sample size not enough?

Affiliations

Why is a small sample size not enough?

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources