Why is a small sample size not enough?
- PMID: 38934301
- PMCID: PMC11379640
- DOI: 10.1093/oncolo/oyae162
Why is a small sample size not enough?
Abstract
Background: Clinical studies are often limited by resources available, which results in constraints on sample size. We use simulated data to illustrate study implications when the sample size is too small.
Methods and results: Using 2 theoretical populations each with N = 1000, we randomly sample 10 from each population and conduct a statistical comparison, to help make a conclusion about whether the 2 populations are different. This exercise is repeated for a total of 4 studies: 2 concluded that the 2 populations are statistically significantly different, while 2 showed no statistically significant difference.
Conclusions: Our simulated examples demonstrate that sample sizes play important roles in clinical research. The results and conclusions, in terms of estimates of means, medians, Pearson correlations, chi-square test, and P values, are unreliable with small samples.
Keywords: data analysis; random sample; sample size; simulation; statistics.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
The authors indicated no financial relationships.
Figures
References
-
- Hastie T, Tibshirani R, Friedman J.. Chapter 18. In: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer Series in Statistics. Springer; 2016.
-
- Halsey L, Curran-Everett D, Bowler S, et al.. The fickle P value generates irreproducible results. Nat Methods. 2015;12(3):179-185. - PubMed
-
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2022. https://www.R-project.org/
MeSH terms
LinkOut - more resources
Full Text Sources
