Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 6;29(9):761-763.
doi: 10.1093/oncolo/oyae162.

Why is a small sample size not enough?

Affiliations

Why is a small sample size not enough?

Ying Cao et al. Oncologist. .

Abstract

Background: Clinical studies are often limited by resources available, which results in constraints on sample size. We use simulated data to illustrate study implications when the sample size is too small.

Methods and results: Using 2 theoretical populations each with N = 1000, we randomly sample 10 from each population and conduct a statistical comparison, to help make a conclusion about whether the 2 populations are different. This exercise is repeated for a total of 4 studies: 2 concluded that the 2 populations are statistically significantly different, while 2 showed no statistically significant difference.

Conclusions: Our simulated examples demonstrate that sample sizes play important roles in clinical research. The results and conclusions, in terms of estimates of means, medians, Pearson correlations, chi-square test, and P values, are unreliable with small samples.

Keywords: data analysis; random sample; sample size; simulation; statistics.

PubMed Disclaimer

Conflict of interest statement

The authors indicated no financial relationships.

Figures

Figure 1.
Figure 1.
(A) Two random samples of n = 10 each were drawn from 2 normally distributed populations each with N = 1000. Population 1 has means 0 and SD 1, and population 2 has mean 0.5 and SD 0.5. (B-D) Images illustrate new random samples using the same methodology as in panel (A).

References

    1. Hastie T, Tibshirani R, Friedman J.. Chapter 18. In: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer Series in Statistics. Springer; 2016.
    1. Halsey L, Curran-Everett D, Bowler S, et al.. The fickle P value generates irreproducible results. Nat Methods. 2015;12(3):179-185. - PubMed
    1. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2022. https://www.R-project.org/

LinkOut - more resources