Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 24;16(2):e0243668.
doi: 10.1371/journal.pone.0243668. eCollection 2021.

Effect size, sample size and power of forced swim test assays in mice: Guidelines for investigators to optimize reproducibility

Affiliations

Effect size, sample size and power of forced swim test assays in mice: Guidelines for investigators to optimize reproducibility

Neil R Smalheiser et al. PLoS One. .

Abstract

A recent flood of publications has documented serious problems in scientific reproducibility, power, and reporting of biomedical articles, yet scientists persist in their usual practices. Why? We examined a popular and important preclinical assay, the Forced Swim Test (FST) in mice used to test putative antidepressants. Whether the mice were assayed in a naïve state vs. in a model of depression or stress, and whether the mice were given test agents vs. known antidepressants regarded as positive controls, the mean effect sizes seen in the experiments were indeed extremely large (1.5-2.5 in Cohen's d units); most of the experiments utilized 7-10 animals per group which did have adequate power to reliably detect effects of this magnitude. We propose that this may at least partially explain why investigators using the FST do not perceive intuitively that their experimental designs fall short-even though proper prospective design would require ~21-26 animals per group to detect, at a minimum, large effects (0.8 in Cohen's d units) when the true effect of a test agent is unknown. Our data provide explicit parameters and guidance for investigators seeking to carry out prospective power estimation for the FST. More generally, altering the real-life behavior of scientists in planning their experiments may require developing educational tools that allow them to actively visualize the inter-relationships among effect size, sample size, statistical power, and replicability in a direct and intuitive manner.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Mean immobility times of control groups carried out on naïve mice vs. depressive models (same data as in Table 3).
Fig 2
Fig 2. Effect sizes of known antidepressants across all assays (N = 29; see Table 1).

Similar articles

Cited by

References

    1. Ioannidis JP. Why most published research findings are false. PLoS Med. 2005. August;2(8):e124 10.1371/journal.pmed.0020124 - DOI - PMC - PubMed
    1. Ioannidis JP. How to make more published research true. PLoS Med. 2014. October 21;11(10):e1001747 10.1371/journal.pmed.1001747 - DOI - PMC - PubMed
    1. Higginson AD, Munafò MR. Current Incentives for Scientists Lead to Underpowered Studies with Erroneous Conclusions. PLoS Biol. 2016. November 10;14(11):e2000995 10.1371/journal.pbio.2000995 are wrong - DOI - PMC - PubMed
    1. Button KS, Ioannidis JP, Mokrysz C, Nosek BA, Flint J, Robinson ES, et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. 2013. May;14(5):365–76. 10.1038/nrn3475 - DOI - PubMed
    1. Curran-Everett D. Explorations in statistics: statistical facets of reproducibility. Adv Physiol Educ. 2016. June;40(2):248–52. 10.1152/advan.00042.2016 - DOI - PubMed

Publication types

Substances