Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 26;16(1):30.
doi: 10.1186/s13229-025-00663-3.

Better statistical reporting does not lead to statistical rigour: lessons from two decades of pseudoreplication in mouse-model studies of neurological disorders

Affiliations

Better statistical reporting does not lead to statistical rigour: lessons from two decades of pseudoreplication in mouse-model studies of neurological disorders

Constantinos Eleftheriou et al. Mol Autism. .

Abstract

Background: Accurately determining the sample size ("N") of a dataset is a key consideration for experimental design. Misidentification of sample size can lead to pseudoreplication, a process of artificially inflating the number of experimental replicates which systematically underestimates variability, overestimates effect sizes and invalidates statistical tests performed on the data. While many journals have adopted stringent requirements with regard to statistical reporting over the last decade, it remains unknown whether such efforts have had a meaningful impact on statistical rigour.

Methods: Here, we evaluated the prevalence of this type of statistical error among neuroscience studies involving animal models of Fragile-X Syndrome (FXS) and those using animal models of neurological disorders at large published between 2001 and 2024.

Results: We found that pseudoreplication was present in the majority of publication, increasing over time despite marked improvements in statistical reporting over the last decade. This trend generalised beyond the FXS literature to rodent studies of neurological disorders at large between 2012 and 2024, suggesting that pseudoreplication remains a widespread issue in the literature.

Limitations: The scope of this study was limited to rodent-model studies of neurological disorders which had the potential for being pseudoreplicated, by allowing repeat observations from individual animals. We did not consider reviews or articles whose experimental design could not allow for pseudoreplication, for example studies which reported only behavioural results, or studies which did not use inferential statistics.

Conclusions: These observations identify an urgent need for better standards in experimental design and increased vigilance for this type of error during peer review. While reporting standards have significantly improved over the past two decades, this alone has not been enough to curb the prevalence of pseudoreplication. We offer suggestions for how this can be remedied as well as quantifying the severity of this particular type of statistical error. Although the examined literature concerns a specific neuroscience-related area of research, the implications of pseudoreplication apply to all fields of empirical research.

Keywords: Animal models; Autism; Fragile X; Pseudoreplication; Statistics.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: Peter Kind is an Associate Editor for Molecular Autism. The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The prevalence of pseudoreplication in the Fragile-X mouse model literature has remained fairly constant despite marked improvements in statistical reporting. (A) Proportion of articles suspected of pseudoreplication in at least one figure (orange line), and proportion reporting adequate statistical details (green line) in articles sampled between 2001 and 2024. Each time-point shows the bootstrap resampled median and 95% percentile interval of the bootstrapped distribution. (B) Average percentage of articles suspected of pseudoreplication between 2001–2012 and 2013–2024. Whisker plots show median ± 95% CI of bootstrapped distribution per group. (C) Average percentage of articles reporting adequate statistical details between 2001–2012 and 2013–2024. (D) Probability of an article being suspected of pseudoreplication when reporting adequate (dark green) or inadequate (light green) statistical details between 2001–2012 and 2013–2024. (E) Median citation rate for articles where pseudoreplication was present (dark orange) or absent (light orange) between 2001–2012 and 2013–2024. (F) Median citation rate for articles reporting adequate (dark green) or inadequate (light green) statistical details between 2001–2012 and 2013–2024
Fig. 2
Fig. 2
The prevalence of pseudoreplication across all publications using animal models of neurological disorder has remained high despite improvements in statistical reporting. (A) Average percentage of articles suspected of pseudoreplication between 2001–2012 and 2013–2024. Whisker plots show median ± 95% CI of bootstrapped distribution per group. (B) Average percentage of articles reporting adequate statistical details between 2001–2012 and 2013–2024. (C) Probability of an article being suspected of pseudoreplication when reporting adequate (dark green) or inadequate (light green) statistical details between 2001–2012 and 2013–2024. (D) Median citation rate for articles where pseudoreplication was present (dark orange) or absent (light orange) between 2001–2012 and 2013–2024. (E) Median citation rate for articles reporting adequate (dark green) or inadequate (light green) statistical details between 2001–2012 and 2013–2024
Fig. 3
Fig. 3
A typical example of pseudoreplication and its consequences for Student’s t-test. (A) Many experimental designs use within-animal samples to draw conclusions about the effect of a gene or environmental condition. The total variability in an effect can be split into a within-animal component (between cells, in this case) and a between-animal component. The intra-class correlation coefficient, ρIC, is a measure of how these sources of variation are related. (B) Schematic of variability relationships between and within animals for low (left population) and high (right population) intra-class correlation. Note that animals in the population on the left have high variance between cells (within animal), whereas animals in the population on the right have low cell variance in any given animal. (C) Pseudoreplicating by considering within-animal replicates as experimental replicates inflates the true Type-I error rate (false positive rate). X indicates the example case given in the text. The curves show how the true Type-I error rate varies with the number of within animal replicates for commonly stipulated significance levels (5%, 1%, 0.1%) and for the possible range of between-animal replicates (solid curves = 2 animals, dotted curves = infinite animals). For all curves, ρIC is set to 0.5. (D) The combined effect of within-animal replicates and intra-class correlation (ρIC) on the Type-I error rate for a significance threshold of 5% in the presence of pseudoreplication. Between animal standard deviation is shown normalised to within-animal standard deviation for comparison with corresponding values of ρIC

Similar articles

References

    1. Gardenier J, Resnik D. The misuse of statistics: concepts, tools, and a research agenda. Account Res. 2002;9(2):65–74. - PubMed
    1. Hurlbert SH. Pseudoreplication and the design of ecological field experiments. Ecol Monogr. 1984;54(2):187–211.
    1. Lazic SE. The problem of pseudoreplication in neuroscientific studies: is it affecting your analysis? BMC Neurosci. 2010;11(1):5. - PMC - PubMed
    1. Forstmeier W, Wagenmakers EJ, Parker TH. Detecting and avoiding likely false-positive findings – a practical guide. Biol Rev. 2017;92(4):1941–68. - PubMed
    1. Ioannidis JPA. Why most published research findings are false. PLoS Med. 2005;2(8):e124. - PMC - PubMed

LinkOut - more resources