Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan;43(1):244-254.
doi: 10.1002/hbm.25154. Epub 2020 Aug 25.

Reproducibility in the absence of selective reporting: An illustration from large-scale brain asymmetry research

Collaborators, Affiliations

Reproducibility in the absence of selective reporting: An illustration from large-scale brain asymmetry research

Xiang-Zhen Kong et al. Hum Brain Mapp. 2022 Jan.

Abstract

The problem of poor reproducibility of scientific findings has received much attention over recent years, in a variety of fields including psychology and neuroscience. The problem has been partly attributed to publication bias and unwanted practices such as p-hacking. Low statistical power in individual studies is also understood to be an important factor. In a recent multisite collaborative study, we mapped brain anatomical left-right asymmetries for regional measures of surface area and cortical thickness, in 99 MRI datasets from around the world, for a total of over 17,000 participants. In the present study, we revisited these hemispheric effects from the perspective of reproducibility. Within each dataset, we considered that an effect had been reproduced when it matched the meta-analytic effect from the 98 other datasets, in terms of effect direction and significance threshold. In this sense, the results within each dataset were viewed as coming from separate studies in an "ideal publishing environment," that is, free from selective reporting and p hacking. We found an average reproducibility rate of 63.2% (SD = 22.9%, min = 22.2%, max = 97.0%). As expected, reproducibility was higher for larger effects and in larger datasets. Reproducibility was not obviously related to the age of participants, scanner field strength, FreeSurfer software version, cortical regional measurement reliability, or regional size. These findings constitute an empirical illustration of reproducibility in the absence of publication bias or p hacking, when assessing realistic biological effects in heterogeneous neuroscience data, and given typically-used sample sizes.

Keywords: P-hacking; multisite collaboration; publication bias; reproducibility; team science.

PubMed Disclaimer

Conflict of interest statement

Disclosures are listed in the Supporting Information Appendix.

Figures

FIGURE 1
FIGURE 1
Reproducibility in the absence of selective reporting, estimated based on outputs of the ENIGMA cortical asymmetry project. (a) Sample size distribution of the 99 datasets. (b) Effect size distribution of the 70 hemispheric effects of interest. (c) Reproducibility distribution of the 70 hemispheric effects. The reproducibility was assessed by comparing each dataset in turn to the meta‐analytic effect from the 98 others, to avoid overlap (see Methods). (d) Scatter plot of the correlation between the reproducibility and the effect size. (e) Relations of reproducibility, effect size and dataset sample size. Each line plots the mean and 95% confidence interval for reproducibility. We used the meta‐analytic effect size over all 99 datasets for visualization purposes. The figure key shows the types of cortical measure (orange indicates surface area; green indicates cortical thickness), as well as groupings by true effect sizes
FIGURE 2
FIGURE 2
Region‐wise effect sizes and reproducibility rates of hemispheric asymmetry effects. We used the meta‐analytic effect sizes over all 99 datasets for visualization purposes. Effect sizes are in Cohen's d
FIGURE 3
FIGURE 3
Scatter plots for the relationships between regional sizes, hemispheric asymmetry effect sizes, and reproducibility rates of the asymmetries. (a) Plots for region sizes. (b) Plots for measurement reliability (intraclass correlation coefficient, ICC). Effect sizes are in Cohen's d. Unit: thickness = mm, area = mm2, region size = mm2

References

    1. Aarts, A. A. , Anderson, J. E. , Anderson, C. J. , Attridge, P. R. , Attwood, A. , Axt, J. , … Collaboration, O.S . (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. - PubMed
    1. Baker, M. (2016). 1,500 scientists lift the lid on reproducibility. Nature, 533, 452–454. - PubMed
    1. Bakker, M. , van Dijk, A. , & Wicherts, J. M. (2012). The rules of the game called psychological science. Perspectives on Psychological Science, 7, 543–554. - PubMed
    1. Benjamin, D. J. , Berger, J. O. , Johannesson, M. , Nosek, B. A. , Wagenmakers, E. J. , Berk, R. , … Johnson, V. E. (2018). Redefine statistical significance. Nature Human Behaviour, 2, 6–10. - PubMed
    1. Boedhoe, P. S. W. , Heymans, M. W. , Schmaal, L. , Abe, Y. , Alonso, P. , Ameis, S. H. , … Twisk, J. W. R. (2018). An empirical comparison of meta‐ and mega‐analysis with data from the ENIGMA obsessive‐compulsive disorder working Group. Frontiers in Neuroinformatics, 12, 102. - PMC - PubMed

Publication types