Sci Rep. 2024 Oct 16;14(1):24289.

doi: 10.1038/s41598-024-72968-x.

Bridging big data in the ENIGMA consortium to combine non-equivalent cognitive measures

Eamonn Kennedy^{1,2,3}, Shashank Vadlamani⁴, Hannah M Lindsey^{4,5}, Pui-Wa Lei⁶, Mary Jo-Pugh^{4,7}, Paul M Thompson^{8,9}, David F Tate^{4,5}, Frank G Hillary^{10,11,12}, Emily L Dennis^{4,5}, Elisabeth A Wilde^{4,5}; ENIGMA Clinical Endpoints Working Group

Collaborators

ENIGMA Clinical Endpoints Working Group:
Maheen Adamson, Martin Alda, Silvia Alonso-Lana, Sonia Ambrogi, Tim J Anderson, Celso Arango, Robert F Asarnow, Mihai Avram, Rosa Ayesa-Arriola, Talin Babikian, Nerisa Banaj, Laura J Bird, Stefan Borgwardt, Amy Brodtmann, Katharina Brosch, Karen Caeyenberghs, Vince D Calhoun, Nancy D Chiaravalloti, David X Cifu, Benedicto Crespo-Facorro, John C Dalrymple-Alford, Kristen Dams-O'Connor, Udo Dannlowski, David Darby, Nicholas Davenport, John DeLuca, Covadonga M Diaz-Caneja, Seth G Disner, Ekaterina Dobryakova, Stefan Ehrlich, Carrie Esopenko, Fabio Ferrarelli, Lea E Frank, Carol Franz, Paola Fuentes-Claramonte, Helen Genova, Christopher C Giza, Janik Goltermann, Dominik Grotegerd, Marius Gruber, Alfonso Gutierrez-Zotes, Minji Ha, Jan Haavik, Charles Hinkin, Kristen R Hoskinson, Daniela Hubl, Andrei Irimia, Andreas Jansen, Michael Kaess, Xiaojian Kang, Kimbra Kenney, Barbora Keřková, Mohamed Salah Khlif, Minah Kim, Jochen Kindler, Tilo Kircher, Karolina Knížková, Knut K Kolskår, Denise Krch, William S Kremen, Taylor Kuhn, Veena Kumari, Jun Soo Kwon, Roberto Langella, Sarah Laskowitz, Jungha Lee, Jean Lengenfelder, Spencer W Liebel, Victoria Liou-Johnson, Sara M Lippa, Marianne Løvstad, Astri Lundervold, Cassandra Marotta, Craig A Marquardt, Paulo Mattos, Ahmad Mayeli, Carrie R McDonald, Susanne Meinert, Tracy R Melzer, Jessica Merchán-Naranjo, Chantal Michel, Rajendra A Morey, Benson Mwangi, Daniel J Myall, Igor Nenadić, Mary R Newsome, Abraham Nunes, Terence O'Brien, Viola Oertel, John Ollinger, Alexander Olsen, Victor Ortiz García de la Foz, Mustafa Ozmen, Heath Pardoe, Marise Parent, Fabrizio Piras, Federica Piras, Edith Pomarol-Clotet, Jonathan Repple, Geneviève Richard, Jonathan Rodriguez, Mabel Rodriguez, Kelly Rootes-Murdy, Jared Rowland, Nicholas P Ryan, Raymond Salvador, Anne-Marthe Sanders, Andre Schmidt, Jair C Soares, Gianfranco Spalleta, Filip Španiel, Alena Stasenko, Frederike Stein, Benjamin Straube, April Thames, Florian Thomas-Odenthal, Sophia I Thomopoulos, Erin Tone, Ivan Torres, Maya Troyanskaya, Jessica A Turner, Kristine M Ulrichsen, Guillermo Umpierrez, Elisabet Vilella, Lucy Vivash, William C Walker, Emilio Werden, Lars T Westlye, Krista Wild, Adrian Wroblewski, Mon-Ju Wu, Glenn R Wylie, Lakshmi N Yatham, Giovana B Zunta-Soares

Affiliations

¹ Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA. eamonn.kennedy@utah.edu.
² Division of Epidemiology, University of Utah, Salt Lake City, UT, USA. eamonn.kennedy@utah.edu.
³ George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA. eamonn.kennedy@utah.edu.
⁴ Department of Neurology, University of Utah School of Medicine, Salt Lake City, UT, USA.
⁵ George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT, USA.
⁶ Department of Educational Psychology, Counseling, and Special Education, Pennsylvania State University, University Park, PA, USA.
⁷ Division of Epidemiology, University of Utah, Salt Lake City, UT, USA.
⁸ Imaging Genetics Center, Stevens Neuroimaging & Informatics Institute, Keck School of Medicine of USC, Marina del Rey, CA, USA.
⁹ Departments of Neurology, Pediatrics, Psychiatry, Radiology, Engineering, and Ophthalmology, USC, Los Angeles, CA, USA.
¹⁰ Department of Psychology, Penn State University, State College, PA, USA.
¹¹ Department of Neurology, Hershey Medical Center, State College, PA, USA.
¹² Social Life and Engineering Science Imaging Center, Penn State University, State College, PA, USA.

PMID: 39414844
PMCID: PMC11484938
DOI: 10.1038/s41598-024-72968-x

Eamonn Kennedy et al. Sci Rep. 2024.

Abstract

Investigators in neuroscience have turned to Big Data to address replication and reliability issues by increasing sample size. These efforts raise new questions about how to integrate data across distinct sources and instruments. The goal of this study was to link scores across common auditory verbal learning tasks (AVLTs). This international secondary analysis aggregated multisite raw data for AVLTs across 53 studies totaling 10,505 individuals. Using the ComBat-GAM algorithm, we isolated and removed the component of memory scores associated with site effects while preserving instrumental effects. After adjustment, a continuous item response theory model used multiple memory items of varying difficulty to estimate each individual's latent verbal learning ability on a single scale. Equivalent raw scores across AVLTs were then found by linking individuals through the ability scale. Harmonization reduced total cross-site score variance by 37% while preserving meaningful memory effects. Age had the largest impact on scores overall (-11.4%), while the race/ethnicity variable was not significant (p > 0.05). The resulting tools were validated on dually administered tests. The conversion tool is available online so researchers and clinicians can convert memory scores across instruments. This work demonstrates that global harmonization initiatives can address reproducibility challenges across the behavioral sciences.

Keywords: Harmonization; Item response theory; Mega analysis; Traumatic brain injury; Verbal learning.
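
The linking step described in the abstract (finding equivalent raw scores by passing through a shared ability scale) can be illustrated with a minimal sketch. It assumes, as the Fig. 4 caption suggests, that the expected score on each test is approximated by a cubic polynomial in ability. The coefficients, the test labels "A" and "B", the ability range, and the function name `crosswalk` are all hypothetical and chosen only for illustration; they are not values or code from the study.

```python
import numpy as np

# Hypothetical cubic coefficients (highest power first) giving the expected
# raw score on each test as a function of latent ability theta.
# These numbers are invented for illustration, not taken from the study.
COEF_A = np.array([-0.3, 0.0, 9.0, 45.0])  # illustrative "test A" curve
COEF_B = np.array([-0.2, 0.0, 7.0, 50.0])  # illustrative "test B" curve


def crosswalk(score, coef_from, coef_to, theta_range=(-3.0, 3.0), n_grid=601):
    """Convert a raw score on one test to the equivalent score on another.

    The source curve is inverted numerically on a grid to find the ability
    that yields `score`; the target curve is then evaluated at that ability.
    """
    theta = np.linspace(theta_range[0], theta_range[1], n_grid)
    curve_from = np.polyval(coef_from, theta)
    # Inversion by interpolation is only valid if the curve is increasing.
    if np.any(np.diff(curve_from) <= 0):
        raise ValueError("source curve must be strictly increasing in ability")
    # np.interp clamps scores outside the curve's range to the grid end points.
    theta_hat = np.interp(score, curve_from, theta)
    return np.polyval(coef_to, theta_hat)


if __name__ == "__main__":
    for raw in (35.0, 45.0, 55.0):
        print(raw, "->", float(crosswalk(raw, COEF_A, COEF_B)))
```

Inverting on a grid avoids solving the cubic analytically and makes the monotonicity requirement explicit; a crosswalk table would amount to evaluating this function over every attainable raw score.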

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1**
Comparing multisite total-of-trials scores before and after ComBat harmonization and adjustment for (a) CVLT, (b) RAVLT, and (c) HVLT. Results are sorted by median score per study. Variation in site medians was reduced after harmonization. Full details for all sites are available in Supplementary Fig. S1.

**Fig. 2**
Comparing proportions of memory items recalled before and after harmonization. Mean scores for each site (dots) are shown broken out by instrument (color) and item (Top: Trial 1 immediate free recall, Middle: Total sum of all Trials, Bottom: Long-delay free recall scores).

**Fig. 3**
Visualizing covariate effects on covariate-unadjusted, harmonized scores. (a) Boxplots of scores stratified by group (TBI vs. control) and sex/gender indicated that males and those with a history of TBI had significantly lower scores on average. Age-related declines (b) and the beneficial effects of education (c) on scores were consistent across all AVLTs.

**Fig. 4**
Visualizing and Validating Conversions. (a) Average scores as a function of individual ability are shown approximated as cubic polynomial fits for immediate, short, and long delay trials. Scores shown are not normed or T-scored. Horizontal lines of equivalent ability connect equivalent scores across tests, which facilitates the construction of crosswalks. (b) Scatter plot and fit to the sum of learning Trial scores for a subset of cases who were administered both the CVLT and RAVLT (n = 36). The confidence area of the dually assessed data is shown in blue and agrees with the derived crosswalk for CVLT → RAVLT (n = 9362, black dotted line).

**Fig. 5**
The distribution of unadjusted ability scores is shown for each site, ranked by ability and color-coded by median age per site.
