Big Data, Small Bias: Harmonizing Diffusion MRI-Based Structural Connectomes to Mitigate Site-Related Bias in Data Integration
- PMID: 40563239
- PMCID: PMC12198055
- DOI: 10.1002/hbm.70256
Big Data, Small Bias: Harmonizing Diffusion MRI-Based Structural Connectomes to Mitigate Site-Related Bias in Data Integration
Abstract
Diffusion MRI-based structural connectomes are increasingly used to investigate brain connectivity changes associated with various disorders. However, small sample sizes in individual studies, along with highly heterogeneous disorder-related manifestations, underscore the need to pool datasets across multiple studies to be able to identify coherent and generalizable connectivity patterns linked to these disorders. Yet, combining datasets introduces site-related differences due to variations in scanner hardware or acquisition protocols. These differences highlight the necessity for statistical data harmonization to mitigate site-related effects on structural connectomes while preserving the biological information associated with participant demographics and the disorders. While several paradigms exist for harmonizing normally distributed neuroimaging measures, this paper represents the first effort to establish a harmonization framework specifically tailored for the structural connectome. We conduct a thorough investigation of various statistical harmonization methods, adapting them to accommodate the unique distributional characteristics and graph-based properties of structural connectomes. Through rigorous evaluation, we show that our MATCH algorithm, based on the gamma-distributed model, consistently outperforms existing approaches in modeling structural connectomes, enabling the effective removal of site-related biases in both edge-based and downstream graph analyses while preserving biological variability. Two real-world applications further highlight the utility of our harmonization framework in addressing challenges in multi-site structural connectome analysis. Specifically, harmonization with MATCH enhances the generalizability of connectome-based machine learning predictors to new datasets and increases statistical power for detecting group-level differences. Our work provides essential guidelines for harmonizing multi-site structural connectomes, paving the way for more robust discoveries through collaborative research in the era of team science and big data.
Keywords: ComBat; CovBat; big data; diffusion MRI; gamma generalized linear model; harmonization; multi‐site analysis; structural connectome.
© 2025 The Author(s). Human Brain Mapping published by Wiley Periodicals LLC.
Conflict of interest statement
The authors declare the following potential conflicts of interest: Timothy P.L. Roberts holds stock in Prism Clinical Imaging, has a partnership interest in Proteus Neurodynamics, and has received consulting fees from Fieldline Inc. and WestCan Proton Therapy. Russell T. Shinohara has received consulting fees from Octave Bioscience and the American Medical Association. All other authors (Rui S. Shen, Drew Parker, Andrew A. Chen, Birkan Tunc, Benjamin E. Yerys, and Ragini Verma) report no conflicts of interest.
Figures







Similar articles
-
Harmonization of Structural Brain Connectivity Through Distribution Matching.Hum Brain Mapp. 2025 Jun 15;46(9):e70257. doi: 10.1002/hbm.70257. Hum Brain Mapp. 2025. PMID: 40545448 Free PMC article.
-
Riemannian frameworks for the harmonization of resting-state functional MRI scans.Med Image Anal. 2024 Jan;91:103043. doi: 10.1016/j.media.2023.103043. Epub 2023 Nov 25. Med Image Anal. 2024. PMID: 38029722 Free PMC article.
-
Harmonization of Structural Brain Connectivity through Distribution Matching.bioRxiv [Preprint]. 2025 May 8:2024.09.05.611489. doi: 10.1101/2024.09.05.611489. bioRxiv. 2025. Update in: Hum Brain Mapp. 2025 Jun 15;46(9):e70257. doi: 10.1002/hbm.70257. PMID: 39314357 Free PMC article. Updated. Preprint.
-
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2. Cochrane Database Syst Rev. 2018. PMID: 29357120 Free PMC article.
-
Home treatment for mental health problems: a systematic review.Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150. Health Technol Assess. 2001. PMID: 11532236
References
-
- Abraham, A. , Milham M. P., di Martino A., et al. 2017. “Deriving Reproducible Biomarkers From Multi‐Site Resting‐State Data: An Autism‐Based Example.” NeuroImage 147: 736–745. - PubMed
-
- Andersson, J. L. , Skare S., and Ashburner J.. 2003. “How to Correct Susceptibility Distortions in Spin‐Echo Echo‐Planar Images: Application to Diffusion Tensor Imaging.” Neuroimage 20, no. 2: 870–888. - PubMed
-
- Antunes, R. S. , da André Costa C., Küderle A., Yari I. A., and Eskofier B.. 2022. “Federated Learning for Healthcare: Systematic Review and Architecture Proposal.” ACM Transactions on Intelligent Systems and Technology 13, no. 4: 1–23.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources