Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar:248:118822.
doi: 10.1016/j.neuroimage.2021.118822. Epub 2021 Dec 25.

Privacy-preserving harmonization via distributed ComBat

Affiliations

Privacy-preserving harmonization via distributed ComBat

Andrew A Chen et al. Neuroimage. 2022 Mar.

Abstract

Challenges in clinical data sharing and the need to protect data privacy have led to the development and popularization of methods that do not require directly transferring patient data. In neuroimaging, integration of data across multiple institutions also introduces unwanted biases driven by scanner differences. These scanner effects have been shown by several research groups to severely affect downstream analyses. To facilitate the need of removing scanner effects in a distributed data setting, we introduce distributed ComBat, an adaptation of a popular harmonization method for multivariate data that borrows information across features. We present our fast and simple distributed algorithm and show that it yields equivalent results using data from the Alzheimer's Disease Neuroimaging Initiative. Our method enables harmonization while ensuring maximal privacy protection, thus facilitating a broad range of downstream analyses in functional and structural imaging studies.

Keywords: ComBat; Distributed analysis; Harmonization; Privacy-preserving; Site effect.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare no competing interests.

Figures

Fig. 1.
Fig. 1.. Distributed ComBat illustration.
The procedure to perform distributed ComBat harmonization is outlined as follows. a, Each site sends its deidentified summary statistics to a central site for estimation of regression coefficients which are then passed back to the sites. b, Each site sends summary statistics to a central site for estimation of the population variance which is then passed back to the sites. c, The sites can then use the global regression coefficients and variance estimates to perform the remaining ComBat steps and obtain harmonized data.
Fig. 2.
Fig. 2.. Distributed ComBat parameter estimates.
Scatter plots compare parameter estimates from distributed ComBat versus those obtained from ComBat with all data at one location. a and b show empirical Bayes point estimates for location and scale respectively. c displays the regression coefficients obtained from each method.

References

    1. Al-Rubaie M, Wu P, Chang JM, Kung S, 2017. Privacy-preserving PCA on horizontally-partitioned data. In: Proceedings of the IEEE Conference on Dependable and Secure Computing, pp. 280–287. doi:10.1109/DESEC.2017.8073817. - DOI
    1. Avants B, Klein A, Tustison N, Woo J, Gee JC, 2010. Evaluation of open-access, automated brain extraction methods on multi-site multi-disorder data. In: Proceedings of the 16th Annual Meeting for the Organization of Human Brain Mapping.
    1. Avants BB, Tustison NJ, Wu J, Cook PA, Gee JC, 2011. An open source multivariate framework for n-tissue segmentation with evaluation on public data. Neuroinformatics 9 (4), 381–400. doi:10.1007/s12021-011-9109-y. - DOI - PMC - PubMed
    1. Bartlett EA, DeLorenzo C, Sharma P, Yang J, Zhang M, Petkova E, Weissman M, McGrath PJ, Fava M, Ogden RT, Kurian BT, Malchow A, Cooper CM, Trombello JM, McInnis M, Adams P, Oquendo MA, Pizzagalli DA, Trivedi M, Parsey RV, 2018. Pretreatment and early-treatment cortical thickness is associated with SSRI treatment response in major depressive disorder. Neuropsychopharmacology 43 (11), 2221–2230. doi:10.1038/s41386-018-0122-9. - DOI - PMC - PubMed
    1. Beer JC, Tustison NJ, Cook PA, Davatzikos C, Sheline YI, Shinohara RT, Linn KA, 2020. Longitudinal ComBat: a method for harmonizing longitudinal multi-scanner imaging data. NeuroImage 220, 117129. doi:10.1016/j.neuroimage.2020.117129. - DOI - PMC - PubMed

Publication types