Med Image Anal. 2021 Aug;72:102091. doi: 10.1016/j.media.2021.102091. Epub 2021 Apr 30.

Multi-channel attention-fusion neural network for brain age estimation: Accuracy, generality, and interpretation with 16,705 healthy MRIs across lifespan

Sheng He et al. Med Image Anal. 2021 Aug.

Abstract

Brain age estimated by machine learning from T1-weighted magnetic resonance images (T1w MRIs) can reveal how brain disorders alter brain aging and can help in the early detection of such disorders. A fundamental step is to build an accurate age estimator from healthy brain MRIs. We focus on this step and propose a framework to improve the accuracy, generality, and interpretation of age estimation in healthy brain MRIs. For accuracy, we used one of the largest sample sizes to date (N = 16,705). For each subject, our proposed algorithm first explicitly splits the T1w image, which has commonly been treated as a single-channel 3D image in other studies, into two 3D image channels representing contrast and morphometry information. We further propose a "fusion-with-attention" deep learning convolutional neural network (FiA-Net) that learns how to best fuse the contrast and morphometry channels, recognizing that the channels contribute differently across brain anatomy and across feature layers. By comparison, multi-channel fusion has not previously been used for brain age estimation, and in other medical image analysis tasks (e.g., image synthesis or segmentation) it is mostly attention-free, where treating channels equally may not be optimal. For generality, we used lifespan data spanning 0-97 years of age for real-world utility, and we thoroughly tested the multi-site and multi-scanner generality of FiA-Net with two phases of validation in discovery and replication cohorts, whereas most other studies perform only one phase of cross-validation. For interpretation, we directly measured each artificial neuron's correlation with chronological age, whereas other studies examine feature saliency, and salient features may or may not predict age. Overall, FiA-Net achieved a mean absolute error (MAE) of 3.00 years and a Pearson correlation of r = 0.9840 with known chronological ages in healthy brain MRIs of subjects 0-97 years of age, comparing favorably with state-of-the-art algorithms and studies in accuracy and in generality across sites and datasets. We also provide interpretations of how different artificial neurons and real neuroanatomy contribute to the age estimation.

Keywords: Age prediction; Attention network; Deep learning; Lifespan brain MRI; Multi-channel fusion.

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Figure 1:
Overview of the proposed network architecture. It has three branches: two channel-specific networks (FiA-Net_con and FiA-Net_mor, top and bottom paths in blue) provide channel-specific age estimation results, and one attention-driven fusion network (FiA-Net_fus, middle path in orange) provides the final age estimation results. In the channel-specific branches, the Res_i (i = 1, 2, 3, 4) boxes are residual network blocks, and f_i^{m_i} are the intermediate deep features of channel image m_i after the i-th residual block. In the fusion branch, the F_i boxes are the fusion blocks. GAP is global average pooling.
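To make this layout concrete, here is a minimal PyTorch sketch of the three-branch skeleton. The layer widths, strides, and the 1x1-convolution stand-in for the fusion blocks are illustrative assumptions, not the paper's exact configuration; the real F_i blocks use the four attention mechanisms shown in Figure 4.

```python
# Minimal sketch of the FiA-Net layout in Figure 1 (assumed widths/strides;
# the paper's exact Res_i and F_i blocks differ).
import torch
import torch.nn as nn

class ResBlock(nn.Module):
    """Simplified stand-in for a 3D residual block (Res_i)."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv1 = nn.Conv3d(in_ch, out_ch, 3, stride=2, padding=1)
        self.conv2 = nn.Conv3d(out_ch, out_ch, 3, padding=1)
        self.skip = nn.Conv3d(in_ch, out_ch, 1, stride=2)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.conv2(self.relu(self.conv1(x)))
        return self.relu(out + self.skip(x))

class FiANetSketch(nn.Module):
    """Two channel-specific branches plus a fusion branch, each with an age head."""
    def __init__(self, widths=(8, 16, 32, 64)):
        super().__init__()
        chs = (1,) + widths
        self.branch_con = nn.ModuleList(ResBlock(chs[i], chs[i + 1]) for i in range(4))
        self.branch_mor = nn.ModuleList(ResBlock(chs[i], chs[i + 1]) for i in range(4))
        # Placeholder F_i: 1x1 conv over concatenated branch features. The real
        # fusion blocks are attention-driven and also carry their own stream.
        self.fusion = nn.ModuleList(nn.Conv3d(2 * chs[i + 1], chs[i + 1], 1) for i in range(4))
        self.gap = nn.AdaptiveAvgPool3d(1)
        self.head_con = nn.Linear(widths[-1], 1)
        self.head_mor = nn.Linear(widths[-1], 1)
        self.head_fus = nn.Linear(widths[-1], 1)

    def forward(self, x_con, x_mor):
        f_fus = None
        for i in range(4):
            x_con = self.branch_con[i](x_con)
            x_mor = self.branch_mor[i](x_mor)
            f_fus = self.fusion[i](torch.cat([x_con, x_mor], dim=1))
        pool = lambda f: self.gap(f).flatten(1)
        return (self.head_con(pool(x_con)),   # channel-specific estimate (contrast)
                self.head_mor(pool(x_mor)),   # channel-specific estimate (morphometry)
                self.head_fus(pool(f_fus)))   # final fused estimate
```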
Figure 2:
Explicit split of a 3D T1w image into two 3D images representing two channels of information (contrast and morphometry). (a) A subject's T1w image was registered to the SRI24 atlas, yielding a 3D registered intensity image (the first channel, contrast information) and a 3D RAVENS image (the second channel, morphometry information), both residing in the SRI24 atlas space. (b) The two channels of images for randomly chosen subjects, one from each ten-year bin of the age range. Each column shows the MRI slices in the axial (top row), sagittal (middle row), and coronal (bottom row) planes. All images reside in the SRI24 atlas space.
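Assembling the two channels in code is straightforward once the registration and RAVENS steps have been run by an external pipeline; the short sketch below assumes the two volumes already exist as NIfTI files (the file names and the z-score normalization are illustrative assumptions).

```python
# Load the two pre-computed channels for one subject (hypothetical file names).
import nibabel as nib
import numpy as np

def load_two_channels(contrast_path, ravens_path):
    """Return two (1, D, H, W) float32 arrays in SRI24 space, each z-scored."""
    channels = []
    for path in (contrast_path, ravens_path):
        vol = nib.load(path).get_fdata().astype(np.float32)
        vol = (vol - vol.mean()) / (vol.std() + 1e-8)  # per-volume z-score (an assumption)
        channels.append(vol[None])  # add a leading channel axis
    return channels  # fed to the two branches separately, not concatenated

x_con, x_mor = load_two_channels("subj_t1w_sri24.nii.gz", "subj_ravens_sri24.nii.gz")
```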
Figure 3:
Three different fusion strategies. Orange arrows represent the fusion.
Figure 4:
The proposed attention-driven fusion block in the i-th layer. (a) Overview of the fusion block, which contains four attention mechanisms: hard attention, illustrated in (b), with output features denoted f_i^{u1}; soft attention, illustrated in (c), with output features denoted f_i^{u2}; and two mutual attentions, illustrated in (d), with output features denoted f_i^{u3} and f_i^{u4}.
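The caption names the four attention paths but their exact formulations are not spelled out here, so the following sketch is only one plausible reading: hard attention as a winner-take-all between channels, soft attention as a learned voxel-wise blend, and mutual attention as each channel gating the other. Treat the specific operations as assumptions, not the paper's definitions.

```python
# Hedged sketch of a fusion block with the four named attention paths.
import torch
import torch.nn as nn

class FusionBlockSketch(nn.Module):
    def __init__(self, ch):
        super().__init__()
        self.soft_gate = nn.Sequential(nn.Conv3d(2 * ch, ch, 1), nn.Sigmoid())
        self.gate_from_con = nn.Sequential(nn.Conv3d(ch, ch, 1), nn.Sigmoid())
        self.gate_from_mor = nn.Sequential(nn.Conv3d(ch, ch, 1), nn.Sigmoid())
        self.mix = nn.Conv3d(4 * ch, ch, 1)  # combine the four concatenated outputs

    def forward(self, f_con, f_mor):
        # Hard attention (f_i^{u1}): winner-take-all between the two channels.
        f_u1 = torch.maximum(f_con, f_mor)
        # Soft attention (f_i^{u2}): learned voxel-wise blending weight.
        a = self.soft_gate(torch.cat([f_con, f_mor], dim=1))
        f_u2 = a * f_con + (1 - a) * f_mor
        # Mutual attentions (f_i^{u3}, f_i^{u4}): each channel gates the other.
        f_u3 = self.gate_from_mor(f_mor) * f_con
        f_u4 = self.gate_from_con(f_con) * f_mor
        return self.mix(torch.cat([f_u1, f_u2, f_u3, f_u4], dim=1))
```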
Figure 5:
Our two-phase validation strategy. The first cross-validation happened in the discovery cohort, which was split into five folds of equal sample size ("Test i") for five-fold cross-validation. In each cross-validation, the set "Test i" (dark orange box) was used for evaluation and the remaining four folds (light orange boxes) were used to train "Model i" (blue boxes). In the second phase of validation, each trained "Model i" (blue box) was applied to the completely unseen replication cohort (green box) to evaluate accuracy and generality.
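In code, the two phases reduce to an ordinary 5-fold split plus a second evaluation of every fold's model on the held-out replication cohort. A minimal sketch, assuming hypothetical train_model and evaluate_mae helpers:

```python
# Two-phase validation: 5-fold CV in discovery, then replication re-evaluation.
import numpy as np
from sklearn.model_selection import KFold

def two_phase_validation(X_disc, y_disc, X_repl, y_repl, train_model, evaluate_mae):
    kf = KFold(n_splits=5, shuffle=True, random_state=0)
    cv_mae, repl_mae = [], []
    for train_idx, test_idx in kf.split(X_disc):
        model = train_model(X_disc[train_idx], y_disc[train_idx])               # "Model i"
        cv_mae.append(evaluate_mae(model, X_disc[test_idx], y_disc[test_idx]))  # phase 1: "Test i"
        repl_mae.append(evaluate_mae(model, X_repl, y_repl))                    # phase 2: unseen cohort
    return np.mean(cv_mae), np.mean(repl_mae)
```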
Figure 6:
Strategy to interpret the predictive value of each neuron in the deep neural network.
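Per the abstract, the interpretation strategy is to measure each artificial neuron's correlation with chronological age directly. A minimal sketch, assuming a hypothetical (n_subjects, n_neurons) activation matrix extracted from the last layer:

```python
# Rank last-layer neurons by how strongly their activations track age.
import numpy as np
from scipy.stats import pearsonr

def neuron_age_correlations(activations, ages):
    """Pearson r between each neuron's activation and chronological age."""
    return np.array([pearsonr(activations[:, j], ages)[0]
                     for j in range(activations.shape[1])])

# Usage: order = np.argsort(-np.abs(neuron_age_correlations(acts, ages)))
```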
Figure 7:
Further understanding of the important choices in our algorithm: the predictive value of the four attention mechanisms and their concatenation (f_4^{u1}, f_4^{u2}, f_4^{u3}, f_4^{u4}, and f_4^{uc}) at the last layer of the fusion branch FiA-Net_fus.
Figure 8:
Accuracy comparisons among three algorithms using the same data and the same 5-fold cross-validation strategy, using MAE as the accuracy metric. The solid red line in each panel describes the ideal prediction, where the predicted ages are identical to the chronological ages. Each green dot represents a subject.
Figure 9:
Accuracy comparisons among three algorithms using the same data and the same 5-fold cross-validation strategy, using CS as the accuracy metric: the CS curves of brain age estimation for the different networks in cross-validation.
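For reference, the two accuracy metrics used in Figures 8 and 9 can be computed in a few lines. MAE is the mean absolute error; CS is commonly defined in the age-estimation literature as the fraction of subjects whose absolute error is at most a threshold alpha (that definition is an assumption here, not quoted from the paper):

```python
# MAE and CS(alpha) accuracy metrics for predicted vs. chronological ages.
import numpy as np

def mae(y_true, y_pred):
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred))))

def cumulative_score(y_true, y_pred, alpha):
    """Fraction of subjects with |error| <= alpha years."""
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred)) <= alpha))

# A CS curve is cumulative_score evaluated over a range of thresholds:
# cs_curve = [cumulative_score(ages, preds, a) for a in range(0, 11)]
```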
Figure 10:
Accuracy comparison among different studies that used different datasets. Each column is one study; the studies used different datasets and covered different age ranges. Red dots, following the red scale bar on the left, show the Mean Absolute Error (MAE) of each study. Blue bars, following the blue scale bar on the right, show the age range of each study. Our proposed study is represented by the rightmost blue bar. The gray rectangle highlights four studies, in the right part of the figure, which used > 6,000 subjects and lifespan data. These studies are therefore more directly comparable, and among them, our study had the lowest MAE.
Figure 11:
Interpretation of the four most-weighted neurons in the last layer of FiA-Net_fus. (a)-(d): voxel-wise correlations with chronological age in the four neurons across different age groups. (e): voxel-wise correlations with chronological age in each neuron over 0-97 years. The neurons were ranked by their weights in the last layer of FiA-Net_fus in descending order. (f): average correlations in 62 auto-segmented brain structures.
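A voxel-wise variant of the neuron-level analysis underlies panels (a)-(e): for a chosen neuron's activation maps, correlate each voxel's value with age across subjects. A minimal sketch, assuming a hypothetical (n_subjects, D, H, W) stack of activation maps:

```python
# Voxel-wise Pearson correlation of one neuron's activation maps with age.
import numpy as np

def voxelwise_age_correlation(maps, ages):
    """maps: (n_subjects, D, H, W); returns a (D, H, W) correlation volume."""
    m = maps.reshape(maps.shape[0], -1)
    m = (m - m.mean(0)) / (m.std(0) + 1e-8)        # z-score each voxel across subjects
    a = (ages - ages.mean()) / (ages.std() + 1e-8)  # z-score the ages
    r = (m * a[:, None]).mean(0)                    # mean product of z-scores = Pearson r
    return r.reshape(maps.shape[1:])
```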
Figure 12:
Age estimation errors (MAE) as a function of sample size at each age. The colored curves are the MAEs of different algorithms and follow the scale on the left y-axis. The gray bars are the numbers of samples at each age and follow the scale on the right y-axis.
