Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 14:17:1238646.
doi: 10.3389/fnins.2023.1238646. eCollection 2023.

Toward hippocampal volume measures on ultra-high field magnetic resonance imaging: a comprehensive comparison study between deep learning and conventional approaches

Affiliations

Toward hippocampal volume measures on ultra-high field magnetic resonance imaging: a comprehensive comparison study between deep learning and conventional approaches

Junyan Lyu et al. Front Neurosci. .

Abstract

The hippocampus is a complex brain structure that plays an important role in various cognitive aspects such as memory, intelligence, executive function, and path integration. The volume of this highly plastic structure is identified as one of the most important biomarkers of specific neuropsychiatric and neurodegenerative diseases. It has also been extensively investigated in numerous aging studies. However, recent studies on aging show that the performance of conventional approaches in measuring the hippocampal volume is still far from satisfactory, especially in terms of delivering longitudinal measures from ultra-high field magnetic resonance images (MRIs), which can visualize more boundary details. The advancement of deep learning provides an alternative solution to measuring the hippocampal volume. In this work, we comprehensively compared a deep learning pipeline based on nnU-Net with several conventional approaches including Freesurfer, FSL and DARTEL, for automatically delivering hippocampal volumes: (1) Firstly, we evaluated the segmentation accuracy and precision on a public dataset through cross-validation. Results showed that the deep learning pipeline had the lowest mean (L = 1.5%, R = 1.7%) and the lowest standard deviation (L = 5.2%, R = 6.2%) in terms of volume percentage error. (2) Secondly, sub-millimeter MRIs of a group of healthy adults with test-retest 3T and 7T sessions were used to extensively assess the test-retest reliability. Results showed that the deep learning pipeline achieved very high intraclass correlation coefficients (L = 0.990, R = 0.986 for 7T; L = 0.985, R = 0.983 for 3T) and very small volume percentage differences (L = 1.2%, R = 0.9% for 7T; L = 1.3%, R = 1.3% for 3T). (3) Thirdly, a Bayesian linear mixed effect model was constructed with respect to the hippocampal volumes of two healthy adult datasets with longitudinal 7T scans and one disease-related longitudinal dataset. It was found that the deep learning pipeline detected both the subtle and disease-related changes over time with high sensitivity as well as the mild differences across subjects. Comparison results from the aforementioned three aspects showed that the deep learning pipeline significantly outperformed the conventional approaches by large margins. Results also showed that the deep learning pipeline can better accommodate longitudinal analysis purposes.

Keywords: aging; convolutional neural network; deep learning; hippocampus; longitudinal study; volume estimation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Pearson correlation coefficient analyses between automated methods and manual delineation on ADNI-HarP: scatter plots wherein each solid line shows a linear fit with 95% confidence interval. A higher r value indicates stronger correlation.
Figure 2
Figure 2
Boxplots with medians and interquartile ranges of volume percentage differences (VPDs) for different methods on HCP and TOMCAT. A lower VPD indicates better test–retest reliability.
Figure 3
Figure 3
Boxplots with means and 95% confidence intervals of the variance ratio, between-subject variability and within-subject variability for all four methods under comparison on CEREBRUM-7T, TOMCAT, and ADNI3. The higher variance ratios indicate better discrimination between subjects, and higher within-subject reproducibility between the test–retest conditions.

References

    1. Ashburner J. (2007). A fast diffeomorphic image registration algorithm. NeuroImage 38, 95–113. doi: 10.1016/j.neuroimage.2007.07.007 - DOI - PubMed
    1. Ashburner J., Friston K. J. (2005). Unified segmentation. NeuroImage 26, 839–851. doi: 10.1016/j.neuroimage.2005.02.018 - DOI - PubMed
    1. Balboni E., Nocetti L., Carbone C., Dinsdale N., Genovese M., Guidi G., et al. . (2022). The impact of transfer learning on 3D deep learning convolutional neural network segmentation of the hippocampus in mild cognitive impairment and Alzheimer disease subjects. Hum. Brain Mapp. 43, 3427–3438. doi: 10.1002/hbm.25858, PMID: - DOI - PMC - PubMed
    1. Barnes J., Bartlett J. W., van de Pol L. A., Loy C. T., Scahill R. I., Frost C., et al. . (2009). A meta-analysis of hippocampal atrophy rates in Alzheimer's disease. Neurobiol. Aging 30, 1711–1723. doi: 10.1016/j.neurobiolaging.2008.01.010 - DOI - PMC - PubMed
    1. Bazin P. L., Weiss M., Dinse J., Schafer A., Trampel R., Turner R. (2014). A computational framework for ultra-high resolution cortical segmentation at 7Tesla. NeuroImage 93, 201–209. doi: 10.1016/j.neuroimage.2013.03.077, PMID: - DOI - PubMed

LinkOut - more resources