Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 11;1(1):100006.
doi: 10.1016/j.ynirp.2021.100006. eCollection 2021 Mar.

Intracranial volume segmentation for neurodegenerative populations using multicentre FLAIR MRI

Affiliations

Intracranial volume segmentation for neurodegenerative populations using multicentre FLAIR MRI

Justin DiGregorio et al. Neuroimage Rep. .

Abstract

Intracranial volume (ICV) segmentation, also known as brain extraction or skull-stripping, is a critical preprocessing step in analytical pipelines for studying neurodegenerative diseases in magnetic resonance imaging (MRI). While the fluid-attenuated inversion recovery (FLAIR) MRI modality has emerged as an important sequence for analyzing cerebrovascular and neurodegenerative disease, most existing automated ICV segmentation methods have been developed for T1-weighted or multi-modal inputs. Additionally, many methods have been designed using single centre data of healthy subjects and encounter difficulties using images with varying acquisition parameters and neurodegenerative pathology. In this work, we develop and evaluate 2 traditional and 8 deep learning algorithms for ICV segmentation in FLAIR MRI. Training and testing were completed on 175 ​vol (8317 images) from 2 dementia and 1 vascular disease cohort. A human phantom FLAIR MRI dataset from a repeatedly scanned, healthy individual was also utilized for reliability analysis. Images were acquired from 47 imaging centres with varying scanners and parameters. To measure and compare performance, we present a novel framework for evaluating the effectiveness of computer generated segmentations on multicentre datasets. The evaluation framework includes assessments of algorithm accuracy, generalization capabilities, robustness to pathology and spatial location, and volumetric measurement reliability - all important dimensions for establishing proof of effectiveness (a prerequisite to clinical translation). The top performing method was a multiple resolution U-Net (MultiResUNet), which achieved a mean Dice similarity coefficient greater than 98% and was robust across pathology levels and spatial locations. Our results confirm a FLAIR-based ICV analytical pipeline can alone be utilized for large-scale neurodegenerative disease research. The presented evaluation framework can be deployed by other researchers to assess the viability of tools proposed for automated analysis of diverse, clinical MRI datasets.

Keywords: Brain extraction; Deep learning; FLAIR MRI; ICV segmentation; Skull-stripping.

PubMed Disclaimer

Conflict of interest statement

This manuscript has not been published and is not under consideration for publication elsewhere. All authors have approved the manuscript and agree with its submission to Neuroimage: Reports. We have no conflicts of interest to disclose.

Figures

Fig. 1
Fig. 1
Encoding/decoding units for U-Net, Res U-Net, and Dense U-Net. Encoding units reside between max pooling layers and decoding units reside between transposed convolutional layers.
Fig. 2
Fig. 2
Encoding/decoding unit for MultiResUNet. Encoding units reside between max pooling layers while decoding units reside between transposed convolutional layers.
Fig. 3
Fig. 3
Summary of the data split strategies and data preparation process prior to algorithm training and testing. All algorithms were trained, validated, and tested using the same subsets of data.
Fig. 4
Fig. 4
Sample images showing variation of scans from different datasets and different scanner vendors.
Fig. 5
Fig. 5
Volume histograms before and after intensity standardization. The intensity ranges used to estimate different tissue volumes via thresholding are shown by the colours on the standardized histograms. Pixels representing the brain (GM/WM), CSF, and WML are shown as turquoise, green, and blue, respectively.
Fig. 6
Fig. 6
Sample images showing variation of scans with different CSF loads and different WML loads.
Fig. 7
Fig. 7
Slices from SIMON volumes acquired on GE, Philips, and Siemens scanners. Despite all being acquired with the CDIP protocol (Yushkevich et al., 2006), differences in scanning hardware manifest themselves in the output volumes.
Fig. 8
Fig. 8
Sample ICV segmentations from data split #1 models. The top rows are 3 different ADNI volumes, the middle rows are 3 different CAIN volumes, and the bottom rows are 3 different CCNA volumes. Red overlays show the ground truth (GT) annotations, green overlays show traditional method outputs, and turquoise overlays show deep learning method outputs.
Fig. 9
Fig. 9
DSC, HD, and EF distributions across all ICV segmentation methods for data split #1.
Fig. 10
Fig. 10
DSC, HD, and EF distributions across all ICV segmentation methods for data split #2.
Fig. 11
Fig. 11
B-A plots between manual ground truth and algorithm predicted ICV in millilitres for data split #1 and data split #2. Mean of methods is given on the x-axis and difference between methods is given on the y-axis.
Fig. 12
Fig. 12
DSC distributions across all ICV segmentation methods for data split #1 as a function of scanner vendor.
Fig. 13
Fig. 13
DSC distributions across all ICV segmentation methods for data split #2 as a function of dataset.
Fig. 14
Fig. 14
DSC distributions across select ICV segmentation methods for data split #1 as a function of WML load, CSF load, and MoCA categorization.
Fig. 15
Fig. 15
Sample segmentations for challenging cases across select ICV segmentation methods. Red overlays show ground truth delineations, green overlays show traditional algorithm predictions, and turquoise overlays show 2D CNN predictions. The top row is a CVD case from the CAIN database, the middle row is an MCI case from the CCNA database, and the bottom row is an AD case from the ADNI database.
Fig. 16
Fig. 16
Average regional DSC across select ICV segmentation methods. Each region represents progressive 20% increments of volume slices. Standard deviation is shown as black error bars.
Fig. 17
Fig. 17
Error maps for select ICV segmentation methods. Averaged error maps of segmentations are shown for slices 5, 15, 25, 35, and 45 in the 55 slice registration atlas space.
Fig. 18
Fig. 18
ICV measurements computed on SIMON as a function of method and scanner vendor.
Fig. 19
Fig. 19
Coefficients of variation (CoV) between scanners and methods.
Fig. 20
Fig. 20
Average intensity standardized histograms of all CAIN, ADNI, and CCNA volumes from each scanner vendor.

Similar articles

References

    1. Akkus Z., Galimzianova A., Hoogi A., Rubin D.L., Erickson B.J. Deep learning for brain MRI segmentation: state of the art and future directions. J. Digit. Imag. 2017;30(4):449–459. - PMC - PubMed
    1. Ali H.M., Kaiser M.S., Mahmud M. International Conference on Brain Informatics. Springer; Cham: 2019. Application of convolutional neural network in segmenting brain regions from MRI data; pp. 136–146. December.
    1. Aribisala B.S., Hernández M.C.V., Royle N.A., Morris Z., Maniega S.M., Bastin M.E., et al. Brain atrophy associations with white matter lesions in the ageing brain: the Lothian Birth Cohort 1936. Eur. Radiol. 2013;23(4):1084–1092. - PubMed
    1. Avants B.B., Epstein C.L., Grossman M., Gee J.C. Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal. 2008;12(1):26–41. - PMC - PubMed
    1. Avants B.B., Tustison N.J., Song G., Cook P.A., Klein A., Gee J.C. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage. 2011;54(3):2033–2044. - PMC - PubMed

LinkOut - more resources