Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Nov;31(11):1751-62.
doi: 10.1002/hbm.20973.

Scan-rescan reliability of subcortical brain volumes derived from automated segmentation

Affiliations

Scan-rescan reliability of subcortical brain volumes derived from automated segmentation

Rajendra A Morey et al. Hum Brain Mapp. 2010 Nov.

Abstract

Large-scale longitudinal studies of regional brain volume require reliable quantification using automated segmentation and labeling. However, repeated MR scanning of the same subject, even if using the same scanner and acquisition parameters, does not result in identical images due to small changes in image orientation, changes in prescan parameters, and magnetic field instability. These differences may lead to appreciable changes in estimates of volume for different structures. This study examined scan-rescan reliability of automated segmentation algorithms for measuring several subcortical regions, using both within-day and across-day comparison sessions in a group of 23 normal participants. We found that the reliability of volume measures including percent volume difference, percent volume overlap (Dice's coefficient), and intraclass correlation coefficient (ICC), varied substantially across brain regions. Low reliability was observed in some structures such as the amygdala (ICC = 0.6), with higher reliability (ICC = 0.9) for other structures such as the thalamus and caudate. Patterns of reliability across regions were similar for automated segmentation with FSL/FIRST and FreeSurfer (longitudinal stream). Reliability was associated with the volume of the structure, the ratio of volume to surface area for the structure, the magnitude of the interscan interval, and the method of segmentation. Sample size estimates for detecting changes in brain volume for a range of likely effect sizes also differed by region. Thus, longitudinal research requires a careful analysis of sample size and choice of segmentation method combined with a consideration of the brain structure(s) of interest and the magnitude of the anticipated effects.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Scatter plots showing correlation between segmented amygdala volumes (mm3) from scan 1A and 1B for FSL/FIRST and FreeSurfer. Left hemisphere volumes are in green and right hemisphere volumes in orange. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 2
Figure 2
Scatter plots showing correlation between segmented hippocampus volumes (mm3) from scan 1A and 1B for FSL/FIRST and FreeSurfer. Left hemisphere volumes are in green and right hemisphere volumes in orange. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 3
Figure 3
Percent volume difference for scans with a 1‐h and 1‐week interscan interval for nine subcortical brain structures segmented with FIRST and FreeSurfer. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 4
Figure 4
Percent volume overlap for scans with a 1‐h and 1‐week interscan interval for nine subcortical brain structures segmented with FIRST and FreeSurfer. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 5
Figure 5
The T1 images and segmentations for subject 7 are shown for scan 1A (left panel) and for scan 1B (center panel). Segmentation of the L‐putamen (circled) is dramatically different on the lateral surface for scan 1A (3,777 voxels) compared to scan 1B (7,317 voxels). This resulted in lower intraclass correlations for 1A1B (0.29) and 1A2A (0.29) as compared to 2A2B (0.96) and 1B2B (0.95). There is no obvious artifact visible in scan 1A that might explain this discrepancy. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 6
Figure 6
Sample size requirements (y‐axis) for FreeSurfer (left) and FSL/FIRST (right) assuming a within subject design with two observations to achieve 80% power and 5% alpha level are shown for a range of effect sizes (x‐axis) and each of the nine subcortical structures. Note that the sample size is scaled by log10 to enhance visualization of curves at higher effect sizes. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]

References

    1. Agartz I, Okuguwa G, Nordstrom M, Greitz D, Magnotta V, Sedvall G ( 2001): Reliability and reproducibility of brain tissue volumetry from segmented MR scans. Eur Arch Psychiatry Clin Neurosci 251: 255–261. - PubMed
    1. Altman DG, Bland JM ( 1994): Quartiles, quintiles, centiles, and other quantiles. BMJ 309: 996. - PMC - PubMed
    1. Barnes J, Foster J, Boyes RG, Pepple T, Moore EK, Schott JM, Frost C, Scahill RI, Fox NC ( 2008): A comparison of methods for the automated calculation of volumes and atrophy rates in the hippocampus. Neuroimage 40: 1655–1671. - PubMed
    1. Bartzokis G, Mintz J, Marx P, Osborn D, Gutkind D, Chiang F, Phelan CK, Marder SR ( 1993): Reliability of in vivo volume measures of hippocampus and other brain structures using MRI. Magn Reson Imaging 11: 993–1006. - PubMed
    1. Castellanos FX, Lee PP, Sharp W, Jeffries NO, Greenstein DK, Clasen LS, Blumenthal JD, James RS, Ebens CL, Walter JM, et al ( 2002): Developmental trajectories of brain volume abnormalities in children and adolescents with attention‐deficit/hyperactivity disorder. JAMA 288: 1740–1748. - PubMed

Publication types