Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan;51(1):278-291.
doi: 10.1002/mp.16582. Epub 2023 Jul 20.

Investigation of autosegmentation techniques on T2-weighted MRI for off-line dose reconstruction in MR-linac workflow for head and neck cancers

Affiliations

Investigation of autosegmentation techniques on T2-weighted MRI for off-line dose reconstruction in MR-linac workflow for head and neck cancers

Brigid A McDonald et al. Med Phys. 2024 Jan.

Abstract

Background: In order to accurately accumulate delivered dose for head and neck cancer patients treated with the Adapt to Position workflow on the 1.5T magnetic resonance imaging (MRI)-linear accelerator (MR-linac), the low-resolution T2-weighted MRIs used for daily setup must be segmented to enable reconstruction of the delivered dose at each fraction.

Purpose: In this pilot study, we evaluate various autosegmentation methods for head and neck organs at risk (OARs) on on-board setup MRIs from the MR-linac for off-line reconstruction of delivered dose.

Methods: Seven OARs (parotid glands, submandibular glands, mandible, spinal cord, and brainstem) were contoured on 43 images by seven observers each. Ground truth contours were generated using a simultaneous truth and performance level estimation (STAPLE) algorithm. Twenty total autosegmentation methods were evaluated in ADMIRE: 1-9) atlas-based autosegmentation using a population atlas library (PAL) of 5/10/15 patients with STAPLE, patch fusion (PF), random forest (RF) for label fusion; 10-19) autosegmentation using images from a patient's 1-4 prior fractions (individualized patient prior [IPP]) using STAPLE/PF/RF; 20) deep learning (DL) (3D ResUNet trained on 43 ground truth structure sets plus 45 contoured by one observer). Execution time was measured for each method. Autosegmented structures were compared to ground truth structures using the Dice similarity coefficient, mean surface distance (MSD), Hausdorff distance (HD), and Jaccard index (JI). For each metric and OAR, performance was compared to the inter-observer variability using Dunn's test with control. Methods were compared pairwise using the Steel-Dwass test for each metric pooled across all OARs. Further dosimetric analysis was performed on three high-performing autosegmentation methods (DL, IPP with RF and 4 fractions [IPP_RF_4], IPP with 1 fraction [IPP_1]), and one low-performing (PAL with STAPLE and 5 atlases [PAL_ST_5]). For five patients, delivered doses from clinical plans were recalculated on setup images with ground truth and autosegmented structure sets. Differences in maximum and mean dose to each structure between the ground truth and autosegmented structures were calculated and correlated with geometric metrics.

Results: DL and IPP methods performed best overall, all significantly outperforming inter-observer variability and with no significant difference between methods in pairwise comparison. PAL methods performed worst overall; most were not significantly different from the inter-observer variability or from each other. DL was the fastest method (33 s per case) and PAL methods the slowest (3.7-13.8 min per case). Execution time increased with a number of prior fractions/atlases for IPP and PAL. For DL, IPP_1, and IPP_RF_4, the majority (95%) of dose differences were within ± 250 cGy from ground truth, but outlier differences up to 785 cGy occurred. Dose differences were much higher for PAL_ST_5, with outlier differences up to 1920 cGy. Dose differences showed weak but significant correlations with all geometric metrics (R2 between 0.030 and 0.314).

Conclusions: The autosegmentation methods offering the best combination of performance and execution time are DL and IPP_1. Dose reconstruction on on-board T2-weighted MRIs is feasible with autosegmented structures with minimal dosimetric variation from ground truth, but contours should be visually inspected prior to dose reconstruction in an end-to-end dose accumulation workflow.

Keywords: MR-linac; autosegmentation; head and neck cancers.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Statement

The remaining authors have no conflicts of interest to disclose.

Figures

Figure 1:
Figure 1:
Execution time per case for each autosegmentation method. Data is represented as mean and standard deviation (error bars).
Figure 2:
Figure 2:
Dice similarity coefficient (DSC) and mean surface distance (MSD) in mm for the autosegmentation methods compared to ground truth contours and the pair-wise comparison of inter-observer variability. Distributions are shown as box plots, with the five horizontal bars in each distribution representing the minimum, first quartile, median, third quartile, and maximum.
Figure 3:
Figure 3:
Segmentations for six of the seven OARs (excluding the brainstem since it cannot be visualized in the same plane as all the other segmentations) for one example patient. Ground truth segmentations are in green, and each set of autosegmented contours is in red. The spinal cord contour is missing in the IPP_PF methods because the segmentation failed.
Figure 4:
Figure 4:
Heat map of p-values for Dunn’s test with the inter-observer variability as a control. Reported values are Bonferroni adjusted p-values. Red boxes indicate non-significant results (p>0.05), and blue boxes indicate significant results (p<0.05). White represents missing values (IPP_PF methods failed for the spinal cord for every test case). (Abbreviations: DL = deep learning, IPP = individualized patient prior, PAL = population atlas library, ST = STAPLE, PF = patch fusion, RF = random forest. 1, 2, 3, and 4 represent the number of prior fractions in IPP. 5, 10, and 15 represent the number of atlases in PAL. Glnd_Submand = submandibular gland. L = left, R = right.)
Figure 5:
Figure 5:
Heat map of p-values for the Steel-Dwass test for pair-wise comparison between autosegmentation methods pooled over all ROIs. Red boxes indicate non-significant results (p>0.05), and blue boxes indicate significant results (p<0.05). (Abbreviations: DL = deep learning, IPP = individualized patient prior, PAL = population atlas library, ST = STAPLE, PF = patch fusion, RF = random forest. 1, 2, 3, and 4 represent the number of prior fractions in IPP. 5, 10, and 15 represent the number of atlases in PAL.)
Figure 6:
Figure 6:
Differences (cGy) in dosimetric performance criteria (ΔDmean and ΔDmax) between plans based on ground truth contours and select autosegmentation methods for 5 patient cases. Positive/negative values mean that the dose was higher/lower in the autosegmented structure than in the ground truth structure, respectively. Treatment sites for the five patient cases are: case 1: larynx – left supraglottis; case 2: left hypopharynx; case 3: larynx – left glottis; case 4: oropharynx – right tonsil; case 5: oropharynx – right tonsil. (Abbreviations: Dmean = mean dose, Dmax = maximum dose, L = left, R = right, Glnd_Submand = submandibular gland, DL = deep learning, IPP_1 = individualized patient prior with 1 prior fraction, IPP_RF_4 = individualized patient prior with 4 prior fractions and random forest, and PAL_ST_5 = population atlas library with 5 atlases and STAPLE.)

Similar articles

Cited by

References

    1. Lagendijk JJW, Raaymakers BW, van Vulpen M. The Magnetic Resonance Imaging-Linac System. Semin Radiat Oncol. 2014;24:207–209. doi:10.1016/j.semradonc.2014.02.009 - DOI - PubMed
    1. Raaymakers BW, Jürgenliemk-Schulz IM, Bol GH, et al. First patients treated with a 1.5 T MRI-Linac: Clinical proof of concept of a high-precision, high-field MRI guided radiotherapy treatment. Phys Med Biol. 2017;62(23):L41–L50. doi:10.1088/1361-6560/aa9517 - DOI - PubMed
    1. Mutic S, Dempsey JF. The ViewRay System: Magnetic Resonance-Guided and Controlled Radiotherapy. Semin Radiat Oncol. 2014;24(3):196–199. doi:10.1016/j.semradonc.2014.02.008 - DOI - PubMed
    1. McDonald BA, Vedam S, Yang J, et al. Initial Feasibility and Clinical Implementation of Daily MR-Guided Adaptive Head and Neck Cancer Radiation Therapy on a 1.5T MR-Linac System: Prospective R-IDEAL 2a/2b Systematic Clinical Evaluation of Technical Innovation. Int J Radiat Oncol. 2021;109(5):1606–1618. doi:10.1016/j.ijrobp.2020.12.015 - DOI - PMC - PubMed
    1. Winkel D, Bol GH, Kroon PS, et al. Adaptive radiotherapy: The Elekta Unity MR-linac concept. Clin Transl Radiat Oncol. 2019;18:54–59. doi:10.1016/j.ctro.2019.04.001 - DOI - PMC - PubMed

MeSH terms