Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul;26(7):e70152.
doi: 10.1002/acm2.70152.

A semi-automated workflow for cohort-wise preparation of radiotherapy data for dose-response modeling, including autosegmentation of organs at risk

Affiliations

A semi-automated workflow for cohort-wise preparation of radiotherapy data for dose-response modeling, including autosegmentation of organs at risk

Louise Mövik et al. J Appl Clin Med Phys. 2025 Jul.

Abstract

Background: Preparing retrospective dose data for risk modeling using large study cohorts can be time consuming as it often requires patient-wise manual interventions. This is especially the case when considering organs at risk (OARs) not systematically delineated historically. Therefore, we aimed to develop and test a semi-automated workflow for cohort-wise preparation of radiotherapy data from the oncology information system (OIS), including OAR autosegmentation, for risk modeling purposes.

Methods: A semi-automated workflow, including cohort-wise data extraction from a clinical OIS, cleanup, autosegmentation, quality controls (QCs), and data injection into a research OIS was iteratively developed using 106 patient cases. We evaluated two deep learning (DL)-based methods and compared with four atlas-based methods for autosegmentation of the proximal bronchial tree (PBT), the heart, and the esophagus that were possible to integrate into the workflow. One method was an in-house DL-based model using OARs manually contoured by experts for 100 cases. Geometric and dosimetric agreements with manually contoured OARs were evaluated for 20 independent cases. The final workflow was tested on 50 independent cases.

Results: The DL-based methods were better than the atlas-based at segmenting the PBT (mean Dice similarity coefficient (DSC) 0.81-0.83 versus 0.59-0.80) and the esophagus (mean DSC 0.76-0.77 versus 0.39-0.46). The methods performed similarly for the heart (mean DSC 0.90-0.95 (DL-based) and 0.84-0.90 (atlas-based)). Our in-house autosegmentation model had the highest mean DSC for all OARs. The final version of the workflow successfully prepared data for 80% of the test cases without case-specific manual interventions.

Conclusions: The semi-automated workflow enabled efficient cohort-wise preparation of OIS data for risk modeling purposes. Our in-house DL-based segmentation model outperformed the other methods.

Keywords: automation; autosegmentation; large‐scale studies; modeling.

PubMed Disclaimer

Conflict of interest statement

This study is partly funded by a research agreement between Sahlgrenska University Hospital and Varian Medical Systems (a Siemens Healthineers company).

Figures

FIGURE 1
FIGURE 1
Illustration of the semi‐automated workflow for cohort‐wise preparation of radiotherapy data for risk modeling purposes. The circles represent independent automatic processes that are executed sequentially from left to right. Solid arrows indicate data generation. Dashed arrows indicate reading of data. AutoSeg, autosegmentation; DICOM, digital imaging and communications in medicine; OIS, oncology information system; QC, quality control.
FIGURE 2
FIGURE 2
Geometric and dosimetric comparisons between each autosegmentation of the proximal bronchial tree and the corresponding manual contour (circles). Yellow circles correspond to autosegmentations flagged in the quality control implemented as an example. All doses were normalized to the prescribed dose (PD). The shape of the violin is the kernel density. In each violin, the white circle is the median and the bottom and top of the shaded area the 25th and 75th percentiles. The numbers at the bottom represent the mean ± 1 standard deviation. DSC, Dice similarity coefficient; DMean, mean dose; HD95, 95th percentile of the Hausdorff distances; D2%, the minimum dose in the 2% of the volume that receives the highest dose; MSD, mean surface distance; PD, prescribed dose.
FIGURE 3
FIGURE 3
Geometric and dosimetric comparisons between each autosegmentation of the heart and the corresponding manual contour (circles). Yellow circles correspond to autosegmentations flagged in the quality control implemented as an example. All doses were normalized to the prescribed dose (PD). The shape of the violin is the kernel density. In each violin, the white circle is the median and the bottom and top of the shaded area the 25th and 75th percentiles. The numbers at the bottom represent the mean ± 1 standard deviation. DSC, Dice similarity coefficient; DMean, mean dose; HD95, 95th percentile of the Hausdorff distances; D2%, the minimum dose in the 2% of the volume that receives the highest dose; MSD, mean surface distance; PD, prescribed dose.
FIGURE 4
FIGURE 4
Geometric and dosimetric comparisons between each autosegmentation of the esophagus and the corresponding manual contour (circles). Yellow circles correspond to autosegmentations flagged in the quality control implemented as an example. All doses were normalized to the prescribed dose (PD). The shape of the violin is the kernel density. In each violin, the white circle is the median and the bottom and top of the shaded area the 25th and 75th percentiles. The numbers at the bottom represent the mean ± 1 standard deviation. DSC, Dice similarity coefficient; DMean, mean dose; HD95, 95th percentile of the Hausdorff distances; D2%, the minimum dose in the 2% of the volume that receives the highest dose; MSD, mean surface distance; PD, prescribed dose.

References

    1. McNutt TR, Bowers M, Cheng Z, et al. Practical data collection and extraction for big data applications in radiotherapy. Med Phys. 2018;45(10):e863‐e869. doi: 10.1002/mp.12817 - DOI - PubMed
    1. Gurjar M, Lindberg J, Björk‐Eriksson T, Olsson C. Automated data extraction tool (DET) for external applications in radiotherapy. Tech Innov Patient Support Radiat Oncol. 2023;25:100194. doi: 10.1016/j.tipsro.2022.12.001 - DOI - PMC - PubMed
    1. Van Dijk LV, Abusaif AA, Rigert J, et al. Normal tissue complication probability (NTCP) prediction model for osteoradionecrosis of the mandible in patients with head and neck cancer after Radiation Therapy: large‐Scale Observational Cohort. Int J Radiat Oncol. 2021;111(2):549‐558. doi: 10.1016/j.ijrobp.2021.04.042 - DOI - PMC - PubMed
    1. Dean JA, Welsh LC, Wong KH, et al. Normal tissue complication probability (NTCP) modelling of severe acute mucositis using a novel oral mucosal surface organ at Risk. Clin Oncol. 2017;29(4):263‐273. doi: 10.1016/j.clon.2016.12.001 - DOI - PMC - PubMed
    1. Stam B, Peulen H, Rossi MMG, Belderbos JSA, Sonke JJ. Validation of automatic segmentation of ribs for NTCP modeling. Radiother Oncol. 2016;118(3):528‐534. doi: 10.1016/j.radonc.2015.12.014 - DOI - PubMed

LinkOut - more resources