Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 31;19(1):98.
doi: 10.1186/s13014-024-02496-5.

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Affiliations

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Wei Song et al. Radiat Oncol. .

Abstract

Background: The purpose of this study was to improve the deep learning (DL) model performance in predicting and classifying IMRT gamma passing rate (GPR) by using input features related to machine parameters and a class balancing technique.

Methods: A total of 2348 fields from 204 IMRT plans for patients with nasopharyngeal carcinoma were retrospectively collected to form a dataset. Input feature maps, including fluence, leaf gap, leaf speed of both banks, and corresponding errors, were constructed from the dynamic log files. The SHAP framework was employed to compute the impact of each feature on the model output for recursive feature elimination. A series of UNet++ based models were trained on the obtained eight feature sets with three fine-tuning methods including the standard mean squared error (MSE) loss, a re-sampling technique, and a proposed weighted MSE loss (WMSE). Differences in mean absolute error, area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were compared between the different models.

Results: The models trained with feature sets including leaf speed and leaf gap features predicted GPR for failed fields more accurately than the other models (F(7, 147) = 5.378, p < 0.001). The WMSE loss had the highest accuracy in predicting GPR for failed fields among the three fine-tuning methods (F(2, 42) = 14.149, p < 0.001), while an opposite trend was observed in predicting GPR for passed fields (F(2, 730) = 9.907, p < 0.001). The WMSE_FS5 model achieved a superior AUC (0.92) and more balanced sensitivity (0.77) and specificity (0.89) compared to the other models.

Conclusions: Machine parameters can provide discriminative input features for GPR prediction in DL. The novel weighted loss function demonstrates the ability to balance the prediction and classification accuracy between the passed and failed fields. The proposed approach is able to improve the DL model performance in predicting and classifying GPR, and can potentially be integrated into the plan optimization process to generate higher deliverability plans.

Trial registration: This clinical trial was registered in the Chinese Clinical Trial Registry on March 26th, 2020 (registration number: ChiCTR2000031276). https://clinicaltrials.gov/ct2/show/ChiCTR2000031276.

Keywords: Class imbalance; Classification; Deep learning; Machine parameters; Prediction; Quality assurance.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
The overall workflow for the study
Fig. 2
Fig. 2
a The proposed deep neural network based on the UNet++ architecture. b The structure of the basic convolution block
Fig. 3
Fig. 3
Mean SHAP values computed on the validation set for each input feature in each of the eight models ah trained during recursive feature elimination
Fig. 4
Fig. 4
Scatter plot of measured versus predicted gamma passing rate (GPR) for models trained with the full feature set (FS1), the optimal FS (FS5), and the FS with only the Fluence map (FS8) using each of the three fine-tuning methods. The diagonal dotted line represents the perfect prediction by an ideal model. The vertical and horizontal dashed lines represent the action limit. TP = true positive; TN = true negative; FP = false positive; FN = false negative
Fig. 5
Fig. 5
Comparison of the area under the receiver operating characteristic curve (AUC) for different models in the test dataset
Fig. 6
Fig. 6
The distribution of SHAP values computed on the test dataset for each feature in the WMSE_FS1 (a) and WMSE_FS5 (b) models

References

    1. Chen D, Cai SB, Soon YY, Cheo T, Vellayappan B, Tan CW, Ho F. Dosimetric comparison between intensity modulated radiation therapy (IMRT) vs dual arc volumetric arc therapy (VMAT) for nasopharyngeal cancer (NPC): systematic review and meta-analysis. J Med Imaging Radiat Sci. 2023;54(1):167–77. 10.1016/j.jmir.2022.10.195 - DOI - PubMed
    1. Miften M, Olch A, Mihailidis D, Moran J, Pawlicki T, Molineu A, et al. Tolerance limits and methodologies for IMRT measurement-based verification QA: recommendations of AAPM task group No 218. Med Phys. 2018;45(4):e53–83. 10.1002/mp.12810 - DOI - PubMed
    1. Chan LT, Tan YI, Tan PW, Leong YF, Khor JS, Teh MW, et al. Comparing log file to measurement-based patient-specific quality assurance. Phys Eng Sci Med. 2023;46(1):303–11. 10.1007/s13246-023-01219-6 - DOI - PubMed
    1. Stasko JT, Ferris WS, Adam DP, Culberson WS, Frigo SP. IMRT QA result prediction via MLC transmission decomposition. J Appl Clin Med Phys. 2023;24(8):1–10.10.1002/acm2.13990 - DOI - PMC - PubMed
    1. Zhu TC, Stathakis S, Clark JR, Feng W, Georg D, Holmes SM, et al. Report of AAPM task group 219 on independent calculation-based dose/MU verification for IMRT. Med Phys. 2021;48(10):e808–29. 10.1002/mp.15069 - DOI - PubMed

MeSH terms

LinkOut - more resources