. 2024 Jul 31;19(1):98.

doi: 10.1186/s13014-024-02496-5.

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Wei Song¹, Wen Shang¹, Chunying Li¹, Xinyu Bian¹, Hong Lu¹, Jun Ma², Dahai Yu³

Affiliations

¹ Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China.
² Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China. bluemaple@sina.com.
³ Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China. yudahaipumc@hotmail.com.

PMID: 39085872
PMCID: PMC11293183
DOI: 10.1186/s13014-024-02496-5

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Wei Song et al. Radiat Oncol. 2024.

. 2024 Jul 31;19(1):98.

doi: 10.1186/s13014-024-02496-5.

Authors

Wei Song¹, Wen Shang¹, Chunying Li¹, Xinyu Bian¹, Hong Lu¹, Jun Ma², Dahai Yu³

Affiliations

¹ Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China.
² Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China. bluemaple@sina.com.
³ Department of Radiation Oncology, Jiangsu Province Hospital of Chinese Medicine, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210029, China. yudahaipumc@hotmail.com.

PMID: 39085872
PMCID: PMC11293183
DOI: 10.1186/s13014-024-02496-5

Abstract

Background: The purpose of this study was to improve the deep learning (DL) model performance in predicting and classifying IMRT gamma passing rate (GPR) by using input features related to machine parameters and a class balancing technique.

Methods: A total of 2348 fields from 204 IMRT plans for patients with nasopharyngeal carcinoma were retrospectively collected to form a dataset. Input feature maps, including fluence, leaf gap, leaf speed of both banks, and corresponding errors, were constructed from the dynamic log files. The SHAP framework was employed to compute the impact of each feature on the model output for recursive feature elimination. A series of UNet++ based models were trained on the obtained eight feature sets with three fine-tuning methods including the standard mean squared error (MSE) loss, a re-sampling technique, and a proposed weighted MSE loss (WMSE). Differences in mean absolute error, area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were compared between the different models.

Results: The models trained with feature sets including leaf speed and leaf gap features predicted GPR for failed fields more accurately than the other models (F(7, 147) = 5.378, p < 0.001). The WMSE loss had the highest accuracy in predicting GPR for failed fields among the three fine-tuning methods (F(2, 42) = 14.149, p < 0.001), while an opposite trend was observed in predicting GPR for passed fields (F(2, 730) = 9.907, p < 0.001). The WMSE_FS5 model achieved a superior AUC (0.92) and more balanced sensitivity (0.77) and specificity (0.89) compared to the other models.

Conclusions: Machine parameters can provide discriminative input features for GPR prediction in DL. The novel weighted loss function demonstrates the ability to balance the prediction and classification accuracy between the passed and failed fields. The proposed approach is able to improve the DL model performance in predicting and classifying GPR, and can potentially be integrated into the plan optimization process to generate higher deliverability plans.

Trial registration: This clinical trial was registered in the Chinese Clinical Trial Registry on March 26th, 2020 (registration number: ChiCTR2000031276). https://clinicaltrials.gov/ct2/show/ChiCTR2000031276.

Keywords: Class imbalance; Classification; Deep learning; Machine parameters; Prediction; Quality assurance.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Fig. 1**
The overall workflow for the study

**Fig. 2**
a The proposed deep neural network based on the UNet++ architecture. b The structure of the basic convolution block

**Fig. 3**
Mean SHAP values computed on the validation set for each input feature in each of the eight models a–h trained during recursive feature elimination

**Fig. 4**
Scatter plot of measured versus predicted gamma passing rate (GPR) for models trained with the full feature set (FS1), the optimal FS (FS5), and the FS with only the *Fluence* map (FS8) using each of the three fine-tuning methods. The diagonal dotted line represents the perfect prediction by an ideal model. The vertical and horizontal dashed lines represent the action limit. TP = true positive; TN = true negative; FP = false positive; FN = false negative

**Fig. 5**
Comparison of the area under the receiver operating characteristic curve (AUC) for different models in the test dataset

**Fig. 6**
The distribution of SHAP values computed on the test dataset for each feature in the WMSE_FS1 (a) and WMSE_FS5 (b) models

See this image and copyright information in PMC

References

1. Chen D, Cai SB, Soon YY, Cheo T, Vellayappan B, Tan CW, Ho F. Dosimetric comparison between intensity modulated radiation therapy (IMRT) vs dual arc volumetric arc therapy (VMAT) for nasopharyngeal cancer (NPC): systematic review and meta-analysis. J Med Imaging Radiat Sci. 2023;54(1):167–77. 10.1016/j.jmir.2022.10.195 - DOI - PubMed
1. Miften M, Olch A, Mihailidis D, Moran J, Pawlicki T, Molineu A, et al. Tolerance limits and methodologies for IMRT measurement-based verification QA: recommendations of AAPM task group No 218. Med Phys. 2018;45(4):e53–83. 10.1002/mp.12810 - DOI - PubMed
1. Chan LT, Tan YI, Tan PW, Leong YF, Khor JS, Teh MW, et al. Comparing log file to measurement-based patient-specific quality assurance. Phys Eng Sci Med. 2023;46(1):303–11. 10.1007/s13246-023-01219-6 - DOI - PubMed
1. Stasko JT, Ferris WS, Adam DP, Culberson WS, Frigo SP. IMRT QA result prediction via MLC transmission decomposition. J Appl Clin Med Phys. 2023;24(8):1–10. 10.1002/acm2.13990 - DOI - PMC - PubMed
1. Zhu TC, Stathakis S, Clark JR, Feng W, Georg D, Holmes SM, et al. Report of AAPM task group 219 on independent calculation-based dose/MU verification for IMRT. Med Phys. 2021;48(10):e808–29. 10.1002/mp.15069 - DOI - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- BioMed Central
- PubMed Central
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Affiliations

Improving the performance of deep learning models in predicting and classifying gamma passing rates with discriminative features and a class balancing technique: a retrospective cohort study

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous