Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 30;24(1):353.
doi: 10.1186/s12880-024-01548-2.

Interpretable machine learning model for predicting clinically significant prostate cancer: integrating intratumoral and peritumoral radiomics with clinical and metabolic features

Affiliations

Interpretable machine learning model for predicting clinically significant prostate cancer: integrating intratumoral and peritumoral radiomics with clinical and metabolic features

Wenjun Zhao et al. BMC Med Imaging. .

Abstract

Background: To develop and validate an interpretable machine learning model based on intratumoral and peritumoral radiomics combined with clinicoradiological features and metabolic information from magnetic resonance spectroscopy (MRS), to predict clinically significant prostate cancer (csPCa, Gleason score ≥ 3 + 4) and avoid unnecessary biopsies.

Methods: This study retrospectively analyzed 350 patients with suspicious prostate lesions from our institution who underwent 3.0 Tesla multiparametric magnetic resonance imaging (mpMRI) prior to biopsy (training set, n = 191, testing set, n = 83, and a temporal validation set, n = 76). Intratumoral and peritumoral volumes of interest (VOIintra, VOIperi)) were manually segmented by experienced radiologists on T2-weighted imaging (T2WI) and apparent diffusion coefficient (ADC) maps. Radiomic features were extracted separately from the VOIintra and VOIperi. After feature selection via the recursive feature elimination (RFE) algorithm, intratumoral radiomic score (intra-rad-score) and peritumoral radiomic score (peri-rad-score) were constructed. The clinical model, MRS model, and combined model integrating radiomic, clinicoradiological and metabolic features were constructed via the eXtreme Gradient Boosting (XGBoost) algorithm. The predictive performance of the models was evaluated in both the training and testing sets using receiver operating characteristic (ROC) curve analysis. SHapley Additive exPlanations (SHAP) analysis was applied to the combined model to visualize and interpret the prediction process.

Results: A total of 350 patients were included, comprising 173 patients with csPCa (49.4%) and 177 patients with non-csPCa (50.6%). The intra-rad-score and peri-rad-score were constructed via 10 and 16 radiomic features. The combined model demonstrated the highest AUC, accuracy, F1 score, sensitivity, and specificity in the testing set (0.968, 0.928, 0.927, 0.932, and 0.923, respectively) and in the temporal validation set (0.940, 0.895, 0.890, 0.923, and 0.875, respectively). SHAP analysis revealed that the intra-rad-score, PSAD, peri-rad-score, and PI-RADS score were the most important predictors of the combined model.

Conclusion: We developed and validated a robust machine learning model incorporating intratumoral and peritumoral radiomic features, along with clinicoradiological and metabolic parameters, to accurately identify csPCa. The prediction process was visualized via SHAP analysis to facilitate clinical decision- making.

Keywords: Interpretability; Machine learning; Prostate cancer; Radiomics.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: This study was approved by the institutional review board of Xinxiang Central Hospital (approval number, 2023 − 761). The informed consent was waived. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow chat of patient selection
Fig. 2
Fig. 2
Flow chat of model construction based on radiomics and machine learning
Fig. 3
Fig. 3
Cross validation score plots of the training set (A) and testing set (B). A recursive feature elimination (RFE) algorithm based on logistic regression was employed for feature selection in both the training and testing sets. Five-fold cross-validation was performed, and the optimal feature subset was selected based on the average model accuracy in the validation set. For each number of features selected, the graph shows the mean and standard deviation of the model’s cross-validation accuracy. The best models in the training and testing sets are built with 10 and 16 features respectively (red dashed lines), respectively
Fig. 4
Fig. 4
ROC curves and radar charts of the model evaluation parameters for the five models in the training, testing, and validation sets. The radar chart displays the AUC, accuracy, F1 score, sensitivity, and specificity of the five models
Fig. 5
Fig. 5
Global visualization of the combined model through SHAP. (A) The SHAP bar chart shows the weights of the most important features in the model. (B) The SHAP bees-warm plot displays an information-dense summary illustrating the impact of top features on the model’s output
Fig. 6
Fig. 6
Individual visualization of the combined model through SHAP. (A) and (B) show two examples of correctly predicted csPCa cases. (C) and (D) show two examples of correctly predicted non-csPCa cases. (A) for this specific sample, the PI-RADS score, Cit level, peri-rad-score, and intra-rad-score increase the prediction probability of csPCa, whereas the PSAD and PSA have the opposite effect. (B) exhibits a similar pattern. In contrast, (C) shows that the peri-rad-score, PSAD, and PSA increase the predicted probability of non-csPCa, whereas the intra-rad-score and PI-RADS score have the opposite effect. Figure (D) displays a comparable pattern to (C)

Similar articles

Cited by

References

    1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer statistics 2020: GLOBOCAN estimates of incidence and Mortality Worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–49. - PubMed
    1. Bell KJ, Del Mar C, Wright G, Dickinson J, Glasziou P. Prevalence of incidental prostate cancer: a systematic review of autopsy studies. Int J Cancer. 2015;137(7):1749–57. - PMC - PubMed
    1. Epstein JI, Egevad L, Amin MB, Delahunt B, Srigley JR, Humphrey PA. The 2014 International Society of Urological Pathology (ISUP) Consensus Conference on Gleason Grading of Prostatic Carcinoma: definition of grading patterns and proposal for a New Grading System. Am J Surg Pathol. 2016;40(2):244–52. - PubMed
    1. Hamdy FC, Donovan JL, Lane JA, Mason M, Metcalfe C, Holding P, et al. 10-Year outcomes after monitoring, surgery, or Radiotherapy for localized prostate Cancer. N Engl J Med. 2016;375(15):1415–24. - PubMed
    1. Wilt TJ, Jones KM, Barry MJ, Andriole GL, Culkin D, Wheeler T, et al. Follow-up of Prostatectomy versus Observation for early prostate Cancer. N Engl J Med. 2017;377(2):132–42. - PubMed