Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 23:12:1562558.
doi: 10.3389/fmed.2025.1562558. eCollection 2025.

Proteomic alterations in ovarian cancer-Predicting residual disease status using artificial intelligence and SHAP-based biomarker interpretation

Affiliations

Proteomic alterations in ovarian cancer-Predicting residual disease status using artificial intelligence and SHAP-based biomarker interpretation

Seyma Yasar et al. Front Med (Lausanne). .

Abstract

Introduction: High-grade serous ovarian cancer (HGSOC) is the most aggressive and prevalent subtype of ovarian Treatment outcomes are significantly influenced by residual disease status following neoadjuvant chemotherapy (NACT). Predicting residual disease before surgery can improve patient stratification and personalized treatment strategies.

Methods: This study analyzed pre-NACT proteomic data from 20 HGSOC patients treated with NACT. Patients were categorized into two groups based on surgical outcomes: no residual disease (R0, n = 14) and suboptimal residual disease (R1, n = 6). From an initial set of 97 differentially expressed proteins, 18 significant proteins were selected using the BORUTA feature selection method. Three machine learning models-Random Forest (RF), Support Vector Machine (SVM), and Bootstrap Aggregation with Classification and Regression Trees (BaggedCART)-were developed and evaluated.

Results: The Random Forest model achieved the best performance with an AUC of 0.955, accuracy of 0.830, sensitivity of 0.904, specificity of 0.763, and F1-score of 0.839. SHapley Additive exPlanations (SHAP) analysis identified five proteins (P48637, O43491, O95302, Q96CX2, and P49189) as the most influential predictors of residual disease. These proteins, including glutathione synthetase and peptidyl-prolyl cis-trans isomerase FKBP9, are associated with chemotherapy resistance mechanisms.

Discussion: The findings demonstrate the potential of integrating proteomic data with machine learning techniques for predicting surgical outcomes in HGSOC. Identified protein signatures may serve as valuable biomarkers for anticipating NACT response and informing clinical decision-making, ultimately contributing to personalized patient care.

Keywords: SHAP analysis; high-grade serous ovarian cancer (HGSOC); machine learning; neoadjuvant chemotherapy (NACT); proteomic biomarkers.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Scatter plot and bar chart showing SHAP value impacts on model outputs. Scatter plot (A) displays SHAP values for different features, with colors indicating feature impact from low (blue) to high (pink). Bar chart (B) ranks features by their mean SHAP values, showing relative importance.
Figure 1
(A) Global SHAP annotations of the random forest model for residual disease prediction. The bee swarm plot shows how features in the model affect predictions. Each dot represents a data sample, and the positions of the dots on the x-axis represent SHAP values (positive or negative impact). The colors of the dots represent feature values (blue—low, red—high). (B) Protein importance plots based on the mean SHAP values of the random forest model for residual disease prediction. The bar graph shows the average of the absolute SHAP values of the marginal contribution of each variable to the model output. This graph presents the relative importance of the variables on the model predictions in a hierarchical structure.

Similar articles

References

    1. Siegel RL, Giaquinto AN, Jemal A. Cancer statistics, 2024. CA Cancer J Clin. (2024) 74:12–49. 10.3322/caac.21820 - DOI - PubMed
    1. Petrucelli N, Daly MB, Pal T. BRCA1- and BRCA2-associated hereditary breast and ovarian cancer. In:Adam MP, Feldman J, Mirzaa GM, Pagon RA, Wallace SE, Amemiya A, editors. GeneReviews®. Seattle, WA: University of Washington; (1998). p. 1993–2025. - PubMed
    1. Lawson-Michod KA, Watt MH, Grieshober L, Green SE, Karabegovic L, Derzon S, et al. Pathways to ovarian cancer diagnosis: a qualitative study. BMC Women's Health. (2022) 22:430. 10.1186/s12905-022-02016-1 - DOI - PMC - PubMed
    1. Kim J, Park EY, Kim O, Schilder JM, Coffey DM, Cho C-H, et al. Cell origins of high-grade serous ovarian cancer. Cancers. (2018) 10:433. 10.3390/cancers10110433 - DOI - PMC - PubMed
    1. Lisio, M.-A., Fu L, Goyeneche A, Gao Zh, Telleria C. High-grade serous ovarian cancer: basic sciences, clinical and therapeutic standpoints. Int J Mol Sci. (2019) 20:952. 10.3390/ijms20040952 - DOI - PMC - PubMed

LinkOut - more resources