Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 12;23(1):18.
doi: 10.1186/s12938-024-01219-x.

Screening ovarian cancer by using risk factors: machine learning assists

Affiliations

Screening ovarian cancer by using risk factors: machine learning assists

Raoof Nopour. Biomed Eng Online. .

Abstract

Background and aim: Ovarian cancer (OC) is a prevalent and aggressive malignancy that poses a significant public health challenge. The lack of preventive strategies for OC increases morbidity, mortality, and other negative consequences. Screening OC through risk prediction could be leveraged as a powerful strategy for preventive purposes that have not received much attention. So, this study aimed to leverage machine learning approaches as predictive assistance solutions to screen high-risk groups of OC and achieve practical preventive purposes.

Materials and methods: As this study is data-driven and retrospective in nature, we leveraged 1516 suspicious OC women data from one concentrated database belonging to six clinical settings in Sari City from 2015 to 2019. Six machine learning (ML) algorithms, including XG-Boost, Random Forest (RF), J-48, support vector machine (SVM), K-nearest neighbor (KNN), and artificial neural network (ANN) were leveraged to construct prediction models for OC. To choose the best model for predicting OC, we compared various prediction models built using the area under the receiver characteristic operator curve (AU-ROC).

Results: Current experimental results revealed that the XG-Boost with AU-ROC = 0.93 (0.95 CI = [0.91-0.95]) was recognized as the best-performing model for predicting OC.

Conclusions: ML approaches possess significant predictive efficiency and interoperability to achieve powerful preventive strategies leveraging OC screening high-risk groups.

Keywords: Machine learning; Ovarian cancer; Predictive efficiency; Preventive strategy; Public health challenge.

PubMed Disclaimer

Conflict of interest statement

There are no competing interests.

Figures

Fig. 1
Fig. 1
The preprocessing steps of the samples in the dataset
Fig. 2
Fig. 2
The ROC of ML-trained algorithms
Fig. 3
Fig. 3
The RI of factors associated with OC prediction
Fig. 4
Fig. 4
The importance of factors based on permutation feature score
Fig. 5
Fig. 5
The mean SHAP values associated predictors of OC
Fig. 6
Fig. 6
SHAP values associated with OC prediction pertaining to all cases
Fig. 7
Fig. 7
The external capability classification
Fig. 8
Fig. 8
The performance indicators of XG-Boost in two external settings
Fig. 9
Fig. 9
The internal and external ROC of the XG-Boost model

Similar articles

Cited by

References

    1. Momenimovahed Z, Tiznobaik A, Taheri S, Salehiniya H. Ovarian cancer in the world: epidemiology and risk factors. Int J Womens Health. 2019;11:287–299. doi: 10.2147/IJWH.S197604. - DOI - PMC - PubMed
    1. Gaona-Luviano P, Medina-Gaona LA, Magaña-Pérez K. Epidemiology of ovarian cancer. Chin Clin Oncol. 2020;9(4):47. doi: 10.21037/cco-20-34. - DOI - PubMed
    1. La Vecchia C. Ovarian cancer epidemiology and risk factors. Eur J Cancer Prev. 2017;26(1):55–62. doi: 10.1097/CEJ.0000000000000217. - DOI - PubMed
    1. Orr B, Edwards RP. Diagnosis and treatment of ovarian cancer. Hematol Oncol Clin North Am. 2018;32(6):943–964. doi: 10.1016/j.hoc.2018.07.010. - DOI - PubMed
    1. Lisio M-A, Fu L, Goyeneche A, Gao Z-h, Telleria C. High-grade serous ovarian cancer: basic sciences, clinical and therapeutic standpoints. Int J Mol Sci. 2019;20(4):952. doi: 10.3390/ijms20040952. - DOI - PMC - PubMed