Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 8:1-22.
doi: 10.1159/000545850. Online ahead of print.

Artificial Intelligence Applied to Ultrasound Diagnosis of Pelvic Gynecological Tumors: A Systematic Review and Meta-Analysis

Affiliations

Artificial Intelligence Applied to Ultrasound Diagnosis of Pelvic Gynecological Tumors: A Systematic Review and Meta-Analysis

Axel Geysels et al. Gynecol Obstet Invest. .

Abstract

Introduction: The objective of this study wasto perform a systematic review on artificial intelligence (AI) studies focused on identifying and differentiating pelvic gynecological tumors on ultrasound scans.

Methods: Studies developing or validating AI models for diagnosing gynecological pelvic tumors on ultrasound scans were eligible for inclusion. We systematically searched PubMed, Embase, Web of Science, and Cochrane Central from their database inception until April 30, 2024. To assess the quality of the included studies, we adapted the QUADAS-2 risk of bias tool to address the unique challenges of AI in medical imaging. Using multilevel random-effects models, we performed a meta-analysis to generate summary estimates of the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity. To provide a reference point of current diagnostic support tools for ultrasound examiners, we descriptively compared the pooled performance to that of the well-recognized ADNEX model on external validation. Subgroup analyses were performed to explore sources of heterogeneity.

Results: From 9,151 records retrieved, 44 studies were eligible: 40 on ovarian, 3 on endometrial, and 1 on myometrial pathology. Overall, 95% were at high risk of bias - primarily due to inappropriate study inclusion criteria, the absence of a patient-level split of training and testing image sets, and no calibration assessment. For ovarian tumors, the summary AUC for AI models distinguishing benign from malignant tumors was 0.89 (95% CI: 0.85-0.92). In lower risk studies (at least three low-risk domains), the summary AUC dropped to 0.87 (95% CI: 0.83-0.90), with deep learning models outperforming radiomics-based machine learning approaches in this subset. Only five studies included an external validation, and six evaluated calibration performance. In a recent systematic review of external validation studies, the ADNEX model had a pooled AUC of 0.93 (95% CI: 0.91-0.94) in studies at low risk of bias. Studies on endometrial and myometrial pathologies were reported individually.

Conclusion: Although AI models show promising discriminative performances for diagnosing gynecological tumors on ultrasound, most studies have methodological shortcomings that result in a high risk of bias. In addition, the ADNEX model appears to outperform most AI approaches for ovarian tumors. Future research should emphasize robust study designs - ideally large, multicenter, and prospective cohorts that mirror real-world populations - along with external validation, proper calibration, and standardized reporting.

Keywords: Artificial intelligence; Endometrial cancer; Gynecological oncology; Meta-analysis; Myometrial cancer; Ovarian cancer; Ultrasound.

PubMed Disclaimer

Conflict of interest statement

B.V.C., D.T., and W.F. serve as members of the Steering Committee of the International Ovarian Tumor Analysis (IOTA) consortium, with B.V.C. and D.T. being involved in the development of the ADNEX model. All other authors have no conflicts of interests.

Figures

Fig. 1.
Fig. 1.
PRISMA flowchart of the study selection.
Fig. 2.
Fig. 2.
Summary ROC graph with 95% CI stratified by the risk of bias based on the adapted QUADAS-2 assessment, spanning 40 models from 14 ovarian cancer studies with “less high risk” (at least 3 low-risk domains) and 36 models from 11 ovarian cancer studies with “very high risk” (less than 3 low-risk domains). ADNEX, Assessment of Different Neoplasias in the Adnexa; CA-125, cancer antigen-125; CI, confidence interval; ROC, receiver operating characteristic curve.
Fig. 3.
Fig. 3.
Forest plot of reported AUC with 95% CI stratified by the risk of bias based on the adapted QUADAS-2 assessment, spanning 34 models from 11 ovarian cancer studies with “less high risk” (at least 3 low-risk domains) and 16 models from 4 ovarian cancer studies with “very high risk” (less than 3 low-risk domains). A descriptive comparison with ADNEX on external validation – both with and without CA-125 – is also shown. ADNEX, Assessment of Different Neoplasias in the Adnexa; AUC, area under the receiver operating characteristic curve; CA-125, cancer antigen-125; CI, confidence interval; ROC, receiver operating characteristic curve.

Similar articles

Cited by

References

    1. Ferlay J, Steliarova-Foucher E, Lortet-Tieulent J, Rosso S, Coebergh JWW, Comber H, et al. Cancer incidence and mortality patterns in Europe: estimates for 40 countries in 2012. Eur J Cancer. 2013;49(6):1374–403. - PubMed
    1. Heintz A, Odicino F, Maissonneuve P, Quinn M, Benedet J, Creasman W, et al. Carcinoma of the ovary. Int J Gynecol Obstet. 2006;95(S1):S161–92. - PubMed
    1. Van Holsbeke C, Daemen A, Yazbek J, Holland TK, Bourne T, Mesens T, et al. Ultrasound experience substantially impacts on diagnostic performance and confidence when adnexal masses are classified using pattern recognition. Gynecol Obstet Invest. 2010;69(3):160–8. - PubMed
    1. Van Calster B, Van Hoorde K, Valentin L, Testa AC, Fischerova D, Van Holsbeke C, et al. Evaluating the risk of ovarian cancer before surgery using the ADNEX model to differentiate between benign, borderline, early and advanced stage invasive, and secondary metastatic tumours: prospective multicentre diagnostic study. BMJ. 2014;349:g5920. - PMC - PubMed
    1. Barreñada L, Ledger A, Dhiman P, Collins G, Wynants L, Verbakel JY, et al. ADNEX risk prediction model for diagnosis of ovarian cancer: systematic review and meta-analysis of external validation studies. BMJ Med. 2024;3(1):e000817. - PMC - PubMed

Publication types

LinkOut - more resources