. 2025 Mar 24;15(7):822.

doi: 10.3390/diagnostics15070822.

Evolution of an Artificial Intelligence-Powered Application for Mammography

Affiliations

¹ Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, 127051 Moscow, Russia.
² Department of Information Technology and Medical Data Processing, I.M. Sechenov First Moscow State Medical University (Sechenov University), 119991 Moscow, Russia.
³ Celsus (Medical Screening Systems), Viktorenko St., Bldg. 11, Room 21N, 125167 Moscow, Russia.
⁴ Department of Artificial Intelligence Technologies, MIREA-Russian Technological University, 119454 Moscow, Russia.

PMID: 40218172
PMCID: PMC11988740
DOI: 10.3390/diagnostics15070822

Evolution of an Artificial Intelligence-Powered Application for Mammography

Yuriy Vasilev et al. Diagnostics (Basel). 2025.

. 2025 Mar 24;15(7):822.

doi: 10.3390/diagnostics15070822.

Authors

Affiliations

¹ Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, 127051 Moscow, Russia.
² Department of Information Technology and Medical Data Processing, I.M. Sechenov First Moscow State Medical University (Sechenov University), 119991 Moscow, Russia.
³ Celsus (Medical Screening Systems), Viktorenko St., Bldg. 11, Room 21N, 125167 Moscow, Russia.
⁴ Department of Artificial Intelligence Technologies, MIREA-Russian Technological University, 119454 Moscow, Russia.

PMID: 40218172
PMCID: PMC11988740
DOI: 10.3390/diagnostics15070822

Abstract

Background: The implementation of radiological artificial intelligence (AI) solutions remains challenging due to limitations in existing testing methodologies. This study assesses the efficacy of a comprehensive methodology for performance testing and monitoring of commercial-grade mammographic AI models. Methods: We utilized a combination of retrospective and prospective multicenter approaches to evaluate a neural network based on the Faster R-CNN architecture with a ResNet-50 backbone, trained on a dataset of 3641 mammograms. The methodology encompassed functional and calibration testing, coupled with routine technical and clinical monitoring. Feedback from testers and radiologists was relayed to the developers, who made updates to the AI model. The test dataset comprised 112 medical organizations, representing 10 manufacturers of mammography equipment and encompassing 593,365 studies. The evaluation metrics included the area under the curve (AUC), accuracy, sensitivity, specificity, technical defects, and clinical assessment scores. Results: The results demonstrated significant enhancement in the AI model's performance through collaborative efforts among developers, testers, and radiologists. Notable improvements included functionality, diagnostic accuracy, and technical stability. Specifically, the AUC rose by 24.7% (from 0.73 to 0.91), the accuracy improved by 15.6% (from 0.77 to 0.89), sensitivity grew by 37.1% (from 0.62 to 0.85), and specificity increased by 10.7% (from 0.84 to 0.93). The average proportion of technical defects declined from 9.0% to 1.0%, while the clinical assessment score improved from 63.4 to 72.0. Following 2 years and 9 months of testing, the AI solution was integrated into the compulsory health insurance system. Conclusions: The multi-stage, lifecycle-based testing methodology demonstrated substantial potential in software enhancement and integration into clinical practice. Key elements of this methodology include robust functional and diagnostic requirements, continuous testing and updates, systematic feedback collection from testers and radiologists, and prospective monitoring.

Keywords: artificial intelligence; mammography; radiology; software; software validation.

PubMed Disclaimer

Conflict of interest statement

Evgeniy Nikitin and Artem Kapninskiy are employees of the company Celsus. They had no role in the design of this study or in the collection, analyses, or interpretation of data. The other authors declare no relationships with any companies whose products or services may be related to the subject matter of the article.

Figures

**Figure 1**
Model architecture. * Module components or actions performed.

**Figure 2**
Overview of the study design.

**Figure 3**
Overview of the study timeline.

**Figure 4**
Additional image series with graphical masks highlighting pathological findings.

**Figure 5**
ROC curves for (a) version 0.14.0 and (b) version 0.15.0 of the Celsus Mammography AI model: results of the first and second calibration tests.

**Figure 6**
(a) Variations in A and B defects during the technical monitoring; (b) changes in the average rate of technical defects observed during the technical monitoring.

**Figure 7**
Changes in the clinical assessment score during the clinical monitoring.

**Figure 8**
False-positive findings by the AI model: (a) benign vascular calcifications misclassified as malignant calcification, and (b) normal fibroglandular tissue misclassified as a malignant mass.

**Figure 9**
True-positive results from the AI model, illustrating the detected irregular masses with indistinct margins in the right and left breasts.

**Figure 10**
(a) ROC curve for Celsus Mammography (version 0.16.0) from the third calibration test; (b) ROC curve for Celsus Mammography (version 0.17.0) from the fourth calibration test.

See this image and copyright information in PMC

References

1. Katsika L., Boureka E., Kalogiannidis I., Tsakiridis I., Tirodimos I., Lallas K., Tsimtsiou Z., Dagklis T. Screening for Breast Cancer: A Comparative Review of Guidelines. Life. 2024;14:777. doi: 10.3390/life14060777. - DOI - PMC - PubMed
1. Akwo J., Hadadi I., Ekpo E. Diagnostic Efficacy of Five Different Imaging Modalities in the Assessment of Women Recalled at Breast Screening—A Systematic Review and Meta-Analysis. Cancers. 2024;16:3505. doi: 10.3390/cancers16203505. - DOI - PMC - PubMed
1. Ding L., Greuter M.J.W., Truyen I., Goossens M., Van der Vegt B., De Schutter H., Van Hal G., de Bock G.H. Effectiveness of Organized Mammography Screening for Different Breast Cancer Molecular Subtypes. Cancers. 2022;14:4831. doi: 10.3390/cancers14194831. - DOI - PMC - PubMed
1. Nicosia L., Gnocchi G., Gorini I., Venturini M., Fontana F., Pesapane F., Abiuso I., Bozzini A.C., Pizzamiglio M., Latronico A., et al. History of Mammography: Analysis of Breast Imaging Diagnostic Achievements over the Last Century. Healthcare. 2023;11:1596. doi: 10.3390/healthcare11111596. - DOI - PMC - PubMed
1. Magnus M.C., Ping M., Shen M.M., Bourgeois J., Magnus J.H. Effectiveness of Mammography Screening in Reducing Breast Cancer Mortality in Women Aged 39–49 Years: A Meta-Analysis. J. Women’s Health. 2011;20:845–852. doi: 10.1089/jwh.2010.2098. - DOI - PubMed

Grants and funding

123031400006-0/Moscow Health Care Department

LinkOut - more resources

Full Text Sources
- MDPI
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Evolution of an Artificial Intelligence-Powered Application for Mammography

Affiliations

Evolution of an Artificial Intelligence-Powered Application for Mammography

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

References

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

References

Related information

Grants and funding

LinkOut - more resources

Full Text Sources