Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Simona Moldovanu^{1

2}, Dan Munteanu², Keka C Biswas³, Luminita Moraru^{1

4

5}

Affiliations

¹ The Modelling & Simulation Laboratory, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
² Department of Computer Science and Information Technology, Faculty of Automation, Computers, Electrical Engineering and Electronics, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
³ Department of Biological Sciences, University of Alabama at Huntsville, Huntsville, AL 35899, USA.
⁴ Department of Chemistry, Physics & Environment, Faculty of Sciences and Environment, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
⁵ Department of Physics, School of Science and Technology, Sefako Makgatho Health Sciences University, Medunsa, Pretoria 0204, South Africa.

PMID: 40422992
PMCID: PMC12112174
DOI: 10.3390/jimaging11050135

Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Simona Moldovanu et al. J Imaging. 2025.

. 2025 Apr 28;11(5):135.

doi: 10.3390/jimaging11050135.

Authors

Simona Moldovanu^{1

2}, Dan Munteanu², Keka C Biswas³, Luminita Moraru^{1

4

5}

Affiliations

¹ The Modelling & Simulation Laboratory, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
² Department of Computer Science and Information Technology, Faculty of Automation, Computers, Electrical Engineering and Electronics, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
³ Department of Biological Sciences, University of Alabama at Huntsville, Huntsville, AL 35899, USA.
⁴ Department of Chemistry, Physics & Environment, Faculty of Sciences and Environment, Dunarea de Jos University of Galati, 47 Domneasca Street, 800008 Galati, Romania.
⁵ Department of Physics, School of Science and Technology, Sefako Makgatho Health Sciences University, Medunsa, Pretoria 0204, South Africa.

PMID: 40422992
PMCID: PMC12112174
DOI: 10.3390/jimaging11050135

Abstract

This research proposes a novel strategy for accurate breast lesion classification that combines explainable artificial intelligence (XAI), machine learning (ML) classifiers, and customized weakly dependent features from ultrasound (BU) images. Two new weakly dependent feature classes are proposed to improve the diagnostic accuracy and diversify the training data. These are based on image intensity variations and the area of bounded partitions and provide complementary rather than overlapping information. ML classifiers such as Random Forest (RF), Extreme Gradient Boosting (XGB), Gradient Boosting Classifiers (GBC), and LASSO regression were trained with both customized feature classes. To validate the reliability of our study and the results obtained, we conducted a statistical analysis using the McNemar test. Later, an XAI model was combined with ML to tackle the influence of certain features, the constraints of feature selection, and the interpretability capabilities across various ML models. LIME (Local Interpretable Model-Agnostic Explanations) and SHAP (SHapley Additive exPlanations) models were used in the XAI process to enhance the transparency and interpretation in clinical decision-making. The results revealed common relevant features for the malignant class, consistently identified by all of the classifiers, and for the benign class. However, we observed variations in the feature importance rankings across the different classifiers. Furthermore, our study demonstrates that the correlation between dependent features does not impact explainability.

Keywords: LIME; SHAP; XAI; dependent features; machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

**Figure 1**
Bounded histogram features. (a) Raw breast US image from the BUSI dataset; (b) Ground truth of a breast lesion; (c) Region of interest. Selected pixels within bounded repartitions are shown according to their gray-levels distributions. (d1) [0, 31]; (d2) [32, 63]; (d3) [64, 95]; (d4) [96, 127]; (d5) [128, 159]; (d6) [160, 191]; (d7) [192, 223]; (d8) [224, 255].

**Figure 2**
The overall feature importance in the prediction results over the test dataset. (a) Bounded histogram features (Chi); (b) Grayscale density features (Ci). The most important features are marked in yellow.

**Figure 3**
LIME output: the importance of individual features in the classification process by their relevance and score, and the features’ selection across various classifiers. (a1) RF and CHi; (a2) RF and Ci; (b1) GBC and CHi; (b2) GBC and Ci; (c1) XGB and CHi; (c2) XGB and Ci. “0” or blue is associated with the malignant class and “1” or orange is for the benign class.

**Figure 4**
SHAP-integrated ML classifiers’ summary plot on the test data for the malignant and benign output classes. (a1) RF and CHi; (a2) RF and Ci; (b1) GBC and CHi; (b2) GBC and Ci; (c1) XGB and CHi; (c2) XGB and Ci. The horizontal axis plots an SV for a specific feature and data point. The vertical axis ranks the features based on their importance. The values of the features are represented with the following code: lower values are shown in blue, and higher values are shown in red. Points that overlap are shown vertically.

See this image and copyright information in PMC

References

1. World Cancer Research Found International Breast Cancer Statistics. [(accessed on 15 July 2024)]. Available online: https://www.wcrf.org/cancer-trends/
1. American Cancer Society How Common Is Breast Cancer? [(accessed on 15 July 2024)]. Available online: https://www.cancer.org/cancer/types/breast-cancer.
1. World Health Organization Breast Cancer. [(accessed on 15 July 2024)]. Available online: https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
1. Kerlikowske K. Epidemiology of Ductal Carcinoma In Situ. JNCI Monogr. 2010;2010:139–141. doi: 10.1093/jncimonographs/lgq027. - DOI - PMC - PubMed
1. Mahdavi M., Nassiri M., Kooshyar M.M., Vakili-Azghandi M., Avan A., Sandry R., Pillai S., Lam A.K., Gopalan V. Hereditary Breast Cancer; Genetic Penetrance and Current Status with BRCA. J. Cell. Physiol. 2019;234:5741–5750. doi: 10.1002/jcp.27464. - DOI - PubMed

LinkOut - more resources

Full Text Sources
- MDPI
- PubMed Central
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Affiliations

Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources

Research Materials