Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 28;11(5):135.
doi: 10.3390/jimaging11050135.

Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Affiliations

Breast Lesion Detection Using Weakly Dependent Customized Features and Machine Learning Models with Explainable Artificial Intelligence

Simona Moldovanu et al. J Imaging. .

Abstract

This research proposes a novel strategy for accurate breast lesion classification that combines explainable artificial intelligence (XAI), machine learning (ML) classifiers, and customized weakly dependent features from ultrasound (BU) images. Two new weakly dependent feature classes are proposed to improve the diagnostic accuracy and diversify the training data. These are based on image intensity variations and the area of bounded partitions and provide complementary rather than overlapping information. ML classifiers such as Random Forest (RF), Extreme Gradient Boosting (XGB), Gradient Boosting Classifiers (GBC), and LASSO regression were trained with both customized feature classes. To validate the reliability of our study and the results obtained, we conducted a statistical analysis using the McNemar test. Later, an XAI model was combined with ML to tackle the influence of certain features, the constraints of feature selection, and the interpretability capabilities across various ML models. LIME (Local Interpretable Model-Agnostic Explanations) and SHAP (SHapley Additive exPlanations) models were used in the XAI process to enhance the transparency and interpretation in clinical decision-making. The results revealed common relevant features for the malignant class, consistently identified by all of the classifiers, and for the benign class. However, we observed variations in the feature importance rankings across the different classifiers. Furthermore, our study demonstrates that the correlation between dependent features does not impact explainability.

Keywords: LIME; SHAP; XAI; dependent features; machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Bounded histogram features. (a) Raw breast US image from the BUSI dataset; (b) Ground truth of a breast lesion; (c) Region of interest. Selected pixels within bounded repartitions are shown according to their gray-levels distributions. (d1) [0, 31]; (d2) [32, 63]; (d3) [64, 95]; (d4) [96, 127]; (d5) [128, 159]; (d6) [160, 191]; (d7) [192, 223]; (d8) [224, 255].
Figure 2
Figure 2
The overall feature importance in the prediction results over the test dataset. (a) Bounded histogram features (Chi); (b) Grayscale density features (Ci). The most important features are marked in yellow.
Figure 3
Figure 3
LIME output: the importance of individual features in the classification process by their relevance and score, and the features’ selection across various classifiers. (a1) RF and CHi; (a2) RF and Ci; (b1) GBC and CHi; (b2) GBC and Ci; (c1) XGB and CHi; (c2) XGB and Ci. “0” or blue is associated with the malignant class and “1” or orange is for the benign class.
Figure 4
Figure 4
SHAP-integrated ML classifiers’ summary plot on the test data for the malignant and benign output classes. (a1) RF and CHi; (a2) RF and Ci; (b1) GBC and CHi; (b2) GBC and Ci; (c1) XGB and CHi; (c2) XGB and Ci. The horizontal axis plots an SV for a specific feature and data point. The vertical axis ranks the features based on their importance. The values of the features are represented with the following code: lower values are shown in blue, and higher values are shown in red. Points that overlap are shown vertically.

Similar articles

References

    1. World Cancer Research Found International Breast Cancer Statistics. [(accessed on 15 July 2024)]. Available online: https://www.wcrf.org/cancer-trends/
    1. American Cancer Society How Common Is Breast Cancer? [(accessed on 15 July 2024)]. Available online: https://www.cancer.org/cancer/types/breast-cancer.
    1. World Health Organization Breast Cancer. [(accessed on 15 July 2024)]. Available online: https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
    1. Kerlikowske K. Epidemiology of Ductal Carcinoma In Situ. JNCI Monogr. 2010;2010:139–141. doi: 10.1093/jncimonographs/lgq027. - DOI - PMC - PubMed
    1. Mahdavi M., Nassiri M., Kooshyar M.M., Vakili-Azghandi M., Avan A., Sandry R., Pillai S., Lam A.K., Gopalan V. Hereditary Breast Cancer; Genetic Penetrance and Current Status with BRCA. J. Cell. Physiol. 2019;234:5741–5750. doi: 10.1002/jcp.27464. - DOI - PubMed

LinkOut - more resources