Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 3;15(1):19490.
doi: 10.1038/s41598-025-04344-2.

Enhanced nuclear information fusion and visual transformer for pathological breast cancer image classification

Affiliations

Enhanced nuclear information fusion and visual transformer for pathological breast cancer image classification

Qinyi Zhang et al. Sci Rep. .

Abstract

Breast cancer poses a significant threat to women's health. Early diagnosis using pathological images is crucial for effective treatment planning. However, the low resolution of pathological images poses significant challenges for the extraction of valid information, while their high complexity greatly increases the difficulty of image analysis. To address these challenges, this paper introduces an innovative classification method for breast cancer histopathological images, combining enhanced nuclear information with an Enhanced Vision Transformer (EVT) model using wavelet position embedding. The quintessence of the proposed method resides in its capacity to efficiently extract both biological and foundational image features from pathological images. This is accomplished by initially enhancing nuclear information through the application of segmentation models and sophisticated image processing techniques. Subsequently, wavelet positional embedding within the EVT model is leveraged to precisely capture key information embedded within the images. Experimental outcomes have demonstrated that our method attains an accuracy rate of 94.61% and an AUC value of 99.07% on the BreaKHis dataset, significantly outperforming other baseline network models in terms of classification efficacy. Furthermore, through visual representation, this study underscores the significance of nuclear information enhancement and wavelet position transformation in the EVT model, thereby further confirming the effectiveness and effectiveness of the method we proposed.

Keywords: Classification; Enhanced nuclear information fusion; Pathological breast cancer image; Segmentation; Visual transformer.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Shows the comparison of details, with raw data on the left and enhanced data on the right. (a) Differences before and after image enhancement. (b) The difference of channel 3 after normalization before and after image enhancement.
Fig. 2
Fig. 2
The overall flow of the method. The framework consists of three main stages: (1) Nuclear information acquisition, where the BreakHis dataset is processed by a segmentation model (2) Enhanced dataset construction, where the segmented nuclei are refined via scale-adjusted fusion and (3) Classification task implementation, where the enhanced data is.
Fig. 3
Fig. 3
Schematic diagram of nuclear fusion method. The pipeline processes histopathology images through nucleus segmentation, canny edge detection and edge strengthening.
Fig. 4
Fig. 4
EVT structure. The network processes input through tokenization blocks (convolution/ReLU/pooling), parallel branches (transpose/FC/attention), and a classifier with wavelet position embedding, ultimately outputting binary classification results.
Fig. 5
Fig. 5
Comparison of original data, nucleus segmentation results, and enhanced data for benign and malignant cases.
Fig. 6
Fig. 6
Confusion matrices comparing test data performance of EVT models with nuclear information enhancement and wavelet position transform.
Fig. 7
Fig. 7
ROC curves and AUC values illustrating the classification performance of various EVT model combinations on test data.
Fig. 8
Fig. 8
The color map is displayed after normalizing the tensor before the last flattener operation in the tokenizer.
Fig. 9
Fig. 9
Network model visualization. The a, c, e, g and i is the model.tokenizer.conv_layers[0][0] layer, the first convolutional layer in the network. The b, d, f, h and j is the Model.Tokenizer.conv_layers[2][2] layer, which is the last maximum pooling layer of the tokenizer module.

References

    1. Sung, H., Ferlay, J. & Siegel, R. L., et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA71(3), 209–249 (2021). - PubMed
    1. Gomes, D. S. et al. Inter-observer variability between general pathologists and a specialist in breast pathology in the diagnosis of lobular neoplasia, columnar cell lesions, atypical ductal hyperplasia and ductal carcinoma in situ of the breast. Diagn. Pathol.9, 1–9 (2014). - PMC - PubMed
    1. Kaul, V., Enslin, S. & Gross, S. A. History of artificial intelligence in medicine. Gastrointest. Endosc.92(4), 807–812 (2020). - PubMed
    1. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature521(7553), 436–444 (2015). - PubMed
    1. Ibrahim, A. et al. Artificial intelligence in digital breast pathology: Techniques and applications. The Breast49, 267–273 (2020). - PMC - PubMed

MeSH terms

LinkOut - more resources