Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 25:11:e2780.
doi: 10.7717/peerj-cs.2780. eCollection 2025.

Enhancing colorectal polyp classification using gaze-based attention networks

Affiliations

Enhancing colorectal polyp classification using gaze-based attention networks

Zhenghao Guo et al. PeerJ Comput Sci. .

Abstract

Colorectal polyps are potential precursor lesions of colorectal cancer. Accurate classification of colorectal polyps during endoscopy is crucial for early diagnosis and effective treatment. Automatic and accurate classification of colorectal polyps based on convolutional neural networks (CNNs) during endoscopy is vital for assisting endoscopists in diagnosis and treatment. However, this task remains challenging due to difficulties in the data acquisition and annotation processes, the poor interpretability of the data output, and the lack of widespread acceptance of the CNN models by clinicians. This study proposes an innovative approach that utilizes gaze attention information from endoscopists as an auxiliary supervisory signal to train a CNN-based model for the classification of colorectal polyps. Gaze information from the reading of endoscopic images was first recorded through an eye-tracker. Then, the gaze information was processed and applied to supervise the CNN model's attention via an attention consistency module. Comprehensive experiments were conducted on a dataset that contained three types of colorectal polyps. The results showed that EfficientNet_b1 with supervised gaze information achieved an overall test accuracy of 86.96%, a precision of 87.92%, a recall of 88.41%, an F1 score of 88.16%, the area under the receiver operating characteristic (ROC) curve (AUC) is 0.9022. All evaluation metrics surpassed those of EfficientNet_b1 without gaze information supervision. The class activation maps generated by the proposed network also indicate that the endoscopist's gaze-attention information, as auxiliary prior knowledge, increases the accuracy of colorectal polyp classification, offering a new solution to the field of medical image analysis.

Keywords: Class activation map; Colorectal polyps; Eye-tracking; Gaze attention.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. Illustration of different types of colorectal polyps and corresponding gaze attention heatmaps.
(A) Diagram of a hyperplastic or inflammatory polyp; (B) gaze attention heatmaps of a hyperplastic or inflammatory polyp; (C) diagram of an adenomatous polyp, intramucosal carcinoma, and superficial submucosal invasive carcinoma; (D) gaze attention heatmaps of an adenomatous polyp, intramucosal carcinoma, and superficial submucosal invasive carcinoma; (E) diagram of a deep submucosal invasive carcinoma; (F) gaze attention heatmaps of a deep submucosal invasive carcinoma.
Figure 2
Figure 2. Architecture of EfficientNet_b1 backbone.
Figure 3
Figure 3. Architecture of gaze-based attention network for colorectal polyp classification.
Figure 4
Figure 4. Confusion matrices of EfficientNet_b1.
(A) Confusion matrix without gaze attention enabled; (B) confusion matrix with gaze attention enabled.
Figure 5
Figure 5. ROC curves of EfficientNet_b1 and EfficientNet_b1+Gaze.
Figure 6
Figure 6. Demonstration of the effects of enabling or disabling gaze attention.
The first category shows pictures of the test; the second column shows the effect of using CAM; and the third column shows the effect of adding gaze attention.

References

    1. Bhattacharya M, Jain S, Prasanna P. GazeRadar: a gaze and radiomics-guided disease localization framework. Conference on Medical Image Computing and Computer Assisted Intervention; 2022. pp. 686–696.
    1. Bisogni C, Nappi M, Tortora G, Del Bimbo A. Gaze analysis: a survey on its applications. Image and Vision Computing. 2024;144(2024):104961. doi: 10.1016/j.imavis.2024.104961. - DOI
    1. Chen PJ, Lin MC, Lai MJ, Lin JC, Tseng VS. Accurate classification of diminutive colorectal polyps using computer-aided analysis. Gastroenterology. 2018;154(3):568–575. doi: 10.1053/j.gastro.2017.10.010. - DOI - PubMed
    1. Hewett DG, Kaltenbach T, Sano Y, Tanaka S, Saunders BP, Rex DK. Validation of a simple classification system for endoscopic diagnosis of small colorectal polyps using narrow-band imaging. Gastroenterology. 2012;143(3):599–607.e1. doi: 10.1053/j.gastro.2012.05.006. - DOI - PubMed
    1. Hossain MS, Rahman MM, Uddin MF, Hasan M, Hossain MA, Samad MA. Deeppoly: deep learning based polyps segmentation and classification for autonomous colonoscopy examination. IEEE Access. 2023;11:95889–95902. doi: 10.1109/ACCESS.2023.3310541. - DOI

LinkOut - more resources