Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 30;13(1):123.
doi: 10.3390/diagnostics13010123.

GAR-Net: Guided Attention Residual Network for Polyp Segmentation from Colonoscopy Video Frames

Affiliations

GAR-Net: Guided Attention Residual Network for Polyp Segmentation from Colonoscopy Video Frames

Joel Raymann et al. Diagnostics (Basel). .

Erratum in

Abstract

Colorectal Cancer is one of the most common cancers found in human beings, and polyps are the predecessor of this cancer. Accurate Computer-Aided polyp detection and segmentation system can help endoscopists to detect abnormal tissues and polyps during colonoscopy examination, thereby reducing the chance of polyps growing into cancer. Many of the existing techniques fail to delineate the polyps accurately and produce a noisy/broken output map if the shape and size of the polyp are irregular or small. We propose an end-to-end pixel-wise polyp segmentation model named Guided Attention Residual Network (GAR-Net) by combining the power of both residual blocks and attention mechanisms to obtain a refined continuous segmentation map. An enhanced Residual Block is proposed that suppresses the noise and captures low-level feature maps, thereby facilitating information flow for a more accurate semantic segmentation. We propose a special learning technique with a novel attention mechanism called Guided Attention Learning that can capture the refined attention maps both in earlier and deeper layers regardless of the size and shape of the polyp. To study the effectiveness of the proposed GAR-Net, various experiments were carried out on two benchmark collections viz., CVC-ClinicDB (CVC-612) and Kvasir-SEG dataset. From the experimental evaluations, it is shown that GAR-Net outperforms other previously proposed models such as FCN8, SegNet, U-Net, U-Net with Gated Attention, ResUNet, and DeepLabv3. Our proposed model achieves 91% Dice co-efficient and 83.12% mean Intersection over Union (mIoU) on the benchmark CVC-ClinicDB (CVC-612) dataset and 89.15% dice co-efficient and 81.58% mean Intersection over Union (mIoU) on the Kvasir-SEG dataset. The proposed GAR-Net model provides a robust solution for polyp segmentation from colonoscopy video frames.

Keywords: attention mechanism; colonoscopy; deep learning; healthcare informatics; medical image analysis; polyp segmentation; semantic segmentation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The overview of the Residual Block used.
Figure 2
Figure 2
The overview of the Modified Residual Block.
Figure 3
Figure 3
The overview of the Guided Attention Module with Guided Attention Learning.
Figure 4
Figure 4
The overview of the proposed GAR-Net Architecture.
Figure 5
Figure 5
Training and Validation results on the CVC-ClinicDB dataset.
Figure 6
Figure 6
The Training and Validation Graphs for the Kvasir-SEG Dataset training.
Figure 7
Figure 7
The output comparison for all the models on the CVC-ClinicDB Dataset.
Figure 8
Figure 8
The output comparison for all the models on the Kvasir-SEG Dataset.
Figure 9
Figure 9
The attention visualization comparison: Attention Gated Network vs. Guided Attention Module with a test sample from CVC-ClinicDB dataset. Even in deeper layers, GAM is still able to capture refined attention maps, while Attention Gated Network only outputs coarse and broken attention maps.
Figure 10
Figure 10
The attention visualization comparison compares Attention Gated Network vs. Guided Attention Module with a test sample from the Kvasir-SEG dataset. Notice even in deeper layers, GAM is still able to capture refined attention maps, while Attention Gated Network-only outputs coarse and broken attention maps.

References

    1. Lawton C. Colonoscopic Polypectomy and Long-Term Prevention of Colorectal-Cancer Deaths. Yearb. Oncol. 2013;2013:128–129. doi: 10.1016/j.yonc.2013.04.004. - DOI
    1. van Rijn J.C., Reitsma J.B., Stoker J., Bossuyt P.M., van Deventer S.J., Dekker E. Polyp Miss Rate Determined by Tandem Colonoscopy: A Systematic Review. Am. J. Gastroenterol. 2006;101:343–350. doi: 10.1111/j.1572-0241.2006.00390.x. - DOI - PubMed
    1. Breier M., Gross S., Behrens A., Stehle T., Aach T. Active contours for localizing polyps in colonoscopic NBI image data. Med. Imaging 2011 Comput.-Aided Diagn. 2011;7963:79632M. doi: 10.1117/12.877986. - DOI
    1. Bernal J., Sánchez J., Vilariño F. Towards automatic polyp detection with a polyp appearance model. Pattern Recognit. 2012;45:3166–3182. doi: 10.1016/j.patcog.2012.03.002. - DOI
    1. Manjunath K.N., Prabhu G.K., Siddalingaswamy P.C. A quantitative validation of segmented colon in virtual colonoscopy using image moments. Biomed. J. 2020;43:74–82. doi: 10.1016/j.bj.2019.07.006. - DOI - PMC - PubMed

LinkOut - more resources