Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 10:57:110927.
doi: 10.1016/j.dib.2024.110927. eCollection 2024 Dec.

Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Affiliations

Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Vahid Sadeghi et al. Data Brief. .

Abstract

Wireless capsule endoscopy (WCE) is capable of non-invasively visualizing the small intestine, the most complicated segment of the gastrointestinal tract, to detect different types of abnormalities. However, its main drawback is reviewing the vast number of captured images (more than 50,000 frames). The recorded images are only sometimes clear, and different contaminating agents, such as turbid materials and air bubbles, degrade the visualization quality of the WCE images. This condition could cause serious problems such as reducing mucosal view visualization, prolonging recorded video reviewing time, and increasing the risks of missing pathology. On the other hand, accurately quantifying the amount of turbid fluids and bubbles can indicate potential motility malfunction. To assist in developing computer vision-based techniques, we have constructed the first multicentre publicly available clear and contaminated annotated dataset by precisely segmenting 17,593 capsule endoscopy images from three different databases. In contrast to the existing datasets, our dataset has been annotated at the pixel level, discriminating the clear and contaminated regions and subsequently differentiating bubbles and turbid fluids from normal tissue. To create the dataset, we first selected all of the images (2906 frames) in the reduced mucosal view class covering different levels of contamination and randomly selected 12,237 images from the normal class of the copyright-free CC BY 4.0 licensed small bowel capsule endoscopy (SBCE) images from the Kvasir capsule endoscopy database. To mitigate the possible available bias in the mentioned dataset and to increase the sample size, the number of 2077 and 373 images have been stochastically chosen from the SEE-AI project and CECleanliness datasets respectively for the subsequent annotation. Randomly selected images have been annotated with the aid of ImageJ and ITK-SNAP software under the supervision of an expert SBCE reader with extensive experience in gastroenterology and endoscopy. For each image, two binary and tri-colour ground truth (GT) masks have been created in which each pixel has been indexed into two classes (clear and contaminated) and three classes (bubble, turbid fluids, and normal), respectively. To the best of the author's knowledge, there is no implemented clear and contaminated region segmentation on the capsule endoscopy reading software. Curated multicentre dataset can be utilized to implement applicable segmentation algorithms for identification of clear and contaminated regions and discrimination bubbles, as well as turbid fluids from normal tissue in the small intestine. Since the annotated images belong to three different sources, they provide a diverse representation of the clear and contaminated patterns in the WCE images. This diversity is valuable for training the models that are more robust to variations in data characteristics and can generalize well across different subjects and settings. The inclusion of images from three different centres allows for robust cross-validation opportunities, where computer vision-based models can be trained on one centre's annotated images and evaluated on others.

Keywords: Bubble; Ground truth masks; Small bowel capsule endoscopy; Small bowel visualization quality; Turbid fluids.

PubMed Disclaimer

Figures

Fig 1:
Fig. 1
Overall flowchart of the dataset construction.
Fig 2:
Fig. 2
Folder structure of the current WCE dataset.
Fig 3:
Fig. 3
Some samples of dataset frames with their corresponding manually segmented masks. First to third rows: original images, manually annotated binary masks, and tri-variate-created GT masks.
Fig 4:
Fig. 4
Some raw images from the three different databases along with their corresponding manually generated binary mask.
Fig 5:
Fig. 5
Bubble-contaminated regions annotation by utilizing the ITK-SNAP software.
Fig 6:
Fig. 6
Schematic of contaminated region segmentation with the aid of ImageJ software.
Fig 7:
Fig. 7
Block diagram of creation tri-colour GT mask.

References

    1. Schneider C.A., Rasband W.S., Eliceiri K.W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods. 2012;9(7):671–675. doi: 10.1038/nmeth.2089. - DOI - PMC - PubMed
    1. Yushkevich P.A., et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage. 2006;31(3):1116–1128. doi: 10.1016/j.neuroimage.2006.01.015. - DOI - PubMed
    1. Smedsrud P.H., et al. Kvasir-Capsule, a video capsule endoscopy dataset. Sci. Data. 2021;8(1):1–10. doi: 10.1038/s41597-021-00920-z. - DOI - PMC - PubMed
    1. Yokote A., et al. Small bowel capsule endoscopy examination and open access database with artificial intelligence: the SEE-artificial intelligence project. DEN Open. 2024;4(1):1–10. doi: 10.1002/deo2.258. - DOI - PMC - PubMed
    1. Noorda R., Nevárez A., Colomer A., Beltrán V.Pons, Naranjo V. Automatic evaluation of degree of cleanliness in capsule endoscopy based on a novel CNN architecture. Sci. Rep. 2020;10(1):1–13. doi: 10.1038/s41598-020-74668-8. - DOI - PMC - PubMed

LinkOut - more resources