Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Vahid Sadeghi^{1

2

3}, Yasaman Sanahmadi^{1

2

3}, Maryam Behdad⁴, Alireza Vard^{2

3}, Mohsen Sharifi⁵, Ahmad Raeisi⁶, Mehdi Nikkhah⁷, Alireza Mehridehnavi^{2

3}

Affiliations

¹ Student Research Committee, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
² Medical Image & Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
³ Department of Bioelectrics and Biomedical Engineering, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
⁴ Department of Electrical Engineering, Yazd University, Yazd, Iran.
⁵ Gastroenterologist & Hepatologist Fellowship of Endosonography Isfahan University of Medical Sciences, Isfahan, Iran.
⁶ Department of Internal Medicine, Clinical Research Development Unit, Hajar Hospital, Shahrekord University of Medical Sciences, Shahrekord, Iran.
⁷ Gastrointestinal and Liver Diseases Research Centre, Iran University of Medical Sciences, Tehran, Iran.

PMID: 39351133
PMCID: PMC11440793
DOI: 10.1016/j.dib.2024.110927

Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Vahid Sadeghi et al. Data Brief. 2024.

. 2024 Sep 10:57:110927.

doi: 10.1016/j.dib.2024.110927. eCollection 2024 Dec.

Authors

Vahid Sadeghi^{1

2

3}, Yasaman Sanahmadi^{1

2

3}, Maryam Behdad⁴, Alireza Vard^{2

3}, Mohsen Sharifi⁵, Ahmad Raeisi⁶, Mehdi Nikkhah⁷, Alireza Mehridehnavi^{2

3}

Affiliations

¹ Student Research Committee, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
² Medical Image & Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
³ Department of Bioelectrics and Biomedical Engineering, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
⁴ Department of Electrical Engineering, Yazd University, Yazd, Iran.
⁵ Gastroenterologist & Hepatologist Fellowship of Endosonography Isfahan University of Medical Sciences, Isfahan, Iran.
⁶ Department of Internal Medicine, Clinical Research Development Unit, Hajar Hospital, Shahrekord University of Medical Sciences, Shahrekord, Iran.
⁷ Gastrointestinal and Liver Diseases Research Centre, Iran University of Medical Sciences, Tehran, Iran.

PMID: 39351133
PMCID: PMC11440793
DOI: 10.1016/j.dib.2024.110927

Abstract

Wireless capsule endoscopy (WCE) is capable of non-invasively visualizing the small intestine, the most complicated segment of the gastrointestinal tract, to detect different types of abnormalities. However, its main drawback is reviewing the vast number of captured images (more than 50,000 frames). The recorded images are only sometimes clear, and different contaminating agents, such as turbid materials and air bubbles, degrade the visualization quality of the WCE images. This condition could cause serious problems such as reducing mucosal view visualization, prolonging recorded video reviewing time, and increasing the risks of missing pathology. On the other hand, accurately quantifying the amount of turbid fluids and bubbles can indicate potential motility malfunction. To assist in developing computer vision-based techniques, we have constructed the first multicentre publicly available clear and contaminated annotated dataset by precisely segmenting 17,593 capsule endoscopy images from three different databases. In contrast to the existing datasets, our dataset has been annotated at the pixel level, discriminating the clear and contaminated regions and subsequently differentiating bubbles and turbid fluids from normal tissue. To create the dataset, we first selected all of the images (2906 frames) in the reduced mucosal view class covering different levels of contamination and randomly selected 12,237 images from the normal class of the copyright-free CC BY 4.0 licensed small bowel capsule endoscopy (SBCE) images from the Kvasir capsule endoscopy database. To mitigate the possible available bias in the mentioned dataset and to increase the sample size, the number of 2077 and 373 images have been stochastically chosen from the SEE-AI project and CECleanliness datasets respectively for the subsequent annotation. Randomly selected images have been annotated with the aid of ImageJ and ITK-SNAP software under the supervision of an expert SBCE reader with extensive experience in gastroenterology and endoscopy. For each image, two binary and tri-colour ground truth (GT) masks have been created in which each pixel has been indexed into two classes (clear and contaminated) and three classes (bubble, turbid fluids, and normal), respectively. To the best of the author's knowledge, there is no implemented clear and contaminated region segmentation on the capsule endoscopy reading software. Curated multicentre dataset can be utilized to implement applicable segmentation algorithms for identification of clear and contaminated regions and discrimination bubbles, as well as turbid fluids from normal tissue in the small intestine. Since the annotated images belong to three different sources, they provide a diverse representation of the clear and contaminated patterns in the WCE images. This diversity is valuable for training the models that are more robust to variations in data characteristics and can generalize well across different subjects and settings. The inclusion of images from three different centres allows for robust cross-validation opportunities, where computer vision-based models can be trained on one centre's annotated images and evaluated on others.

Keywords: Bubble; Ground truth masks; Small bowel capsule endoscopy; Small bowel visualization quality; Turbid fluids.

PubMed Disclaimer

Figures

Fig 1: — **Fig. 1**
Overall flowchart of the dataset construction.

Fig 2: — **Fig. 2**
Folder structure of the current WCE dataset.

Fig 3: — **Fig. 3**
Some samples of dataset frames with their corresponding manually segmented masks. First to third rows: original images, manually annotated binary masks, and tri-variate-created GT masks.

Fig 4: — **Fig. 4**
Some raw images from the three different databases along with their corresponding manually generated binary mask.

Fig 5: — **Fig. 5**
Bubble-contaminated regions annotation by utilizing the ITK-SNAP software.

Fig 6: — **Fig. 6**
Schematic of contaminated region segmentation with the aid of ImageJ software.

Fig 7: — **Fig. 7**
Block diagram of creation tri-colour GT mask.

See this image and copyright information in PMC

References

1. Schneider C.A., Rasband W.S., Eliceiri K.W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods. 2012;9(7):671–675. doi: 10.1038/nmeth.2089. - DOI - PMC - PubMed
1. Yushkevich P.A., et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage. 2006;31(3):1116–1128. doi: 10.1016/j.neuroimage.2006.01.015. - DOI - PubMed
1. Smedsrud P.H., et al. Kvasir-Capsule, a video capsule endoscopy dataset. Sci. Data. 2021;8(1):1–10. doi: 10.1038/s41597-021-00920-z. - DOI - PMC - PubMed
1. Yokote A., et al. Small bowel capsule endoscopy examination and open access database with artificial intelligence: the SEE-artificial intelligence project. DEN Open. 2024;4(1):1–10. doi: 10.1002/deo2.258. - DOI - PMC - PubMed
1. Noorda R., Nevárez A., Colomer A., Beltrán V.Pons, Naranjo V. Automatic evaluation of degree of cleanliness in capsule endoscopy based on a novel CNN architecture. Sci. Rep. 2020;10(1):1–13. doi: 10.1038/s41598-020-74668-8. - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources
- Elsevier Science
- PubMed Central
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Affiliations

Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database

Authors

Affiliations

Abstract

Figures

References

LinkOut - more resources

Full Text Sources

Miscellaneous