Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 20;12(1):828.
doi: 10.1038/s41597-025-05112-7.

Galar - a large multi-label video capsule endoscopy dataset

Affiliations

Galar - a large multi-label video capsule endoscopy dataset

Maxime Le Floch et al. Sci Data. .

Abstract

Video capsule endoscopy (VCE) is an important technology with many advantages (non-invasive, representation of small bowel), but faces many limitations as well (time-consuming analysis, short battery lifetime, and poor image quality). Artificial intelligence (AI) holds potential to address every one of these challenges, however the progression of machine learning methods is limited by the avaibility of extensive data. We propose Galar, the most comprehensive dataset of VCE to date. Galar consists of 80 videos, culminating in 3,513,539 annotated frames covering functional, anatomical, and pathological aspects and introducing a selection of 29 distinct labels. The multisystem and multicenter VCE data from two centers in Saxony (Germany), was annotated framewise and cross-validated by five annotators. The vast scope of annotation and size of Galar make the dataset a valuable resource for the use of AI models in VCE, thereby facilitating research in diagnostic methods, patient care workflow, and the development of predictive analytics in the field.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Example images of the 26 labels in the dataset. The figure does not contain images of the labels esophagitis, varices and celiac, as there were no instances of these pathologies present in the set of VCE studies.
Fig. 2
Fig. 2
Overall frames per label count of the Galar dataset. Image occurrences per labels are displayed across the three main groups (technical, sections and anatomical). The y-axis is scaled logarithmically. Legend: Orange - Anatomical Green - Pathologies Red - Technical.
Fig. 3
Fig. 3
The file structure of the Galar dataset. Frames are stored chronologically in subfolders of the Frames folder. Labels are stored in a single CSV file, per study. The metadata file further contains data on a per study basis.

Similar articles

References

    1. Ahmed, M. et al. Video Capsule Endoscopy in Gastroenterology. Gastroenterology Research15, 47–55, 10.14740/gr1487 (2022). - PMC - PubMed
    1. Kwack, W. G. et al. Current Status and Research into Overcoming Limitations of Capsule Endoscopy. Clinical endoscopy49, 8–15, 10.5946/ce.2016.49.1.8 (2016). - PMC - PubMed
    1. Liao, Z. et al. Indications and detection, completion, and retention rates of small-bowel capsule endoscopy: a systematic review. Gastrointestinal Endoscopy71, 280–286, 10.1016/j.gie.2009.09.031 (2010). - PubMed
    1. Goenka, M. K. et al. Capsule endoscopy: Present status and future expectation. World Journal of Gastroenterology20, 10024–10037, 10.3748/wjg.v20.i29.10024 (2014). - PMC - PubMed
    1. Iddan, G. et al. Wireless capsule endoscopy. Nature405, 417–417, 10.1038/35013140 (2000). - PubMed

LinkOut - more resources