Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 3;10(1):671.
doi: 10.1038/s41597-023-02564-7.

Endomapper dataset of complete calibrated endoscopy procedures

Affiliations

Endomapper dataset of complete calibrated endoscopy procedures

Pablo Azagra et al. Sci Data. .

Abstract

Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introduces the Endomapper dataset, the first collection of complete endoscopy sequences acquired during regular medical practice, making secondary use of medical data. Its main purpose is to facilitate the development and evaluation of Visual Simultaneous Localization and Mapping (VSLAM) methods in real endoscopy data. The dataset contains more than 24 hours of video. It is the first endoscopic dataset that includes endoscope calibration as well as the original calibration videos. Meta-data and annotations associated with the dataset vary from the anatomical landmarks, procedure labeling, segmentations, reconstructions, simulated sequences with ground truth and same patient procedures. The software used in this paper is publicly available.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Overview of the Endomapper Dataset.
Fig. 2
Fig. 2
Two examples of calibration images (left, middle). The calibration pattern (right).
Fig. 3
Fig. 3
Example of photometric calibration results.
Fig. 4
Fig. 4
Illustration of the anatomical regions labeled.
Fig. 5
Fig. 5
Directory structure of the dataset.
Fig. 6
Fig. 6
Examples for the tool segmentation mask in Seq_009.
Fig. 7
Fig. 7
Two clusters from the COLMAP reconstruction after processing Seq_001. For each cluster, it is shown a 3D view of the frames’ poses and colon map points and five RGB images as summary of the cluster frames.
Fig. 8
Fig. 8
Comparison of how a regular pixel grid is undistorted by the calibration of each endoscope. Colonoscopes and gastroscopes are separated for easier visualization.
Fig. 9
Fig. 9
Distributions of error in the images in prototype calibrations. The line representing the error is not magnified, observe that most of them appear as points as errors are mostly smaller than one pixel.
Fig. 10
Fig. 10
Relation between the incoming ray angle θ with the distorted radius rd. The dotted curves represent the ideal orthogonal and equisolid projection models. The right image is a zoom of the curves to show the small differences between the colonoscope and the gastroscope.
Fig. 11
Fig. 11
View angles plotted on top of calibration images from each prototype endoscope. The iso-lines are plotted in 20° intervals.
Fig. 12
Fig. 12
Examples of successful retrieval from different sequences of the same patient. The left column contains the queries from the current sequence (Seq_035) while the rest of the columns are the first three retrieved images from the previous sequence (Seq_027).
Fig. 13
Fig. 13
ORB-SLAM3 sub-map in Seq_015 between frames 54420 and 55170. The camera undergoes a forward-backwards motion. Right, 3D map in top view, keyframes in blue, map points in red. Left images corresponding to 4 keyframes spread over the trajectory.
Fig. 14
Fig. 14
Binary Segmentation examples from Endomapper dataset using different approaches fine-tuned on Endomapper dataset: (a) Original Image (b) Ground-truth manual segmentation (c) MiniNet (d) UNet (e) LinkNet (f) MF-TAPNet.

References

    1. Campos C, Elvira R, Rodrguez JJG, Montiel JMM, Tardós JD. ORB-SLAM3: An accurate open-source library for visual, visual–inertial, and multimap SLAM. IEEE Transactions on Robotics. 2021;37:1874–1890. doi: 10.1109/TRO.2021.3075644. - DOI
    1. Engel J, Koltun V, Cremers D. Direct sparse odometry. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017;40:611–625. doi: 10.1109/TPAMI.2017.2658577. - DOI - PubMed
    1. Azagra P, 2022. Endomapper dataset of complete calibrated endoscopy procedures. Synapse. - DOI - PMC - PubMed
    1. Maier-Hein L, et al. Heidelberg colorectal data set for surgical data science in the sensor operating room. Scientific Data. 2021;8:1–11. doi: 10.1038/s41597-021-00882-2. - DOI - PMC - PubMed
    1. Rau, A., Bhattarai, B., Agapito, L. & Stoyanov, D. Bimodal camera pose prediction for endoscopy. Preprint at https://arxiv.org/abs/2204.04968 (2022).

LinkOut - more resources