Int J Comput Assist Radiol Surg. 2024 Jul;19(7):1259-1266.
doi: 10.1007/s11548-024-03171-6. Epub 2024 May 22.

OneSLAM to map them all: a generalized approach to SLAM for monocular endoscopic imaging based on tracking any point

Timo Teufel et al. Int J Comput Assist Radiol Surg. 2024 Jul.

Abstract

Purpose: Monocular SLAM algorithms are the key enabling technology for image-based surgical navigation systems for endoscopic procedures. Because visual features are scarce and lighting conditions are unusual in endoscopy, classical SLAM approaches perform inconsistently. Many recent approaches to endoscopic SLAM rely on deep learning models. They show promising results when optimized on a single domain such as arthroscopy, sinus endoscopy, colonoscopy or laparoscopy, but they cannot generalize to other domains without retraining.

Methods: To address this generality issue, we propose OneSLAM, a monocular SLAM algorithm for surgical endoscopy that works out of the box for several endoscopic domains, including sinus endoscopy, colonoscopy, arthroscopy and laparoscopy. Our pipeline builds upon robust tracking any point (TAP) foundation models to reliably track sparse correspondences across multiple frames, and runs local bundle adjustment to jointly optimize camera poses and a sparse 3D reconstruction of the anatomy.
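To make the bundle adjustment step concrete, the following is a minimal sketch of the kind of reprojection cost that a local bundle adjustment typically minimizes over a window of frames. It is not the authors' implementation: the pinhole camera model, the names `project` and `reprojection_cost`, the intrinsics `f`, `cx`, `cy`, and the `tracks` dictionary (standing in for the 2D correspondences a TAP model would supply) are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence, Tuple

Vec3 = Tuple[float, float, float]
Mat3 = Sequence[Sequence[float]]
Pose = Tuple[Mat3, Vec3]  # (rotation R, translation t), world -> camera


def project(point: Vec3, R: Mat3, t: Vec3,
            f: float, cx: float, cy: float) -> Tuple[float, float]:
    """Project a 3D world point into pixel coordinates (assumed pinhole model)."""
    # Transform into camera coordinates: X_cam = R * X_world + t
    xc = [sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)]
    if xc[2] <= 0:
        raise ValueError("point is behind the camera")
    # Perspective division followed by the intrinsic mapping
    return (f * xc[0] / xc[2] + cx, f * xc[1] / xc[2] + cy)


def reprojection_cost(poses: List[Pose], points: List[Vec3],
                      tracks: Dict[Tuple[int, int], Tuple[float, float]],
                      f: float, cx: float, cy: float) -> float:
    """Sum of squared reprojection errors over all tracked observations.

    tracks maps (frame index, point index) to the observed pixel (u, v),
    i.e. the sparse 2D correspondences followed across frames.
    """
    cost = 0.0
    for (frame, pid), (u, v) in tracks.items():
        R, t = poses[frame]
        pu, pv = project(points[pid], R, t, f, cx, cy)
        cost += (pu - u) ** 2 + (pv - v) ** 2
    return cost


if __name__ == "__main__":
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    # Two hypothetical camera poses and two hypothetical 3D points
    poses = [(identity, (0.0, 0.0, 0.0)), (identity, (-0.1, 0.0, 0.0))]
    points = [(0.0, 0.0, 1.0), (0.2, -0.1, 2.0)]
    # Synthesize noise-free observations from the true geometry
    tracks = {(k, i): project(points[i], *poses[k], 500.0, 320.0, 240.0)
              for k in range(len(poses)) for i in range(len(points))}
    print(reprojection_cost(poses, points, tracks, 500.0, 320.0, 240.0))
```

In this sketch, joint optimization would mean adjusting both `poses` and `points` to reduce the cost, usually with a nonlinear least-squares solver; the solver itself is omitted here.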

Results: We compare the performance of our method against three strong baselines previously proposed for monocular SLAM in endoscopy and general scenes. In all four tested domains, OneSLAM performs better than or comparably to existing approaches targeted to that specific domain, generalizing across domains without the need for retraining.

Conclusion: OneSLAM benefits from the strong performance of TAP foundation models and generalizes to endoscopic sequences of different anatomies, while performing better than or comparably to domain-specific SLAM approaches. Future research on global loop closure will investigate how to reliably detect loops in endoscopic scenes to reduce accumulated drift and enhance long-term navigation capabilities.

Keywords: 3D motion estimation; Arthroscopy; Computer vision; Endoscopy; Image-based navigation; Monocular SLAM; Tracking any point.
