. 2023 Feb:84:102709.

doi: 10.1016/j.media.2022.102709. Epub 2022 Dec 14.

Robust endoscopic image mosaicking via fusion of multimodal estimation

Liang Li¹, Evangelos Mazomenos², James H Chandler³, Keith L Obstein⁴, Pietro Valdastri⁵, Danail Stoyanov⁶, Francisco Vasconcelos⁷

Affiliations

¹ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK; College of Control Science and Engineering, Zhejiang University, Hangzhou, 310027, China. Electronic address: liang.li@zju.edu.cn.
² Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: e.mazomenos@ucl.ac.uk.
³ Storm Lab UK, School of Electronic, and Electrical Engineering, University of Leeds, Leeds LS2 9JT, UK. Electronic address: J.H.Chandler@leeds.ac.uk.
⁴ Division of Gastroenterology, Hepatology, and Nutrition, Vanderbilt University Medical Center, Nashville, TN 37232, USA; STORM Lab, Department of Mechanical Engineering, Vanderbilt University, Nashville, TN 37235, USA. Electronic address: keith.obstein@vanderbilt.edu.
⁵ Storm Lab UK, School of Electronic, and Electrical Engineering, University of Leeds, Leeds LS2 9JT, UK. Electronic address: p.valdastri@leeds.ac.uk.
⁶ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: danail.stoyanov@ucl.ac.uk.
⁷ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: f.vasconcelos@ucl.ac.uk.

PMID: 36549045
PMCID: PMC10636739
DOI: 10.1016/j.media.2022.102709

Robust endoscopic image mosaicking via fusion of multimodal estimation

Liang Li et al. Med Image Anal. 2023 Feb.

. 2023 Feb:84:102709.

doi: 10.1016/j.media.2022.102709. Epub 2022 Dec 14.

Authors

Liang Li¹, Evangelos Mazomenos², James H Chandler³, Keith L Obstein⁴, Pietro Valdastri⁵, Danail Stoyanov⁶, Francisco Vasconcelos⁷

Affiliations

¹ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK; College of Control Science and Engineering, Zhejiang University, Hangzhou, 310027, China. Electronic address: liang.li@zju.edu.cn.
² Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: e.mazomenos@ucl.ac.uk.
³ Storm Lab UK, School of Electronic, and Electrical Engineering, University of Leeds, Leeds LS2 9JT, UK. Electronic address: J.H.Chandler@leeds.ac.uk.
⁴ Division of Gastroenterology, Hepatology, and Nutrition, Vanderbilt University Medical Center, Nashville, TN 37232, USA; STORM Lab, Department of Mechanical Engineering, Vanderbilt University, Nashville, TN 37235, USA. Electronic address: keith.obstein@vanderbilt.edu.
⁵ Storm Lab UK, School of Electronic, and Electrical Engineering, University of Leeds, Leeds LS2 9JT, UK. Electronic address: p.valdastri@leeds.ac.uk.
⁶ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: danail.stoyanov@ucl.ac.uk.
⁷ Wellcome/EPSRC Centre for Interventional and Surgical Sciences(WEISS) and Department of Computer Science, University College London, London, UK. Electronic address: f.vasconcelos@ucl.ac.uk.

PMID: 36549045
PMCID: PMC10636739
DOI: 10.1016/j.media.2022.102709

Abstract

We propose an endoscopic image mosaicking algorithm that is robust to light conditioning changes, specular reflections, and feature-less scenes. These conditions are especially common in minimally invasive surgery where the light source moves with the camera to dynamically illuminate close range scenes. This makes it difficult for a single image registration method to robustly track camera motion and then generate consistent mosaics of the expanded surgical scene across different and heterogeneous environments. Instead of relying on one specialised feature extractor or image registration method, we propose to fuse different image registration algorithms according to their uncertainties, formulating the problem as affine pose graph optimisation. This allows to combine landmarks, dense intensity registration, and learning-based approaches in a single framework. To demonstrate our application we consider deep learning-based optical flow, hand-crafted features, and intensity-based registration, however, the framework is general and could take as input other sources of motion estimation, including other sensor modalities. We validate the performance of our approach on three datasets with very different characteristics to highlighting its generalisability, demonstrating the advantages of our proposed fusion framework. While each individual registration algorithm eventually fails drastically on certain surgical scenes, the fusion approach flexibly determines which algorithms to use and in which proportion to more robustly obtain consistent mosaics.

Keywords: Endoscopic image mosaicking; Image mosaicking; Medical image processing; Optical flow; Pose graph optimisation.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Fig. 1**
The diagram of the proposed method. There are three component homography estimation algorithms, *i.e.*, SIFT-based, direct registration-based, and the optical flow-based. The pose graph is constructed based on the three estimation sources with their own uncertainties respectively. The optimal state is obtained by optimising the cost function in the affine Lie group. Finally, the panorama can be generated with the optimal homography matrices.

**Fig. 2**
An example of the results of optical flow prediction and correspondence establishment. (a) and (b) show the two input images, and (c) is the predicted flow field by the Flownet2.0, where the colour coding scheme is shown in (d). The correspondence can be established using Eq. (3). In theory, the correspondence is very dense as correspondence for most pixels can be computed except ones close to the image border. Only a small portion of the correspondence is presented in (e) for a better visualisation.

**Fig. 3**
An illustration of the pose graph that is constructed using the optical flow, SIFT, direct registration, and loop closure detection. The nodes are denoted in blue triangles. And the different types of edges are denoted in lines with different colours.

**Fig. 4**
Examples of mosaicking directly obtained from using the robot kinematics, extracted from seq. 1 (a) and seq. 5 (b) of the SCARED dataset. The kinematics are not accurate enough to generate mosaics.

**Fig. 5**
Results on the SCARED dataset. Mosaicking results for five sequences are presented from the first to the last row. The SIFT, direct registration, optical flow, and fusion-based mosaicking are presented from the first to the fourth column. The problematic parts of the panorama are denoted in blue, orange, and green rectangles from the first to the third column. The fusion-based mosaicking can correct them and combine advantages of the component methods to give high-quality panoramas.

**Fig. 6**
Results on the fetoscopy dataset. Mosaicking results for six sequences are presented from the first to the last row. The SIFT, direct registration, optical flow, and fusion-based mosaicking are presented from the first to the fourth column. SIFT-based method fails to work on this dataset due to the texture-less background and difficulty to extract enough features. The fusion-based method fuses results of the direct registration-based and optical flow-based homography estimation, and can combine the advantages of both methods to generate better panoramas.

**Fig. 7**
Results on the human cadaver dataset. Mosaicking results for five sequences are presented from the first to the last row. The SIFT, direct registration, optical flow, and fusion-based mosaicking are presented from the first to the fourth column. From the first to the fourth sequence, only the optical flow works among the three component methods. And the result of fusion is same as that of optical-flow mosaicking. For the fifth sequence, the fusion-based method fuses the results of SIFT-based and optical flow-based homography estimation using the affine pose graph, to yield a more consistent panorama.

**Fig. 8**
Mosaics generated by simple mean fusion of the SIFT-based, direct registration-based, and the optical flow-based estimation.

**Fig. 9**
SSIM between overlapping registered frames with distance between 1 (consecutive) and 5. Each boxplot shows SSIM results of all frame pairs in a video with specified distance. Lower values denote poorer methods.

**Fig. 10**
A comparison of mosaicking generated by fusion with and without loop closure on sequence 2 of the fetoscopy dataset.

See this image and copyright information in PMC

References

1. Allan M., Mcleod J., Wang C.C., Rosenthal J.C., Fu K.X., Zeffiro T., Xia W., Zhanshi Z., Luo H., Zhang X., et al. 2021. Stereo correspondence and reconstruction of endoscopic data challenge. arXiv preprint arXiv:2101.01133.
1. Bano S., Vasconcelos F., Amo M.T., Dwyer G., Gruijthuijsen C., Deprest J., Ourselin S., Vander Poorten E., Vercauteren T., Stoyanov D. Proc. Int. Conf. on Medical Image Computing and Computer-Assisted Intervention. Springer; 2019. Deep sequential mosaicking of fetoscopic videos; pp. 311–319.
1. Bano S., Vasconcelos F., Shepherd L.M., Vander Poorten E., Vercauteren T., Ourselin S., David A.L., Deprest J., Stoyanov D. Proc. Int. Conf. on Medical Image Computing and Computer-Assisted Intervention. Springer; 2020. Deep placental vessel segmentation for fetoscopic mosaicking; pp. 763–773.
1. Bartoli A. Groupwise geometric and photometric direct image registration. IEEE Trans. Pattern Anal. Mach. Intell. 2008;30(12):2098–2108. - PubMed
1. Baum Z.M., Hu Y., Barratt D.C. Real-time multimodal image registration with partial intraoperative point-set data. Med. Image Anal. 2021 - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

203145Z/16/Z/WT_/Wellcome Trust/United Kingdom

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Robust endoscopic image mosaicking via fusion of multimodal estimation

Affiliations

Robust endoscopic image mosaicking via fusion of multimodal estimation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical