Review

Gut. 2020 Nov;69(11):2035-2045.

doi: 10.1136/gutjnl-2019-320466. Epub 2020 May 11.

Machine learning in GI endoscopy: practical guidance in how to interpret a novel field

Affiliations

¹ Department of Electrical Engineering, VCA Group, University of Technology Eindhoven, Eindhoven, Noord-Brabant, The Netherlands.
² Department of Gastroenterology and Hepatology, Amsterdam UMC-Locatie AMC, Amsterdam, North Holland, The Netherlands.
³ Department of Gastroenterology and Hepatology, Catharina Hospital, Eindhoven, The Netherlands.
⁴ Digestive Disease Center, Showa University Northern Yokohama Hospital, Yokohama, Kanagawa, Japan.
⁵ Division of Gastroenterology, Vancouver General Hospital, The University of British Columbia, Vancouver, British Columbia, Canada.
⁶ Department of Gastroenterology and Hepatology, Amsterdam UMC-Locatie AMC, Amsterdam, North Holland, The Netherlands j.j.bergman@amsterdamumc.nl.

^# Contributed equally.

PMID: 32393540
PMCID: PMC7569393
DOI: 10.1136/gutjnl-2019-320466

Fons van der Sommen et al. Gut. 2020 Nov.

Abstract

There has been a vast increase in GI literature focused on the use of machine learning in endoscopy. The relative novelty of this field poses a challenge for reviewers and readers of GI journals. To appreciate scientific quality and novelty of machine learning studies, understanding of the technical basis and commonly used techniques is required. Clinicians often lack this technical background, while machine learning experts may be unfamiliar with clinical relevance and implications for daily practice. Therefore, there is an increasing need for a multidisciplinary, international evaluation on how to perform high-quality machine learning research in endoscopy. This review aims to provide guidance for readers and reviewers of peer-reviewed GI journals to allow critical appraisal of the most relevant quality requirements of machine learning studies. The paper provides an overview of common trends and their potential pitfalls and proposes comprehensive quality requirements in six overarching themes: terminology, data, algorithm description, experimental setup, interpretation of results and machine learning in clinical practice.

Keywords: computerised image analysis; endoscopy; gastrointestinal endoscopy.

Conflict of interest statement

Competing interests: None declared.

Figures

**Figure 1**
Graphical display of overfitting of training data. In this figure, the leftmost panel displays data points of two classes, in which the class is indicated by the colour. The centre panel shows the same data including the prediction of a model trained on that data as the background colour. Overfitting is clearly visible as the model isolates points of the red class, rather than capturing the class as a whole. The rightmost panel shows the prediction of a different model as background colour. Although this model makes mistakes (red points can be seen on a blue background and vice versa), this model demonstrates better generalisation, as it captures the class distributions rather than individual points.
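
The contrast between the centre and rightmost panels can be reproduced with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the synthetic two-class data, the use of a k-nearest-neighbour vote as the model, and the values of `k`. With `k=1` the model memorises every training point, so training accuracy is perfect by construction; a larger `k` typically gives a smoother decision boundary that makes some training mistakes.

```python
import random

def make_data(n, rng):
    """Two overlapping 2-D classes: class 0 around (0, 0), class 1 around (1.5, 1.5)."""
    points = []
    for i in range(n):
        label = i % 2
        centre = 1.5 * label
        points.append(((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)), label))
    return points

def knn_predict(train, x, k):
    """Majority vote among the k training points closest to x (k should be odd)."""
    nearest = sorted(
        train, key=lambda p: (p[0][0] - x[0]) ** 2 + (p[0][1] - x[1]) ** 2
    )[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > k else 0

def accuracy(train, data, k):
    """Fraction of samples in `data` that the k-NN vote over `train` labels correctly."""
    return sum(knn_predict(train, x, k) == y for x, y in data) / len(data)

rng = random.Random(0)          # fixed seed so the sketch is repeatable
train = make_data(200, rng)     # data the model is fitted on
unseen = make_data(200, rng)    # data drawn from the same distribution, never fitted on

for k in (1, 15):
    print(f"k={k:2d}  train acc={accuracy(train, train, k):.2f}  "
          f"unseen acc={accuracy(train, unseen, k):.2f}")
```

Comparing the two printed rows shows the pattern the figure describes: a perfect score on the training data says little about how the model behaves on data it has not seen.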

**Figure 2**
Visualisation of training, validation and test set and overfitting, and their appropriate use. The training dataset is used to train the model, followed by validation. In case of unsatisfactory performance, the model is changed, retrained and again validated. In case of satisfactory performance, the model is then tested on a separate test set to evaluate model performance.
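
The workflow in this caption can be sketched as follows. The toy one-feature data, the 60/20/20 split fractions, the k-nearest-neighbour model and the candidate values of `k` are all illustrative assumptions, not the authors' setup. The point of the sketch is only the order of operations: candidate models are compared on the validation set, and the test set is used once, after the choice is fixed.

```python
import random

def split_indices(n, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle 0..n-1 once and cut it into disjoint train/validation/test index lists."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    test = idx[:n_test]
    val = idx[n_test:n_test + n_val]
    train = idx[n_test + n_val:]
    return train, val, test

def knn_accuracy(xs, ys, fit_idx, eval_idx, k):
    """Accuracy on eval_idx of a k-nearest-neighbour vote over the samples in fit_idx."""
    correct = 0
    for i in eval_idx:
        nearest = sorted(fit_idx, key=lambda j: abs(xs[j] - xs[i]))[:k]
        vote = 1 if 2 * sum(ys[j] for j in nearest) > k else 0
        correct += vote == ys[i]
    return correct / len(eval_idx)

# Toy data (assumption): one noisy feature, larger on average for class 1.
rng = random.Random(1)
ys = [i % 2 for i in range(300)]
xs = [rng.gauss(2.0 * y, 1.0) for y in ys]

train, val, test = split_indices(len(xs))

# Model selection: each candidate k plays the role of a "changed" model,
# fitted on the training set and judged on the validation set only.
best_k = max((1, 5, 15), key=lambda k: knn_accuracy(xs, ys, train, val, k))

# Final evaluation: the test set is touched once, after best_k is fixed.
test_acc = knn_accuracy(xs, ys, train, test, best_k)
print(f"selected k={best_k}, test accuracy={test_acc:.2f}")
```

Reporting the validation score of the selected model instead of `test_acc` would be optimistic, because that score was used to make the selection.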

**Figure 3**
Graphical display of fourfold cross-validation.
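
As a minimal sketch of how fourfold cross-validation partitions a dataset, the helper below (a hypothetical function written for this illustration, not part of any library) yields four train/validation index pairs so that every sample is used for validation exactly once. It assumes the samples are already in a suitable order and applies no shuffling or grouping, which is a simplification.

```python
def k_fold_indices(n_samples, k=4):
    """Yield (train_idx, val_idx) pairs; each sample lands in exactly one validation fold."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if f < n_samples % k else 0) for f in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size

folds = list(k_fold_indices(10, k=4))
for fold_no, (train_idx, val_idx) in enumerate(folds, start=1):
    print(f"fold {fold_no}: validate on {val_idx}, train on {len(train_idx)} samples")
```

In use, a model would be trained and scored once per fold, and the four validation scores averaged.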

**Figure 4**
Exemplary case of subtle Barrett’s neoplasia, delineated by three experts (yellow, blue and green). Parts of the lesion (‘the sweet spot’) are recognised by all experts (black), yet other parts are only recognised by one or two experts. Reprinted from Bergman J, de Groof AJ, Pech O, et al. An interactive web-based educational tool improves detection and delineation of Barrett's esophagus-related neoplasia. Gastroenterology 2019;156:1299-1308, with permission from Elsevier.
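
The 'sweet spot' idea can be expressed as a set operation on the experts' delineations. The sketch below is illustrative only: the three pixel sets are made-up rectangles, and representing a delineation as a set of `(row, col)` pixels is an assumption rather than the format used in the cited study.

```python
from collections import Counter

def agreement_regions(delineations):
    """Split delineated pixels into those marked by all experts and those marked by only some."""
    counts = Counter(pixel for mask in delineations for pixel in set(mask))
    n_experts = len(delineations)
    sweet_spot = {p for p, c in counts.items() if c == n_experts}
    partial = {p for p, c in counts.items() if c < n_experts}
    return sweet_spot, partial

# Three hypothetical expert delineations as sets of (row, col) pixels.
expert_a = {(r, c) for r in range(0, 4) for c in range(0, 4)}
expert_b = {(r, c) for r in range(1, 5) for c in range(1, 5)}
expert_c = {(r, c) for r in range(2, 6) for c in range(0, 4)}

sweet_spot, partial = agreement_regions([expert_a, expert_b, expert_c])
print(f"marked by all experts: {len(sweet_spot)} pixels")
print(f"marked by one or two experts: {len(partial)} pixels")
```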

Comment in

Challenging detection of hard-to-find gastric cancers with artificial intelligence-assisted endoscopy.
Murakami D, Yamato M, Amano Y, Tada T. Gut. 2021 Jun;70(6):1196-1198. doi: 10.1136/gutjnl-2020-322453. Epub 2020 Aug 18. PMID: 32816967.
