Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Nov;69(11):2035-2045.
doi: 10.1136/gutjnl-2019-320466. Epub 2020 May 11.

Machine learning in GI endoscopy: practical guidance in how to interpret a novel field

Affiliations
Review

Machine learning in GI endoscopy: practical guidance in how to interpret a novel field

Fons van der Sommen et al. Gut. 2020 Nov.

Abstract

There has been a vast increase in GI literature focused on the use of machine learning in endoscopy. The relative novelty of this field poses a challenge for reviewers and readers of GI journals. To appreciate scientific quality and novelty of machine learning studies, understanding of the technical basis and commonly used techniques is required. Clinicians often lack this technical background, while machine learning experts may be unfamiliar with clinical relevance and implications for daily practice. Therefore, there is an increasing need for a multidisciplinary, international evaluation on how to perform high-quality machine learning research in endoscopy. This review aims to provide guidance for readers and reviewers of peer-reviewed GI journals to allow critical appraisal of the most relevant quality requirements of machine learning studies. The paper provides an overview of common trends and their potential pitfalls and proposes comprehensive quality requirements in six overarching themes: terminology, data, algorithm description, experimental setup, interpretation of results and machine learning in clinical practice.

Keywords: computerised image analysis; endoscopy; gastrointesinal endoscopy.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None declared.

Figures

Figure 1
Figure 1
Graphical display of overfitting of training data. In this figure, the leftmost panel displays data points of two classes, in which the class is indicated by the colour. The centre panel shows the same data including the prediction of a model trained on that data as the background colour. Overfitting is clearly visible as the model isolates points of the red class, rather than capturing the class as a whole. The rightmost panel shows the prediction of a different model as background colour. Although this model makes mistakes (red points can be seen on a blue background and vice versa), this model demonstrates better generalisation, as it captures the class distributions rather than individual points.
Figure 2
Figure 2
Visualisation of training, validation and test set and overfitting, and their appropriate use. The training dataset is used to train the model, followed by validation. In case of unsatisfactory performance, the model is changed, retrained and again validated. In case of satisfactory performance, the model is then tested on a separate test set to evaluate model performance.
Figure 3
Figure 3
Graphical display of fourfold cross-validation.
Figure 4
Figure 4
Exemplary case of subtle Barrett’s neoplasia, delineated by three experts (yellow, blue and green). Parts of the lesion (‘the sweet spot’) are recognised by all experts (black), yet other parts are only recognised by one or two experts. Reprinted from Bergman J, de Groof AJ, Pech O, et al. An interactive web-based educational tool improves detection and delineation of Barrett's esophagus-related neoplasia. Gastroenterology 2019;156:1299-1308, with permission from Elsevier.

Comment in

References

    1. Ehteshami Bejnordi B, Veta M, Johannes van Diest P, et al. . Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 2017;318:2199–210. 10.1001/jama.2017.14585 - DOI - PMC - PubMed
    1. Ghafoorian M, Karssemeijer N, Heskes T, et al. . Location sensitive deep Convolutional neural networks for segmentation of white matter hyperintensities. Sci Rep 2017;7:5110. 10.1038/s41598-017-05300-5 - DOI - PMC - PubMed
    1. Ciompi F, Chung K, van Riel SJ, et al. . Towards automatic pulmonary nodule management in lung cancer screening with deep learning. Sci Rep 2017;7:46479. 10.1038/srep46479 - DOI - PMC - PubMed
    1. Lakhani P, Sundaram B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using Convolutional neural networks. Radiology 2017;284:574–82. 10.1148/radiol.2017162326 - DOI - PubMed
    1. Kooi T, Litjens G, van Ginneken B, et al. . Large scale deep learning for computer aided detection of mammographic lesions. Med Image Anal 2017;35:303–12. 10.1016/j.media.2016.07.007 - DOI - PubMed