Gastroenterology. 2020 Oct;159(4):1406-1416.e11.

doi: 10.1053/j.gastro.2020.06.021. Epub 2020 Jun 17.

Clinical-Grade Detection of Microsatellite Instability in Colorectal Tumors by Deep Learning

Amelie Echle¹, Heike Irmgard Grabsch², Philip Quirke³, Piet A van den Brandt⁴, Nicholas P West³, Gordon G A Hutchins³, Lara R Heij⁵, Xiuxiang Tan⁵, Susan D Richman³, Jeremias Krause¹, Elizabeth Alwers⁶, Josien Jenniskens⁴, Kelly Offermans⁴, Richard Gray⁷, Hermann Brenner⁸, Jenny Chang-Claude⁹, Christian Trautwein¹, Alexander T Pearson¹⁰, Peter Boor¹¹, Tom Luedde¹², Nadine Therese Gaisa¹¹, Michael Hoffmeister⁶, Jakob Nikolas Kather¹³

Affiliations

¹ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany.
² Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, The Netherlands; Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom.
³ Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom.
⁴ Department of Epidemiology, Maastricht University Medical Center+, Maastricht, The Netherlands.
⁵ Visceral and Transplant Surgery, University Hospital Rheinisch-Westfälische Technische Hochschule Aachen, Aachen, Germany; NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, the Netherlands; Institute of Pathology, University Hospital RWTH Aachen, Aachen, Germany.
⁶ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany.
⁷ Clinical Trial Service Unit, University of Oxford, Oxford, United Kingdom.
⁸ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany; Division of Preventive Oncology, German Cancer Research Center and National Center for Tumor Diseases, Heidelberg, Germany; German Cancer Consortium, German Cancer Research Center, Heidelberg, Germany.
⁹ Division of Cancer Epidemiology, German Cancer Research Center, Heidelberg, Germany; Cancer Epidemiology Group, University Cancer Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
¹⁰ Section of Hematology/Oncology, Department of Medicine, University of Chicago, Chicago, Illinois.
¹¹ Institute of Pathology, University Hospital RWTH Aachen, Aachen, Germany.
¹² Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany; Division of Gastroenterology, Hepatology, and Hepatobiliary Oncology, Aachen, Germany.
¹³ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany; Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom; German Cancer Consortium, German Cancer Research Center, Heidelberg, Germany; Medical Oncology, National Center for Tumor Diseases, University Hospital Heidelberg, Heidelberg, Germany. Electronic address: jkather@ukaachen.de.

PMID: 32562722
PMCID: PMC7578071
DOI: 10.1053/j.gastro.2020.06.021

Amelie Echle et al. Gastroenterology. 2020 Oct.

Abstract

Background & aims: Microsatellite instability (MSI) and mismatch-repair deficiency (dMMR) in colorectal tumors are used to select treatment for patients. Deep learning can detect MSI and dMMR in tumor samples on routine histology slides faster and less expensively than molecular assays. However, clinical application of this technology requires high performance and multisite validation, which have not yet been performed.

Methods: We collected H&E-stained slides and findings from molecular analyses for MSI and dMMR from 8836 colorectal tumors (of all stages) included in the MSIDETECT consortium study, from Germany, the Netherlands, the United Kingdom, and the United States. Specimens with dMMR were identified by immunohistochemistry analyses of tissue microarrays for loss of MLH1, MSH2, MSH6, and/or PMS2. Specimens with MSI were identified by genetic analyses. We trained a deep-learning detector to identify samples with MSI from these slides; performance was assessed by cross-validation (n = 6406 specimens) and validated in an external cohort (n = 771 specimens). Prespecified endpoints were the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC).
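
The two prespecified endpoints can be illustrated with a minimal sketch. It assumes one score per patient (higher meaning more likely MSI/dMMR) and binary ground-truth labels; the function names and the toy data are hypothetical and are not taken from the study. AUROC is computed here as the fraction of positive-negative pairs ranked correctly, and AUPRC as average precision, which is one common estimator and not necessarily the one the authors used.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Fraction of (positive, negative) pairs in which the positive
    receives the higher score; tied pairs count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def auprc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average precision: mean of the precision values observed at each
    rank where a true positive is retrieved (ties broken by input order)."""
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    hits = 0
    total = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / n_pos


# Hypothetical toy data: 3 MSI/dMMR patients among 8.
toy_labels = [1, 0, 1, 0, 0, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.6, 0.1]
toy_auroc = auroc(toy_labels, toy_scores)
toy_auprc = auprc(toy_labels, toy_scores)
```

A random classifier has an expected AUROC of 0.5 regardless of class balance, whereas its expected AUPRC equals the prevalence of the positive class, so AUPRC is the more informative of the two when positives are rare.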

Results: The deep-learning detector identified specimens with dMMR or MSI with a mean AUROC of 0.92 (lower bound, 0.91; upper bound, 0.93) and an AUPRC of 0.63 (range, 0.59-0.65), or 67% specificity and 95% sensitivity, in the cross-validation development cohort. In the validation cohort, the classifier identified samples with dMMR with an AUROC of 0.95 (range, 0.92-0.96) without image preprocessing and an AUROC of 0.96 (range, 0.93-0.98) after color normalization.
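
A sensitivity/specificity pair such as the one reported above corresponds to one operating point on the ROC curve. The sketch below shows, under assumptions, how such a point could be read off: pick the highest score threshold that still reaches a target sensitivity, then report the specificity at that threshold. The function names and toy data are hypothetical, and the study's actual thresholding procedure is not described in this abstract.

```python
import math
from typing import Sequence, Tuple


def sensitivity_specificity(
    labels: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Call a patient positive when score >= threshold, then compare
    against the ground-truth labels (both classes must be present)."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def threshold_for_sensitivity(
    labels: Sequence[int], scores: Sequence[float], target: float
) -> float:
    """Highest threshold at which sensitivity is at least `target`."""
    pos_scores = sorted(
        (s for y, s in zip(labels, scores) if y == 1), reverse=True
    )
    if not pos_scores:
        raise ValueError("need at least one positive label")
    # Number of positives that must be detected; the small epsilon guards
    # against floating-point overshoot in target * n.
    needed = max(1, math.ceil(target * len(pos_scores) - 1e-9))
    return pos_scores[needed - 1]


# Hypothetical toy data: 4 positives, 4 negatives.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.5, 0.2, 0.1]
toy_threshold = threshold_for_sensitivity(toy_labels, toy_scores, target=0.75)
toy_sens, toy_spec = sensitivity_specificity(toy_labels, toy_scores, toy_threshold)
```

Fixing sensitivity first and accepting whatever specificity results is a typical choice for a screening test, where a missed positive is costlier than an unnecessary confirmatory assay.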

Conclusions: We developed a deep-learning system that detects colorectal cancer specimens with dMMR or MSI using H&E-stained slides; it detected tissues with dMMR with an AUROC of 0.96 in a large, international validation cohort. This system might be used for high-throughput, low-cost evaluation of colorectal tissue specimens.

Keywords: Lynch syndrome; biomarker; cancer immunotherapy; mutation.

Conflict of interest statement

Disclosures: JNK has an informal, unpaid advisory role at Pathomix (Heidelberg, Germany) which does not relate to this research. JNK declares no other relationships or competing interests. All other authors declare no competing interests.

Figures

**Figure 1. Deep learning workflow and learning curves.**
(A) Histological routine images were collected from four large patient cohorts. All slides were manually quality-checked to ensure presence of tumor tissue (circled in black). (B) Tumor regions were automatically tessellated and a library of millions of non-normalized (native) image tiles was created. (C) The deep learning system was trained on increasing numbers of patients and evaluated on a random subset (n=906 patients). Performance initially increased by adding more patients to the training set, but reached a plateau at approximately 5000 patients. (D) Cross-validated experiment on the full international cohort (comprising TCGA, DACHS, QUASAR and NLCS). Receiver operating characteristic (ROC) with true positive rate (TPR) shown against false positive rate (FPR), area under the ROC curve (AUROC) is shown on top. (E) ROC curve (left) and precision-recall-curve (right) of the same classifier applied to a large external dataset. High test performance was maintained in this dataset and thus, the classifier generalized well beyond the training cohorts. Black line = average performance, shaded area = bootstrapped confidence interval, red line = random model (no skill).
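
The shaded confidence band mentioned in the caption is described as bootstrapped. A minimal sketch of a percentile bootstrap for AUROC follows, assuming patients are resampled with replacement and the statistic is recomputed on each resample. The number of resamples, the 95% level, the seed, and all names are assumptions for illustration; the caption does not specify them.

```python
import random
from typing import List, Sequence, Tuple


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise AUROC; tied positive-negative pairs count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auroc_ci(
    labels: Sequence[int],
    scores: Sequence[float],
    n_boot: int = 1000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile bootstrap interval for AUROC, resampling patients."""
    n = len(labels)
    if not 0 < sum(labels) < n:
        raise ValueError("need both classes to compute AUROC")
    rng = random.Random(seed)
    stats: List[float] = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        # Skip resamples that happen to contain a single class only.
        if 0 < sum(ys) < n:
            stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Hypothetical toy data.
toy_labels = [1, 0, 1, 0, 0, 0, 1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.6, 0.1, 0.35, 0.5]
toy_lower, toy_upper = bootstrap_auroc_ci(toy_labels, toy_scores, n_boot=200)
```

Resampling at the patient level, not the tile level, keeps all tiles of one patient together, which avoids an overly narrow interval.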

**Figure 2. Cross-validated subgroup analysis for detection of MSI and dMMR in the international cohort (n=6406 patients).**
AUC = area under the receiver operating characteristic curve as shown in the image, TPR = true positive rate, FPR = false positive rate, WT = wild type, MUT = mutated.

**Figure 3. Prediction map in the external test cohort YCR-BCIP-RESECT.**
(A-C) Representative images from the YCR-BCIP-RESECT test cohort labeled with immunohistochemically defined mismatch repair (MMR) status. (D-F) Corresponding deep learning prediction maps. The edge length of each prediction tile is 256 μm. (G-I) Higher magnification of regions highlighted in A-E. True MSI or dMMR patients were strongly and homogeneously predicted to be MSI or dMMR (such as the patient shown in A). True MSS or pMMR patients were overall predicted to be MSS or pMMR (such as the patients in B and C), but a pronounced heterogeneity was observed in necrotic areas, poorly differentiated areas and immune-infiltrated tumor areas at the invasive edge.
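
The tile edge length of 256 μm can be related to pixel coordinates with a short sketch. It assumes a scan resolution of 0.5 μm per pixel and a non-overlapping grid that keeps only tiles lying fully inside the image; both are illustrative assumptions, as are the function names, and none of them is stated in the caption.

```python
from typing import List, Tuple


def tile_edge_px(tile_um: float, microns_per_pixel: float) -> int:
    """Convert a physical tile edge length to pixels at a given resolution."""
    return round(tile_um / microns_per_pixel)


def tessellate(width_px: int, height_px: int, edge_px: int) -> List[Tuple[int, int]]:
    """Top-left corners of non-overlapping square tiles that fit entirely
    inside a width_px x height_px region, in row-major order."""
    return [
        (x, y)
        for y in range(0, height_px - edge_px + 1, edge_px)
        for x in range(0, width_px - edge_px + 1, edge_px)
    ]


# 256 um tiles at an assumed 0.5 um/pixel give 512-pixel tiles.
edge = tile_edge_px(256, 0.5)
# Hypothetical 1100 x 600 pixel region: two tiles fit side by side.
origins = tessellate(1100, 600, edge)
```

A prediction map is then simply the per-tile score drawn at each of these grid positions.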

**Figure 4. Effect of color normalization on classifier performance.**
(A) A representative set of tiles from the MSIDETECT study. (B) The same tiles after color normalization. (C) Classifier performance on an external test set (YCR-BCIP-RESECT, n=771 patients) improves after color-normalizing training and test sets. Experiment #4N is with color normalization, experiment #4 is without color normalization. AUROC: area under the receiver operating characteristic curve, TPR: true positive rate, FPR: false positive rate.
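
The caption does not name the color normalization method. Purely to illustrate the idea, the sketch below uses a deliberately simple stand-in: each RGB channel of a tile is shifted and scaled so that its mean and standard deviation match those of a reference. This is not presented as the method used in the study; the function names, the reference statistics, and the two-pixel toy tile are hypothetical.

```python
from statistics import mean, pstdev
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]
Stats = List[Tuple[float, float]]  # (mean, std) per channel


def channel_stats(pixels: Sequence[Pixel]) -> Stats:
    """Per-channel mean and population standard deviation of a tile."""
    return [(mean(c), pstdev(c)) for c in zip(*pixels)]


def normalize_tile(pixels: Sequence[Pixel], target: Stats) -> List[Pixel]:
    """Match each channel's mean and std to the target statistics,
    then round and clip to the valid 0..255 range."""
    source = channel_stats(pixels)
    out: List[Pixel] = []
    for px in pixels:
        new_px = []
        for value, (m_src, sd_src), (m_tgt, sd_tgt) in zip(px, source, target):
            # A flat channel has no spread to rescale; only shift its mean.
            scale = sd_tgt / sd_src if sd_src > 0 else 1.0
            new_value = (value - m_src) * scale + m_tgt
            new_px.append(min(255, max(0, round(new_value))))
        out.append((new_px[0], new_px[1], new_px[2]))
    return out


# Hypothetical two-pixel tile and hypothetical reference statistics.
toy_tile = [(100, 50, 200), (120, 70, 220)]
toy_target = [(200.0, 10.0), (100.0, 10.0), (150.0, 10.0)]
toy_normalized = normalize_tile(toy_tile, toy_target)
```

Whatever method is chosen, the caption's point is that it is applied to both training and test tiles, so that the classifier sees the same color distribution at training and at test time.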

Comment in

Colorectal Cancer: Microsatellite Instability/Mismatch Repair Testing in the Era of Digital Pathology.
Pollett A. Gastroenterology. 2020 Oct;159(4):1235-1237. doi: 10.1053/j.gastro.2020.08.008. Epub 2020 Aug 13. PMID: 32800777. No abstract available.
