Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Narmin Ghaffari Laleh¹, Hannah Sophie Muti¹, Chiara Maria Lavinia Loeffler¹, Amelie Echle¹, Oliver Lester Saldanha¹, Faisal Mahmood², Ming Y Lu², Christian Trautwein¹, Rupert Langer³, Bastian Dislich⁴, Roman D Buelow⁵, Heike Irmgard Grabsch⁶, Hermann Brenner⁷, Jenny Chang-Claude⁸, Elizabeth Alwers⁹, Titus J Brinker¹⁰, Firas Khader¹¹, Daniel Truhn¹¹, Nadine T Gaisa⁵, Peter Boor⁵, Michael Hoffmeister⁹, Volkmar Schulz¹², Jakob Nikolas Kather¹³

Affiliations

¹ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany.
² Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
³ Institute of Pathology and Molecular Pathology, Kepler University Hospital, Johannes Kepler University Linz, Linz, Austria.
⁴ Institute of Pathology, University of Bern, Switzerland.
⁵ Institute of Pathology, University Hospital RWTH Aachen, Aachen, Germany.
⁶ Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, The Netherlands.; Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK.
⁷ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany; Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany; German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany.
⁸ Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany; Cancer Epidemiology Group, University Cancer Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
⁹ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany.
¹⁰ Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany.
¹¹ Department of Radiology, University Hospital RWTH Aachen, Aachen, Germany.
¹² Department of Physics of Molecular Imaging Systems, Experimental Molecular Imaging, RWTH Aachen University, Aachen, Germany; Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany; Comprehensive Diagnostic Center Aachen (CDCA), University Hospital Aachen, Aachen, Germany; Hyperion Hybrid Imaging Systems GmbH, Aachen, Germany.
¹³ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany; Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK; Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Technical University Dresden, Dresden, Germany. Electronic address: jkather@ukaachen.de.

PMID: 35588568
DOI: 10.1016/j.media.2022.102474

Free article

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Narmin Ghaffari Laleh et al. Med Image Anal. 2022 Jul.

Free article

. 2022 Jul:79:102474.

doi: 10.1016/j.media.2022.102474. Epub 2022 May 4.

Authors

Affiliations

¹ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany.
² Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
³ Institute of Pathology and Molecular Pathology, Kepler University Hospital, Johannes Kepler University Linz, Linz, Austria.
⁴ Institute of Pathology, University of Bern, Switzerland.
⁵ Institute of Pathology, University Hospital RWTH Aachen, Aachen, Germany.
⁶ Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, The Netherlands.; Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK.
⁷ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany; Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany; German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany.
⁸ Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany; Cancer Epidemiology Group, University Cancer Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
⁹ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany.
¹⁰ Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany.
¹¹ Department of Radiology, University Hospital RWTH Aachen, Aachen, Germany.
¹² Department of Physics of Molecular Imaging Systems, Experimental Molecular Imaging, RWTH Aachen University, Aachen, Germany; Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany; Comprehensive Diagnostic Center Aachen (CDCA), University Hospital Aachen, Aachen, Germany; Hyperion Hybrid Imaging Systems GmbH, Aachen, Germany.
¹³ Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany; Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK; Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Technical University Dresden, Dresden, Germany. Electronic address: jkather@ukaachen.de.

PMID: 35588568
DOI: 10.1016/j.media.2022.102474

Erratum in

Erratum to 'Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology' Medical Image Analysis, Volume 79, July 2022, 102474.
Ghaffari Laleh N, Muti HS, Loeffler CML, Echle A, Saldanha OL, Mahmood F, Lu MY, Trautwein C, Langer R, Dislich B, Buelow RD, Grabsch HI, Brenner H, Chang-Claude J, Alwers E, Brinker TJ, Khader F, Truhn D, Gaisa NT, Boor P, Hoffmeister M, Schulz V, Kather JN. Ghaffari Laleh N, et al. Med Image Anal. 2022 Nov;82:102622. doi: 10.1016/j.media.2022.102622. Epub 2022 Sep 18. Med Image Anal. 2022. PMID: 36130464 No abstract available.

Abstract

Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.

Keywords: Artificial intelligence; Computational pathology; Convolutional neural networks; Multiple-Instance Learning; Vision transformers; Weakly-supervised deep learning.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest JNK declares consulting services for Owkin, France and Panakeia, UK. TJB reports owning a company that develops mobile apps, outside the scope of the submitted work (Smart Health Heidelberg GmbH, Handschuhsheimer Landstr. 9/1, 69120 Heidelberg). No other potential conflicts of interest are reported by any of the authors.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Affiliations

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources