Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jan;37(1):316-325.
doi: 10.1109/TMI.2017.2758580. Epub 2017 Oct 2.

Multi-Instance Multi-Label Learning for Multi-Class Classification of Whole Slide Breast Histopathology Images

Multi-Instance Multi-Label Learning for Multi-Class Classification of Whole Slide Breast Histopathology Images

Caner Mercan et al. IEEE Trans Med Imaging. 2018 Jan.

Abstract

Digital pathology has entered a new era with the availability of whole slide scanners that create the high-resolution images of full biopsy slides. Consequently, the uncertainty regarding the correspondence between the image areas and the diagnostic labels assigned by pathologists at the slide level, and the need for identifying regions that belong to multiple classes with different clinical significances have emerged as two new challenges. However, generalizability of the state-of-the-art algorithms, whose accuracies were reported on carefully selected regions of interest (ROIs) for the binary benign versus cancer classification, to these multi-class learning and localization problems is currently unknown. This paper presents our potential solutions to these challenges by exploiting the viewing records of pathologists and their slide-level annotations in weakly supervised learning scenarios. First, we extract candidate ROIs from the logs of pathologists' image screenings based on different behaviors, such as zooming, panning, and fixation. Then, we model each slide with a bag of instances represented by the candidate ROIs and a set of class labels extracted from the pathology forms. Finally, we use four different multi-instance multi-label learning algorithms for both slide-level and ROI-level predictions of diagnostic categories in whole slide breast histopathology images. Slide-level evaluation using 5-class and 14-class settings showed average precision values up to 81% and 69%, respectively, under different weakly labeled learning scenarios. ROI-level predictions showed that the classifier could successfully perform multi-class localization and classification within whole slide images that were selected to include the full range of challenging diagnostic categories.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Viewing behavior of six different pathologists on a whole slide image with a size of 74896 × 75568 pixels. The time spent by each pathologist on different image areas is illustrated using the heat map given above the images. The unmarked regions represent unviewed areas, and overlays from dark gray to red and yellow represent increasing cumulative viewing times. The diagnostic labels assigned by each pathologist to this image are also shown.
Fig. 2
Fig. 2
Hierarchical mapping of 14 classes to 5. The mapping was designed by experienced pathologists [13]. The focus of data collection was to study ductal malignancies, so when only lobular carcinoma in situ or atypical lobular hyperplasia was present in a slide, it was put to the non-proliferative category.
Fig. 3
Fig. 3
ROI detection from the viewport logs. (a) Viewport log of a particular pathologist. The x-axis shows the log entry. The red, blue, and green bars represent the zoom level, displacement, and duration, respectively. (b) The rectangular regions visible on the pathologist’s screen during the selected actions are drawn on the actual image. A zoom peak is a red circle in (a) and a red rectangle in (b), a slow panning is a blue circle in (a) and a blue rectangle in (b), a fixation is a green circle in (a) and a green rectangle in (b). (c) Candidate ROIs resulting from the union of the selected actions.
Fig. 4
Fig. 4
Different learning scenarios in the context of whole slide breast histopathology. The input to a learning algorithm is the set of candidate ROIs obtained from the viewing logs of the pathologists and the diagnostic labels assigned to the whole slide. Different learning algorithms use these samples in different ways during training. The notation is defined in the text. The 5-class setting is shown, but we also use 14-class labels in the experiments.
Fig. 5
Fig. 5
Whole slide ROI-level classification examples. From left to right: original image; each 1200 × 1200 window is colored according to the class with the highest score (see Figure 2 for the colors of the classes); scores for individual classes using the color map show on the right. The consensus ROIs are shown using black rectangles. The consensus diagnosis for the case in the first row is atypical ductal hyperplasia, and the consensus diagnoses for the second and third rows are ductal carcinoma in situ.

References

    1. Gurcan MN, Boucheron LE, Can A, Madabhushi A, Rajpoot NM, Yener B. Histopathological image analysis: A review. IEEE Reviews in Biomedical Eng. 2009 Oct;2:147–171. - PMC - PubMed
    1. Jain RK, Mehta R, Dimitrov R, Larsson LG, Musto PM, Hodges KB, Ulbright TM, Hattab EM, Agaram N, Idrees MT, Badve S. Atypical ductal hyperplasia: interobserver and intraobserver variability. Modern Pathology. 2011;24:917–923. - PubMed
    1. Allison KH, Rendi MH, Peacock S, Morgan T, Elmore JG, Weaver DL. Histological features associated with diagnostic agreement in atypical ductal hyperplasia of the breast: Illustrative cases from the B-Path study. Histopathology. 2016;69:1028–1046. - PMC - PubMed
    1. Elmore JG, Longton GM, Carney PA, Geller BM, Onega T, Tosteson ANA, Nelson HD, Pepe MS, Allison KH, Schnitt SJ, O’Malley FP, Weaver DL. Diagnostic concordance among pathologists interpreting breast biopsy specimens. Journal of American Medical Association. 2015;313(11):1122–1132. - PMC - PubMed
    1. Doyle S, Feldman M, Tomaszewski J, Madabhushi A. A boosted bayesian multiresolution classifier for prostate cancer detection from digitized needle biopsies. IEEE Trans on Biomedical Eng. 2012 May;59(5):1205–1218. - PubMed

Publication types

MeSH terms