. 2019 Sep 15;35(18):3461-3467.

doi: 10.1093/bioinformatics/btz083.

Structured crowdsourcing enables convolutional segmentation of histology images

Mohamed Amgad¹, Habiba Elfandy², Hagar Hussein³, Lamees A Atteya⁴, Mai A T Elsebaie⁵, Lamia S Abo Elnasr⁶, Rokia A Sakr⁶, Hazem S E Salem⁵, Ahmed F Ismail⁷, Anas M Saad⁵, Joumana Ahmed³, Maha A T Elsebaie⁵, Mustafijur Rahman⁸, Inas A Ruhban⁹, Nada M Elgazar¹⁰, Yahya Alagha³, Mohamed H Osman¹¹, Ahmed M Alhusseiny¹⁰, Mariam M Khalaf¹², Abo-Alela F Younes⁵, Ali Abdulkarim³, Duaa M Younes⁵, Ahmed M Gadallah⁵, Ahmad M Elkashash³, Salma Y Fala¹³, Basma M Zaki¹³, Jonathan Beezley¹⁴, Deepak R Chittajallu¹⁴, David Manthey¹⁴, David A Gutman¹⁵, Lee A D Cooper^{1

16}

Affiliations

¹ Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA.
² Department of Pathology, National Cancer Institute, Cairo, Egypt.
³ Department of Medicine, Cairo University, Cairo, Egypt.
⁴ Egyptian Ministry of Health, Cairo, Egypt.
⁵ Department of Medicine, Ain Shams University, Cairo, Egypt.
⁶ Department of Medicine, Menoufia University, Menoufia, Egypt.
⁷ Department of Pathology, Medical Research Institute, Alexandria University, Alexandria, Egypt.
⁸ Department of Medicine, Chittagong University, Chittagong, Bangladesh.
⁹ Department of Medicine, Damascus University, Damascus, Syria.
¹⁰ Department of Medicine, Mansoura University, Mansoura, Egypt.
¹¹ Department of Medicine, Zagazig University, Zagazig, Egypt.
¹² Department of Medicine, Batterjee Medical College, Jeddah, Saudi Arabia.
¹³ Department of Medicine, Suez Canal University, Ismailia, Egypt.
¹⁴ Kitware Inc., Clifton Park, NY, USA.
¹⁵ Department of Neurology, Emory University School of Medicine, Atlanta, GA, USA.
¹⁶ Department of Biomedical Engineering, Emory University, Atlanta, GA, USA.

PMID: 30726865
PMCID: PMC6748796
DOI: 10.1093/bioinformatics/btz083

Structured crowdsourcing enables convolutional segmentation of histology images

Mohamed Amgad et al. Bioinformatics. 2019.

. 2019 Sep 15;35(18):3461-3467.

doi: 10.1093/bioinformatics/btz083.

Authors

Affiliations

¹ Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA.
² Department of Pathology, National Cancer Institute, Cairo, Egypt.
³ Department of Medicine, Cairo University, Cairo, Egypt.
⁴ Egyptian Ministry of Health, Cairo, Egypt.
⁵ Department of Medicine, Ain Shams University, Cairo, Egypt.
⁶ Department of Medicine, Menoufia University, Menoufia, Egypt.
⁷ Department of Pathology, Medical Research Institute, Alexandria University, Alexandria, Egypt.
⁸ Department of Medicine, Chittagong University, Chittagong, Bangladesh.
⁹ Department of Medicine, Damascus University, Damascus, Syria.
¹⁰ Department of Medicine, Mansoura University, Mansoura, Egypt.
¹¹ Department of Medicine, Zagazig University, Zagazig, Egypt.
¹² Department of Medicine, Batterjee Medical College, Jeddah, Saudi Arabia.
¹³ Department of Medicine, Suez Canal University, Ismailia, Egypt.
¹⁴ Kitware Inc., Clifton Park, NY, USA.
¹⁵ Department of Neurology, Emory University School of Medicine, Atlanta, GA, USA.
¹⁶ Department of Biomedical Engineering, Emory University, Atlanta, GA, USA.

PMID: 30726865
PMCID: PMC6748796
DOI: 10.1093/bioinformatics/btz083

Abstract

Motivation: While deep-learning algorithms have demonstrated outstanding performance in semantic image segmentation tasks, large annotation datasets are needed to create accurate models. Annotation of histology images is challenging due to the effort and experience required to carefully delineate tissue structures, and difficulties related to sharing and markup of whole-slide images.

Results: We recruited 25 participants, ranging in experience from senior pathologists to medical students, to delineate tissue regions in 151 breast cancer slides using the Digital Slide Archive. Inter-participant discordance was systematically evaluated, revealing low discordance for tumor and stroma, and higher discordance for more subjectively defined or rare tissue classes. Feedback provided by senior participants enabled the generation and curation of 20 000+ annotated tissue regions. Fully convolutional networks trained using these annotations were highly accurate (mean AUC=0.945), and the scale of annotation data provided notable improvements in image classification accuracy.

Availability and implementation: Dataset is freely available at: https://goo.gl/cNM4EL.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

**Fig. 1.**
Study overview. (A) Slides from the TNBC cohort were reviewed for difficulty and the study coordinator selected a single representative ROI in each slide. (B) Participants were recruited on social media from medical student interest groups. Documentation and instructional videos were developed to train participants in breast cancer pathology and the use of DSA annotation tools. A spreadsheet lists slide-level descriptions of histologic features for each of the 151 images to aid in training. (C) Participants were each assigned six slides based on experience. Challenging slides were assigned to faculty/pathology residents, while standard slides were distributed among all participants. (D) The DSA was used by participants to draw the outlines of tissue regions in their assigned slides/ROIs. A Slack workspace enabled less experienced users to ask questions and receive guidance from the more experienced users. (E) Ten evaluation ROIs were identified in the slides and were annotated by all participants in an unsupervised manner to enable inter-participant comparisons. (F) Agreement between each pair of participants was evaluated using the Dice coefficient to generate an inter-participant discordance matrix

**Fig. 2.**
Screenshot of the DSA and HistomicsTK web interface. The main viewport allows panning and zooming within the slide. Annotations are grouped by class into layers (middle right panel) whose style properties like color and fill can be adjusted (bottom right panel). Other features include: controlling annotation transparency, an interactive mode to highlight individual annotations, and ability to download the WSI, regions of interest or annotations. Annotation properties can also be programmatically manipulated using the DSA API

**Fig. 3.**
Evaluation slide set concordance and model accuracy. (A) Inter-participant discordance matrices for SP, JP, NP and AL. (B) 2-D MDS plots of participant discordance. (**C, D**) Testing accuracy and confusion of comparison models trained on evaluation set ROIs from SPs (cyan) and NPs (magenta), measured against post-correction masks from the core set. Confusion matrix values are percentages relative to total pixel count. (Color version of this figure is available at *Bioinformatics online*.)

**Fig. 4.**
Model performance over the testing set. (A) Visualization of full semantic segmentation model predictions on testing set regions of interest. Color codes used: red (tumor); transparent (stroma); cyan (inflammatory infiltrates); yellow (necrosis). (B) Area under ROC curve for semantic segmentation algorithm, broken down by region class. (C) Effect of training sample size on scale-dependent patch classification models. Each point represents the macro-average AUC of a single model, trained on different sets of randomly selected slides. (Color version of this figure is available at *Bioinformatics online*.)

See this image and copyright information in PMC

References

1. Alialy R. et al. (2018) A review on the applications of crowdsourcing in human pathology. J. Pathol. Inform., 9, 2.. - PMC - PubMed
1. Fouad Y.A., Aanei C. (2017) Revisiting the hallmarks of cancer. Am. J. Cancer Res., 7, 1016–1036. - PMC - PubMed
1. Gutman D.A. et al. (2013) Cancer Digital Slide Archive: an informatics resource to support integrated in silico analysis of TCGA pathology data. J. Am. Med. Inform. Assoc., 20, 1091–1098. - PMC - PubMed
1. Gutman D.A. et al. (2017) The digital slide archive: a software platform for management, integration, and analysis of histology for cancer research. Cancer Res., 77, e75–e78. - PMC - PubMed
1. Hughes H. et al. (2018) Quanti.us: a tool for rapid, flexible, crowd-based annotation of images. Nat. Methods, 15, 587. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

U24 CA194362/CA/NCI NIH HHS/United States

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Structured crowdsourcing enables convolutional segmentation of histology images

Affiliations

Structured crowdsourcing enables convolutional segmentation of histology images

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical