. 2021 Mar 9;11(1):5471.

doi: 10.1038/s41598-021-84630-x.

Deep learning classification of lung cancer histology using CT images

Tafadzwa L Chaunzwa^{1

2

3}, Ahmed Hosny^{4

5}, Yiwen Xu^{4

5}, Andrea Shafer⁶, Nancy Diao⁶, Michael Lanuti⁷, David C Christiani^{6

8}, Raymond H Mak^{4

5}, Hugo J W L Aerts^{9

10

11

12}

Affiliations

¹ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
² Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
³ Howard Hughes Medical Institute, Chevy Chase, MD, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
⁴ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
⁵ Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA.
⁶ Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁷ Division of Thoracic Surgery, Massachusetts General Hospital, Boston, MA, USA.
⁸ Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.
⁹ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹⁰ Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹¹ Department of Radiology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹² Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands. hugo_aerts@dfci.harvard.edu.

PMID: 33727623
PMCID: PMC7943565
DOI: 10.1038/s41598-021-84630-x

Deep learning classification of lung cancer histology using CT images

Tafadzwa L Chaunzwa et al. Sci Rep. 2021.

. 2021 Mar 9;11(1):5471.

doi: 10.1038/s41598-021-84630-x.

Authors

Affiliations

¹ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
² Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
³ Howard Hughes Medical Institute, Chevy Chase, MD, USA. tafadzwa_chaunzwa@dfci.harvard.edu.
⁴ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
⁵ Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA.
⁶ Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁷ Division of Thoracic Surgery, Massachusetts General Hospital, Boston, MA, USA.
⁸ Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.
⁹ Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹⁰ Department of Radiation Oncology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹¹ Department of Radiology, Dana Farber Cancer Institute and Brigham and Women's Hospital, Boston, MA, USA. hugo_aerts@dfci.harvard.edu.
¹² Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands. hugo_aerts@dfci.harvard.edu.

PMID: 33727623
PMCID: PMC7943565
DOI: 10.1038/s41598-021-84630-x

Abstract

Tumor histology is an important predictor of therapeutic response and outcomes in lung cancer. Tissue sampling for pathologist review is the most reliable method for histology classification, however, recent advances in deep learning for medical image analysis allude to the utility of radiologic data in further describing disease characteristics and for risk stratification. In this study, we propose a radiomics approach to predicting non-small cell lung cancer (NSCLC) tumor histology from non-invasive standard-of-care computed tomography (CT) data. We trained and validated convolutional neural networks (CNNs) on a dataset comprising 311 early-stage NSCLC patients receiving surgical treatment at Massachusetts General Hospital (MGH), with a focus on the two most common histological types: adenocarcinoma (ADC) and Squamous Cell Carcinoma (SCC). The CNNs were able to predict tumor histology with an AUC of 0.71(p = 0.018). We also found that using machine learning classifiers such as k-nearest neighbors (kNN) and support vector machine (SVM) on CNN-derived quantitative radiomics features yielded comparable discriminative performance, with AUC of up to 0.71 (p = 0.017). Our best performing CNN functioned as a robust probabilistic classifier in heterogeneous test sets, with qualitatively interpretable visual explanations to its predictions. Deep learning based radiomics can identify histological phenotypes in lung cancer. It has the potential to augment existing approaches and serve as a corrective aid for diagnosticians.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Figure 1**
Dataset breakdown for model A and model B. Patients were categorized into three groups; ADC, SCC, and an “Other” category that comprised all other NSCLC histology subtypes. Similar to data presented in Table 2 for model A, model B was fine-tuned using the same BLCS dataset, but with the inclusion of all other histology types. This translated to a tuning-set with 120 ADC, 52 SCC, and 56 patients with “Other” histology types, and a test-set with 35 ADC, 16 SCC, and 32 patients with “Other” histology types (summarized in Fig. 1).

**Figure 2**
Experimental design. A convolutional neural network (VGG16) developed by the visual geometry group at Oxford (13) and pre-trained on the large ImageNet dataset of more than 14 million hand-annotated natural images is employed in this analytical study. *Model A* is fine-tuned using a sample of 172 patients with either adenocarcinoma or squamous cell carcinoma and is used to predict future cases of these histology types using a held-out test set of 51 patients with adenocarcinoma or squamous cell carcinoma only. This model is also used as a fixed feature extractor for the assessment of machine learning classifiers (kNN, SVM, Linear-SVM, RF). These quantitative radiographic features are derived from the last pooling and first fully connected layers, corresponding to 512-D and 4096-D vectors, respectively. *Model A* is also used as a probabilistic classifier of histology and tested on a held-out test-set of 83 cases containing all histology types, grouped into adenocarcinoma, squamous cell carcinoma, or other. *Model B* is the fully connected VGG16 network tuned with a heterogenous sample of 228 cases with all histology types, and has as its output 3 different histology types, tested on the 83-patient sample as illustrated.

**Figure 3**
*Model A* and B schematic; This convolutional neural network architecture is based on the VGG architecture. With our transfer learning approach, weights of the last convolutional and pooling layers were fine-tuned using radiographic data. Model A, tuned on adenocarcinoma and squamous cell carcinoma tuning-set, had two classes as output in the softmax layer, while Model B which was tuned on a dataset containing all histology types had 3 type outputs.

**Figure 4**
Discriminative performance of deep learning based radiomics models as represented by area under the ROC curve (AUC) scores. Model A, tuned with a dataset containing adenocarcinoma (ADC) and squamous cell carcinoma (SCC) only, displayed an AUC of 0.71 for the 51 patient ADC vs SCC test set, and Model B which was tuned with a dataset containing all histology types had AUC of 0.58 on a heterogenous test set of 83 patients (ADC vs SCC vs Other). Also shown are AUC scores for models combining deep learning derived feature maps with machine learning classifiers. When used on a 4096-D feature vector represented by the first fully connected layer in Model A with dimensionality reduction, the kNN model had an AUC of 0.71, Linear SVM model had AUC of 0.68, SVM model had AUC of 0.64, and RF had AUC of 0.57. When used on a 512-D feature vector, the kNN model had AUC of 0.64, Linear SVM model had AUC of 0.62, SVM model had AUC of 0.63, and RF had AUC of 0.61.

**Figure 5**
*Model A* as probabilistic classifier of non-small cell histology in 83 sample held-out test set containing all histology types. There is a statistically significant difference in predictions comparing all 3 histology groups: ADC, SCC, Other. Comparison of ADC vs SCC revealed a statistically significant difference with p-value of 0.003, while comparison of SCC vs Other had a p-value of p = 0.235, and ADC vs Other had a p-value of 0.355.

**Figure 6**
Gradient based class activation heat maps (Grad-CAM) for deep learning based *model A*. Visualization of image regions with the most discriminative value in type prediction as determined by the best performing convolutional neural network model. Here sample test input images are shown with overlaid activation contours, where red highlights regions with highest contribution and blue representing areas with the least value. The second and last convolutional layer in *model A* were used for generation of class activation maps as depicted by Fig. 3.

See this image and copyright information in PMC

References

1. Huang T, Li J, Zhang C, Hong Q, Jiang D, Ye M, Duan S. Distinguishing lung adenocarcinoma from lung squamous cell carcinoma by two hypomethylated and three hypermethylated genes: a meta-analysis. PLoS ONE. 2016;11:e0149088. doi: 10.1371/journal.pone.0149088. - DOI - PMC - PubMed
1. Davidson MR, Gazdar AF, Clarke BE. The pivotal role of pathology in the management of lung cancer. J. Thorac. Dis. 2013;5(Suppl 5):S463–S478. - PMC - PubMed
1. Kasraeian S, Allison DC, Ahlmann ER, Fedenko AN, Menendez LR. A comparison of fine-needle aspiration, core biopsy, and surgical biopsy in the diagnosis of extremity soft tissue masses. Clin. Orthop. Relat. Res. 2010;468:2992–3002. doi: 10.1007/s11999-010-1401-x. - DOI - PMC - PubMed
1. Ilié M, Hofman P. Pros: Can tissue biopsy be replaced by liquid biopsy? Transl. Lung Cancer Res. 2016;5:420–423. doi: 10.21037/tlcr.2016.08.06. - DOI - PMC - PubMed
1. Zhao B, Tan Y, Tsai W-Y, Qi J, Xie C, Lu L, Schwartz LH. Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci. Rep. 2016;6:23428. doi: 10.1038/srep23428. - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations
Medical
- MedlinePlus Consumer Health Information
- MedlinePlus Health Information
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Deep learning classification of lung cancer histology using CT images

Affiliations

Deep learning classification of lung cancer histology using CT images

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Research Materials