. 2021 May;8(3):031906.

doi: 10.1117/1.JMI.8.3.031906. Epub 2021 May 8.

Using deep neural networks and interpretability methods to identify gene expression patterns that predict radiomic features and histology in non-small cell lung cancer

Nova F Smedley^{1

2}, Denise R Aberle^{1

2}, William Hsu^{1

2

3}

Affiliations

¹ University of California, Los Angeles, Department of Radiological Sciences, Los Angeles, California, United States.
² University of California, Los Angeles, Department of Bioengineering, Los Angeles, California, United States.
³ University of California, Los Angeles, Bioinformatics Interdepartmental Program, Los Angeles, California, United States.

PMID: 33977113
PMCID: PMC8105647
DOI: 10.1117/1.JMI.8.3.031906

Using deep neural networks and interpretability methods to identify gene expression patterns that predict radiomic features and histology in non-small cell lung cancer

Nova F Smedley et al. J Med Imaging (Bellingham). 2021 May.

. 2021 May;8(3):031906.

doi: 10.1117/1.JMI.8.3.031906. Epub 2021 May 8.

Authors

Nova F Smedley^{1

2}, Denise R Aberle^{1

2}, William Hsu^{1

2

3}

Affiliations

¹ University of California, Los Angeles, Department of Radiological Sciences, Los Angeles, California, United States.
² University of California, Los Angeles, Department of Bioengineering, Los Angeles, California, United States.
³ University of California, Los Angeles, Bioinformatics Interdepartmental Program, Los Angeles, California, United States.

PMID: 33977113
PMCID: PMC8105647
DOI: 10.1117/1.JMI.8.3.031906

Abstract

Purpose: Integrative analysis combining diagnostic imaging and genomic information can uncover biological insights into lesions that are visible on radiologic images. We investigate techniques for interrogating a deep neural network trained to predict quantitative image (radiomic) features and histology from gene expression in non-small cell lung cancer (NSCLC). Approach: Using 262 training and 89 testing cases from two public datasets, deep feedforward neural networks were trained to predict the values of 101 computed tomography (CT) radiomic features and histology. A model interrogation method called gene masking was used to derive the learned associations between subsets of genes and a radiomic feature or histology class [adenocarcinoma (ADC), squamous cell, and other]. Results: Overall, neural networks outperformed other classifiers. In testing, neural networks classified histology with area under the receiver operating characteristic curves (AUCs) of 0.86 (ADC), 0.91 (squamous cell), and 0.71 (other). Classification performance of radiomics features ranged from 0.42 to 0.89 AUC. Gene masking analysis revealed new and previously reported associations. For example, hypoxia genes predicted histology ( $> 0.90 AUC$ ). Previously published gene signatures for classifying histology were also predictive in our model ( $> 0.80 AUC$ ). Gene sets related to the immune or cardiac systems and cell development processes were predictive ( $> 0.70 AUC$ ) of several different radiomic features. AKT signaling, tumor necrosis factor, and Rho gene sets were each predictive of tumor textures. Conclusions: This work demonstrates neural networks' ability to map gene expressions to radiomic features and histology types in NSCLC and to interpret the models to identify predictive genes associated with each feature or type.

Keywords: deep learning; gene expression; model interpretability; non-small cell lung cancer; radiogenomics.

PubMed Disclaimer

Figures

**Fig. 1**
An overview of this study’s approaches to (a) training and (b) interpretation radiogenomic neural networks.

**Fig. 2**
The architecture and hyperparameter tuning of a radiogenomic neural network.

**Fig. 3**
The ability of models to predict NSCLC histology and stage in (a) training and (b) testing. In training, models were evaluated using 10-fold CV, and models were compared using the mean AUC scores in CV. The top performing model was then retrained on the full training dataset and evaluated on the testing dataset. The testing performance scores are shown for each histology type and stage. (c)–(e) Gene masking of the histology neural network using gene sets from (c) published gene signatures for histology,^, (d) Hallmark (top five out of 50), and (e) Gene Ontology biological processes (GO.bp, top ten out of 7350). In (c)–(e), each column is a type of histology and each row is a gene set used to mask the trained model to inspect how well the model predicted a certain histology type. The color in a cell shows the model’s performance using a gene set to predict a histology type, where red denotes higher AUCs and purple denotes lower AUCs in the testing dataset. (b) The ability of the histology model to predict each histology type and the AUC score is based on using all genes in the gene expression profile. (c)–(e) The ability of a specific gene set in predicting a histology type. For more details on the gene sets used, see Sec. 2. Notation: lcc = gene signature for large cell carcinoma, scc = gene signature for squamous cell carcinoma, adc = gene signature for adenocarciomas, adc versus scc = gene signature for differentiating ADC from SCC.

**Fig. 4**
Radiogenomic modeling performance (a) between neural networks and other models in the training dataset. Neural network performance (b) in the training and testing datasets for the 13 radiomic features selected for further analysis. Train scores represent the averaged scores of the validation folds during 10-fold CV in the training dataset. The test scores are the model’s performance in the testing dataset after models were retrained on the full training dataset.

**Fig. 5**
Gene masking of the radiogenomics models with biological processes from GO. The top three gene sets ranked by test AUC for each radiomic feature are shown.

See this image and copyright information in PMC

Cited by

Radiogenomics: Hunting Down Liver Metastasis in Colorectal Cancer Patients.
de la Pinta C, Castillo ME, Collado M, Galindo-Pumariño C, Peña C. de la Pinta C, et al. Cancers (Basel). 2021 Nov 5;13(21):5547. doi: 10.3390/cancers13215547. Cancers (Basel). 2021. PMID: 34771709 Free PMC article. Review.
The effect of harmonization on the variability of PET radiomic features extracted using various segmentation methods.
Hosseini SA, Shiri I, Ghaffarian P, Hajianfar G, Avval AH, Seyfi M, Servaes S, Rosa-Neto P, Zaidi H, Ay MR. Hosseini SA, et al. Ann Nucl Med. 2024 Jul;38(7):493-507. doi: 10.1007/s12149-024-01923-7. Epub 2024 Apr 4. Ann Nucl Med. 2024. PMID: 38575814 Free PMC article.
Automated Prediction of Neoadjuvant Chemoradiotherapy Response in Locally Advanced Cervical Cancer Using Hybrid Model-Based MRI Radiomics.
Yang H, Xu Y, Dong M, Zhang Y, Gong J, Huang D, He J, Wei L, Huang S, Zhao L. Yang H, et al. Diagnostics (Basel). 2023 Dec 19;14(1):5. doi: 10.3390/diagnostics14010005. Diagnostics (Basel). 2023. PMID: 38201314 Free PMC article.
Introduction to Special Issue Imaging in Cancer Diagnosis.
Zanon C, Quaia E, Crimì F. Zanon C, et al. Tomography. 2024 Jan 15;10(1):101-104. doi: 10.3390/tomography10010009. Tomography. 2024. PMID: 38250955 Free PMC article.
Synergies of Radiomics and Transcriptomics in Lung Cancer Diagnosis: A Pilot Study.
Dovrou A, Bei E, Sfakianakis S, Marias K, Papanikolaou N, Zervakis M. Dovrou A, et al. Diagnostics (Basel). 2023 Feb 15;13(4):738. doi: 10.3390/diagnostics13040738. Diagnostics (Basel). 2023. PMID: 36832225 Free PMC article.

See all "Cited by" articles

References

1. Segal E., et al. , “Decoding global gene expression programs in liver cancer by noninvasive imaging,” Nat. Biotechnol. 25, 675–680 (2007).NABIF910.1038/nbt1306 - DOI - PubMed
1. Diehn M., et al. , “Identification of noninvasive imaging surrogates for brain tumor gene-expression modules,” Proc. Natl. Acad. Sci. U. S. A. 105, 5213–5218 (2008).10.1073/pnas.0801279105 - DOI - PMC - PubMed
1. Zinn P. O., et al. , “Radiogenomic mapping of edema/cellular invasion MRI-phenotypes in glioblastoma multiforme,” PLoS One 6, e25451 (2011).POLNCL10.1371/journal.pone.0025451 - DOI - PMC - PubMed
1. Colen R. R., et al. , “Imaging genomic mapping of an invasive MRI phenotype predicts patient outcome and metabolic dysfunction: a TCGA glioma phenotype research group project,” BMC Med. Genom. 7, 30 (2014).10.1186/1755-8794-7-30 - DOI - PMC - PubMed
1. Chang K., et al. , “Residual convolutional neural network for the determination of IDH status in low- and high-grade gliomas from MR imaging,” Clin. Cancer Res. 24, 1073–1081 (2018).10.1158/1078-0432.CCR-17-2236 - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Using deep neural networks and interpretability methods to identify gene expression patterns that predict radiomic features and histology in non-small cell lung cancer

Affiliations

Using deep neural networks and interpretability methods to identify gene expression patterns that predict radiomic features and histology in non-small cell lung cancer

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources