Machine Learning and Imaging Informatics in Oncology

doi:10.1159/000493575

Review

. 2020;98(6):344-362.

doi: 10.1159/000493575. Epub 2018 Nov 23.

Machine Learning and Imaging Informatics in Oncology

Huan-Hsin Tseng¹, Lise Wei¹, Sunan Cui¹, Yi Luo¹, Randall K Ten Haken¹, Issam El Naqa²

Affiliations

¹ Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA.
² Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA, ielnaqa@med.umich.edu.

PMID: 30472716
PMCID: PMC6533165
DOI: 10.1159/000493575

Review

Machine Learning and Imaging Informatics in Oncology

Huan-Hsin Tseng et al. Oncology. 2020.

. 2020;98(6):344-362.

doi: 10.1159/000493575. Epub 2018 Nov 23.

Authors

Huan-Hsin Tseng¹, Lise Wei¹, Sunan Cui¹, Yi Luo¹, Randall K Ten Haken¹, Issam El Naqa²

Affiliations

¹ Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA.
² Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA, ielnaqa@med.umich.edu.

PMID: 30472716
PMCID: PMC6533165
DOI: 10.1159/000493575

Abstract

In the era of personalized and precision medicine, informatics technologies utilizing machine learning (ML) and quantitative imaging are witnessing a rapidly increasing role in medicine in general and in oncology in particular. This expanding role ranges from computer-aided diagnosis to decision support of treatments with the potential to transform the current landscape of cancer management. In this review, we aim to provide an overview of ML methodologies and imaging informatics techniques and their recent application in modern oncology. We will review example applications of ML in oncology from the literature, identify current challenges and highlight future potentials.

Keywords: Imaging informatics; Machine learning; Oncology.

PubMed Disclaimer

Conflict of interest statement

Disclosure Statement

The authors have no conflicts of interest to declare.

Figures

**Fig. 1.**
A schematic of the relation between AI, ML, Deep Learning, Big Data, and Data Science. It is noted that machine learning is a computational branch from AI that aims to provide computers with ability to perform tasks beyond their original programming such as data mining and big data analytics.

**Fig. 2.**
(a) (Left) An illustration of supervised learning using neural networks (right figure) classifying synthetic data of binary labels (blue and red scatter dots), where the nonlinear decision boundary is shown in white. (b) A multi-layer (deep) neural network with two hidden layers. The so-called deep learning usually refers to learning algorithms heavily relying on such computational units.

**Fig. 3.**
(a) [Left] An illustration of an unsupervised learning using p-SNE with open image database Olivetti faces, where similar images (same person) are clustered automatically without providing any identity information. (b) [Right (reprint permission granted)] Dawson et al. [13] demonstrated that PCA can be used to observe clinical data structure. In this case the data describing the xerostomia occurrences due to parotid gland dose distributions is linearly separable.

**Fig. 4.**
The structure of an CNN, usually consisting of three distinct layers: the convolution layer, the pooling layer, and a final fully-connected layer (Fig. 2(b)), where the convolution layer and pooling (subsampling) layer may be connected several times before a final fully-connected layer is encountered. An image mapped by a convolution layer is called a feature map, which triggers attention of many computer scientists. Figure created by Aphex34 distributed under a CC BY-SA 4.0 license (from Wikimedia Commons).

**Fig. 5.**
(a)[Left] The workflow of the model built by Vallieres et al. [41] The best combinations of radiomic features were selected in the training set, where these radiomic features were then combined with selected clinical variables in the training set. Independent prediction analysis was later performed in the testing set for all classifiers fully constructed in the training set. (b)[Right] Risk assessment of tumor outcomes in [41]. (1) Probability of occurrence of events for each patient of the testing set. The output probability of occurrence of events of random forests allows for risk stratification. (2) Kaplan-Meier curves of the testing set using a risk stratification into two groups as defined by a random forest output probability threshold of 0.5. All curves show significant prognostic performance. (3) Kaplan-Meier curves of the testing set using a risk stratification into three groups as defined by random forest output probability thresholds of 1/3 and 2/3.

**Fig. 6.**
Lesion classification pipeline based on diagnostic images. Two types of features are extracted from a medical image: (a) CNN features with pretrained CNN and (b) handcrafted features with conventional CADx. High and low-level features extracted by pretrained CNN are evaluated in terms of their classification performance and preprocessing requirements. Furthermore, the classifier outputs from the pooled CNN features and the handcrafted features are fused in the evaluation of a combination of the two types of features. [permissions required!!]

**Fig. 7.**
One proposed framework for cancer metastases detection by Wang et al. [61] who won the first prize in Camelyon16 cancer detection competition [9]. The model was based on deep CNNs, GoogLeNet of 27 layers.

See this image and copyright information in PMC

Cited by

Large-Scale Integration of DICOM Metadata into HL7-FHIR for Medical Research.
Iancu A, Bauer J, May MS, Prokosch HU, Dörfler A, Uder M, Kapsner LA. Iancu A, et al. Methods Inf Med. 2024 Sep;63(3-04):77-84. doi: 10.1055/a-2521-4250. Epub 2025 Apr 15. Methods Inf Med. 2024. PMID: 40233823 Free PMC article.
Using Machine Learning Approaches on Dynamic Patient-Reported Outcomes to Cluster Cancer Treatment-Related Symptoms.
Asper N, Witschel HF, von Stockar L, Laurenzi E, Kolberg HC, Vetter M, Roth S, Kullak-Ublick G, Trojan A. Asper N, et al. Curr Oncol. 2025 Jun 6;32(6):334. doi: 10.3390/curroncol32060334. Curr Oncol. 2025. PMID: 40558277 Free PMC article.
Leveraging the Academic Artificial Intelligence Silecosystem to Advance the Community Oncology Enterprise.
McDonnell KJ. McDonnell KJ. J Clin Med. 2023 Jul 21;12(14):4830. doi: 10.3390/jcm12144830. J Clin Med. 2023. PMID: 37510945 Free PMC article.
Deep Neural Networks and Transfer Learning on a Multivariate Physiological Signal Dataset.
Bizzego A, Gabrieli G, Esposito G. Bizzego A, et al. Bioengineering (Basel). 2021 Mar 6;8(3):35. doi: 10.3390/bioengineering8030035. Bioengineering (Basel). 2021. PMID: 33800842 Free PMC article.
Oncology Informatics: Status Quo and Outlook.
Putora PM, Baudis M, Beadle BM, El Naqa I, Giordano FA, Nicolay NH. Putora PM, et al. Oncology. 2020;98(6):329-331. doi: 10.1159/000507586. Epub 2020 May 14. Oncology. 2020. PMID: 32408309 Free PMC article. Review.

See all "Cited by" articles

References

1. Mitchell TM et al., “Machine learning. wcb,” 1997.
1. Huang S-H and Pan Y-C, “Automated visual inspection in the semiconductor industry: A survey,” Computers in Industry, vol. 66, pp. 1 – 10, 2015. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0166361514001845
1. LLC G. (2018) Google cloud speech-to-text enables developers to convert audio to text by applying powerful neural network models in an easy to use api. [Online]. Available: https://cloud.google.com/speech-to-text/
1. Ciocca S. (2017) How does spotify know you so well? [Online]. Available: https://medium.com/s/story/spotifys-discover-weekly-how-machine-learning...
1. Kooti F, Grbovic M, Aiello LM, Djuric N, Radosavljevic V, and Lerman K, “Analyzing uber’s ride-sharing economy,” in Proceedings of the 26th International Conference on World Wide Web Companion International World Wide Web Conferences Steering Committee, 2017, pp. 574–582.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

[1] Mitchell TM et al., “Machine learning. wcb,” 1997.

[2] Mitchell TM et al., “Machine learning. wcb,” 1997.

[3] Huang S-H and Pan Y-C, “Automated visual inspection in the semiconductor industry: A survey,” Computers in Industry, vol. 66, pp. 1 – 10, 2015. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0166361514001845

[4] Huang S-H and Pan Y-C, “Automated visual inspection in the semiconductor industry: A survey,” Computers in Industry, vol. 66, pp. 1 – 10, 2015. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0166361514001845

[5] LLC G. (2018) Google cloud speech-to-text enables developers to convert audio to text by applying powerful neural network models in an easy to use api. [Online]. Available: https://cloud.google.com/speech-to-text/

[6] LLC G. (2018) Google cloud speech-to-text enables developers to convert audio to text by applying powerful neural network models in an easy to use api. [Online]. Available: https://cloud.google.com/speech-to-text/

[7] Ciocca S. (2017) How does spotify know you so well? [Online]. Available: https://medium.com/s/story/spotifys-discover-weekly-how-machine-learning...

[8] Ciocca S. (2017) How does spotify know you so well? [Online]. Available: https://medium.com/s/story/spotifys-discover-weekly-how-machine-learning...

[9] Kooti F, Grbovic M, Aiello LM, Djuric N, Radosavljevic V, and Lerman K, “Analyzing uber’s ride-sharing economy,” in Proceedings of the 26th International Conference on World Wide Web Companion International World Wide Web Conferences Steering Committee, 2017, pp. 574–582.

[10] Kooti F, Grbovic M, Aiello LM, Djuric N, Radosavljevic V, and Lerman K, “Analyzing uber’s ride-sharing economy,” in Proceedings of the 26th International Conference on World Wide Web Companion International World Wide Web Conferences Steering Committee, 2017, pp. 574–582.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine Learning and Imaging Informatics in Oncology

Affiliations

Machine Learning and Imaging Informatics in Oncology

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical