Targeted transfer learning to improve performance in small medical physics datasets

doi:10.1002/mp.14507

. 2020 Dec;47(12):6246-6256.

doi: 10.1002/mp.14507. Epub 2020 Oct 25.

Targeted transfer learning to improve performance in small medical physics datasets

Miguel Romero¹, Yannet Interian¹, Timothy Solberg², Gilmer Valdes²

Affiliations

¹ Master of Science in Data Science, University of San Francisco, San Francisco, CA, 94105, USA.
² Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, 94158, USA.

PMID: 33007112
DOI: 10.1002/mp.14507

Targeted transfer learning to improve performance in small medical physics datasets

Miguel Romero et al. Med Phys. 2020 Dec.

. 2020 Dec;47(12):6246-6256.

doi: 10.1002/mp.14507. Epub 2020 Oct 25.

Authors

Miguel Romero¹, Yannet Interian¹, Timothy Solberg², Gilmer Valdes²

Affiliations

¹ Master of Science in Data Science, University of San Francisco, San Francisco, CA, 94105, USA.
² Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, 94158, USA.

PMID: 33007112
DOI: 10.1002/mp.14507

Abstract

Purpose: To perform an in-depth evaluation of current state of the art techniques in training neural networks to identify appropriate approaches in small datasets.

Method: In total, 112,120 frontal-view X-ray images from the NIH ChestXray14 dataset were used in our analysis. Two tasks were studied: unbalanced multi-label classification of 14 diseases, and binary classification of pneumonia vs non-pneumonia. All datasets were randomly split into training, validation, and testing (70%, 10%, and 20%). Two popular convolution neural networks (CNNs), DensNet121 and ResNet50, were trained using PyTorch. We performed several experiments to test: (a) whether transfer learning using pretrained networks on ImageNet are of value to medical imaging/physics tasks (e.g., predicting toxicity from radiographic images after training on images from the internet), (b) whether using pretrained networks trained on problems that are similar to the target task helps transfer learning (e.g., using X-ray pretrained networks for X-ray target tasks), (c) whether freeze deep layers or change all weights provides an optimal transfer learning strategy, (d) the best strategy for the learning rate policy, and (e) what quantity of data is needed in order to appropriately deploy these various strategies (N = 50 to N = 77 880).

Results: In the multi-label problem, DensNet121 needed at least 1600 patients to be comparable to, and 10 000 to outperform, radiomics-based logistic regression. In classifying pneumonia vs non-pneumonia, both CNN and radiomics-based methods performed poorly when N < 2000. For small datasets ( < 2000), however, a significant boost in performance (>15% increase on AUC) comes from a good selection of the transfer learning dataset, dropout, cycling learning rate, and freezing and unfreezing of deep layers as training progresses. In contrast, if sufficient data are available (>35 000), little or no tweaking is needed to obtain impressive performance. While transfer learning using X-ray images from other anatomical sites improves performance, we also observed a similar boost by using pretrained networks from ImageNet. Having source images from the same anatomical site, however, outperforms every other methodology, by up to 15%. In this case, DL models can be trained with as little as N = 50.

Conclusions: While training DL models in small datasets (N < 2000) is challenging, no tweaking is necessary for bigger datasets (N > 35 000). Using transfer learning with images from the same anatomical site can yield remarkable performance in new tasks with as few as N = 50. Surprisingly, we did not find any advantage for using images from other anatomical sites over networks that have been trained using ImageNet. This indicates that features learned may not be as general as currently believed, and performance decays rapidly even by just changing the anatomical site of the images.

Keywords: deep learning; machine learning; small datasets.

PubMed Disclaimer

Cited by

Sensor-Location-Specific Joint Acquisition of Peripheral Artery Bioimpedance and Photoplethysmogram for Wearable Applications.
Metshein M, Abdullayev A, Gautier A, Larras B, Frappe A, Cardiff B, Annus P, Land R, Märtens O. Metshein M, et al. Sensors (Basel). 2023 Aug 11;23(16):7111. doi: 10.3390/s23167111. Sensors (Basel). 2023. PMID: 37631647 Free PMC article.
Omics-imaging signature-based nomogram to predict the progression-free survival of patients with hepatocellular carcinoma after transcatheter arterial chemoembolization.
Guan QL, Zhang HX, Gu JP, Cao GF, Ren WX. Guan QL, et al. World J Clin Cases. 2024 Jun 26;12(18):3340-3350. doi: 10.12998/wjcc.v12.i18.3340. World J Clin Cases. 2024. PMID: 38983440 Free PMC article.
Development of an AI-Assisted Embryo Selection System Using Iberian Ribbed Newts for Embryo-Fetal Development Toxicity Testing.
Saiki N, Adachi A, Ohnishi H, Koga A, Ueki M, Kohno K, Hayashi T, Ohbayashi T. Saiki N, et al. Yonago Acta Med. 2024 Aug 27;67(3):233-241. doi: 10.33160/yam.2024.08.011. eCollection 2024 Aug. Yonago Acta Med. 2024. PMID: 39193136 Free PMC article.
Segmentation and classification on chest radiography: a systematic survey.
Agrawal T, Choudhary P. Agrawal T, et al. Vis Comput. 2023;39(3):875-913. doi: 10.1007/s00371-021-02352-7. Epub 2022 Jan 8. Vis Comput. 2023. PMID: 35035008 Free PMC article.
Linear fine-tuning: a linear transformation based transfer strategy for deep MRI reconstruction.
Bi W, Xv J, Song M, Hao X, Gao D, Qi F. Bi W, et al. Front Neurosci. 2023 Jun 20;17:1202143. doi: 10.3389/fnins.2023.1202143. eCollection 2023. Front Neurosci. 2023. PMID: 37409107 Free PMC article.

See all "Cited by" articles

References

REFERENCES

1. Valdes G, Scheuermann R, Hung CY, et al. A mathematical framework for virtual IMRT QA using machine learning. Med Phys. 2016;43:4323-4334.
1. Valdes G, Chan MF, Lim SB, et al. IMRT QA using machine learning: a multi-institutional validation. J Appl Clin Med Phys. 2017;18:279-284.
1. Interian Y, Rideout V, Kearney VP, et al. Deep nets vs expert designed features in medical physics: An IMRT QA case study. Med Phys. 2018;45:2672-2680.
1. Valdes G, Morin O, Valenciaga Y, et al. Use of truebeam developer mode for imaging QA. J Appl Clin Med Phys. 2015;16:322-333.
1. Zhu X, Ge Y, Li T, et al. A planning quality evaluation tool for prostate adaptive IMRT based on machine learning. Med Phys. 2011;38:719-726.

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Ovid Technologies, Inc.
- Wiley

[1] Valdes G, Scheuermann R, Hung CY, et al. A mathematical framework for virtual IMRT QA using machine learning. Med Phys. 2016;43:4323-4334.

[2] Valdes G, Scheuermann R, Hung CY, et al. A mathematical framework for virtual IMRT QA using machine learning. Med Phys. 2016;43:4323-4334.

[3] Valdes G, Chan MF, Lim SB, et al. IMRT QA using machine learning: a multi-institutional validation. J Appl Clin Med Phys. 2017;18:279-284.

[4] Valdes G, Chan MF, Lim SB, et al. IMRT QA using machine learning: a multi-institutional validation. J Appl Clin Med Phys. 2017;18:279-284.

[5] Interian Y, Rideout V, Kearney VP, et al. Deep nets vs expert designed features in medical physics: An IMRT QA case study. Med Phys. 2018;45:2672-2680.

[6] Interian Y, Rideout V, Kearney VP, et al. Deep nets vs expert designed features in medical physics: An IMRT QA case study. Med Phys. 2018;45:2672-2680.

[7] Valdes G, Morin O, Valenciaga Y, et al. Use of truebeam developer mode for imaging QA. J Appl Clin Med Phys. 2015;16:322-333.

[8] Valdes G, Morin O, Valenciaga Y, et al. Use of truebeam developer mode for imaging QA. J Appl Clin Med Phys. 2015;16:322-333.

[9] Zhu X, Ge Y, Li T, et al. A planning quality evaluation tool for prostate adaptive IMRT based on machine learning. Med Phys. 2011;38:719-726.

[10] Zhu X, Ge Y, Li T, et al. A planning quality evaluation tool for prostate adaptive IMRT based on machine learning. Med Phys. 2011;38:719-726.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Targeted transfer learning to improve performance in small medical physics datasets

Affiliations

Targeted transfer learning to improve performance in small medical physics datasets

Authors

Affiliations

Abstract

Similar articles

Cited by

References

REFERENCES

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources