Targeted transfer learning to improve performance in small medical physics datasets
- PMID: 33007112
- DOI: 10.1002/mp.14507
Targeted transfer learning to improve performance in small medical physics datasets
Abstract
Purpose: To perform an in-depth evaluation of current state of the art techniques in training neural networks to identify appropriate approaches in small datasets.
Method: In total, 112,120 frontal-view X-ray images from the NIH ChestXray14 dataset were used in our analysis. Two tasks were studied: unbalanced multi-label classification of 14 diseases, and binary classification of pneumonia vs non-pneumonia. All datasets were randomly split into training, validation, and testing (70%, 10%, and 20%). Two popular convolution neural networks (CNNs), DensNet121 and ResNet50, were trained using PyTorch. We performed several experiments to test: (a) whether transfer learning using pretrained networks on ImageNet are of value to medical imaging/physics tasks (e.g., predicting toxicity from radiographic images after training on images from the internet), (b) whether using pretrained networks trained on problems that are similar to the target task helps transfer learning (e.g., using X-ray pretrained networks for X-ray target tasks), (c) whether freeze deep layers or change all weights provides an optimal transfer learning strategy, (d) the best strategy for the learning rate policy, and (e) what quantity of data is needed in order to appropriately deploy these various strategies (N = 50 to N = 77 880).
Results: In the multi-label problem, DensNet121 needed at least 1600 patients to be comparable to, and 10 000 to outperform, radiomics-based logistic regression. In classifying pneumonia vs non-pneumonia, both CNN and radiomics-based methods performed poorly when N < 2000. For small datasets ( < 2000), however, a significant boost in performance (>15% increase on AUC) comes from a good selection of the transfer learning dataset, dropout, cycling learning rate, and freezing and unfreezing of deep layers as training progresses. In contrast, if sufficient data are available (>35 000), little or no tweaking is needed to obtain impressive performance. While transfer learning using X-ray images from other anatomical sites improves performance, we also observed a similar boost by using pretrained networks from ImageNet. Having source images from the same anatomical site, however, outperforms every other methodology, by up to 15%. In this case, DL models can be trained with as little as N = 50.
Conclusions: While training DL models in small datasets (N < 2000) is challenging, no tweaking is necessary for bigger datasets (N > 35 000). Using transfer learning with images from the same anatomical site can yield remarkable performance in new tasks with as few as N = 50. Surprisingly, we did not find any advantage for using images from other anatomical sites over networks that have been trained using ImageNet. This indicates that features learned may not be as general as currently believed, and performance decays rapidly even by just changing the anatomical site of the images.
Keywords: deep learning; machine learning; small datasets.
© 2020 American Association of Physicists in Medicine.
Similar articles
-
Deep Feature Learning from a Hospital-Scale Chest X-ray Dataset with Application to TB Detection on a Small-Scale Dataset.Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:4076-4079. doi: 10.1109/EMBC.2019.8856729. Annu Int Conf IEEE Eng Med Biol Soc. 2019. PMID: 31946767
-
Exploring deep learning radiomics for classifying osteoporotic vertebral fractures in X-ray images.Front Endocrinol (Lausanne). 2024 Mar 28;15:1370838. doi: 10.3389/fendo.2024.1370838. eCollection 2024. Front Endocrinol (Lausanne). 2024. PMID: 38606087 Free PMC article.
-
TEM virus images: Benchmark dataset and deep learning classification.Comput Methods Programs Biomed. 2021 Sep;209:106318. doi: 10.1016/j.cmpb.2021.106318. Epub 2021 Jul 29. Comput Methods Programs Biomed. 2021. PMID: 34375851
-
Deep learning for colon cancer histopathological images analysis.Comput Biol Med. 2021 Sep;136:104730. doi: 10.1016/j.compbiomed.2021.104730. Epub 2021 Aug 4. Comput Biol Med. 2021. PMID: 34375901 Review.
-
Machine learning-based approaches for distinguishing viral and bacterial pneumonia in paediatrics: A scoping review.Comput Methods Programs Biomed. 2025 Aug;268:108802. doi: 10.1016/j.cmpb.2025.108802. Epub 2025 May 8. Comput Methods Programs Biomed. 2025. PMID: 40349546
Cited by
-
Sensor-Location-Specific Joint Acquisition of Peripheral Artery Bioimpedance and Photoplethysmogram for Wearable Applications.Sensors (Basel). 2023 Aug 11;23(16):7111. doi: 10.3390/s23167111. Sensors (Basel). 2023. PMID: 37631647 Free PMC article.
-
Omics-imaging signature-based nomogram to predict the progression-free survival of patients with hepatocellular carcinoma after transcatheter arterial chemoembolization.World J Clin Cases. 2024 Jun 26;12(18):3340-3350. doi: 10.12998/wjcc.v12.i18.3340. World J Clin Cases. 2024. PMID: 38983440 Free PMC article.
-
Development of an AI-Assisted Embryo Selection System Using Iberian Ribbed Newts for Embryo-Fetal Development Toxicity Testing.Yonago Acta Med. 2024 Aug 27;67(3):233-241. doi: 10.33160/yam.2024.08.011. eCollection 2024 Aug. Yonago Acta Med. 2024. PMID: 39193136 Free PMC article.
-
Segmentation and classification on chest radiography: a systematic survey.Vis Comput. 2023;39(3):875-913. doi: 10.1007/s00371-021-02352-7. Epub 2022 Jan 8. Vis Comput. 2023. PMID: 35035008 Free PMC article.
-
Linear fine-tuning: a linear transformation based transfer strategy for deep MRI reconstruction.Front Neurosci. 2023 Jun 20;17:1202143. doi: 10.3389/fnins.2023.1202143. eCollection 2023. Front Neurosci. 2023. PMID: 37409107 Free PMC article.
References
REFERENCES
-
- Valdes G, Scheuermann R, Hung CY, et al. A mathematical framework for virtual IMRT QA using machine learning. Med Phys. 2016;43:4323-4334.
-
- Valdes G, Chan MF, Lim SB, et al. IMRT QA using machine learning: a multi-institutional validation. J Appl Clin Med Phys. 2017;18:279-284.
-
- Interian Y, Rideout V, Kearney VP, et al. Deep nets vs expert designed features in medical physics: An IMRT QA case study. Med Phys. 2018;45:2672-2680.
-
- Valdes G, Morin O, Valenciaga Y, et al. Use of truebeam developer mode for imaging QA. J Appl Clin Med Phys. 2015;16:322-333.
-
- Zhu X, Ge Y, Li T, et al. A planning quality evaluation tool for prostate adaptive IMRT based on machine learning. Med Phys. 2011;38:719-726.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources