Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
- PMID: 26886976
- PMCID: PMC4890616
- DOI: 10.1109/TMI.2016.2528162
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
Abstract
Remarkable progress has been made in image recognition, primarily due to the availability of large-scale annotated datasets and deep convolutional neural networks (CNNs). CNNs enable learning data-driven, highly representative, hierarchical image features from sufficient training data. However, obtaining datasets as comprehensively annotated as ImageNet in the medical imaging domain remains a challenge. There are currently three major techniques that successfully employ CNNs to medical image classification: training the CNN from scratch, using off-the-shelf pre-trained CNN features, and conducting unsupervised CNN pre-training with supervised fine-tuning. Another effective method is transfer learning, i.e., fine-tuning CNN models pre-trained from natural image dataset to medical image tasks. In this paper, we exploit three important, but previously understudied factors of employing deep convolutional neural networks to computer-aided detection problems. We first explore and evaluate different CNN architectures. The studied models contain 5 thousand to 160 million parameters, and vary in numbers of layers. We then evaluate the influence of dataset scale and spatial image context on performance. Finally, we examine when and why transfer learning from pre-trained ImageNet (via fine-tuning) can be useful. We study two specific computer-aided detection (CADe) problems, namely thoraco-abdominal lymph node (LN) detection and interstitial lung disease (ILD) classification. We achieve the state-of-the-art performance on the mediastinal LN detection, and report the first five-fold cross-validation classification results on predicting axial CT slices with ILD categories. Our extensive empirical evaluation, CNN model analysis and valuable insights can be extended to the design of high performance CAD systems for other medical imaging tasks.
Figures













Similar articles
-
Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism.Med Image Anal. 2024 Jan;91:102988. doi: 10.1016/j.media.2023.102988. Epub 2023 Oct 13. Med Image Anal. 2024. PMID: 37924750 Free PMC article.
-
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3. J Digit Imaging. 2018. PMID: 29404850 Free PMC article.
-
A deep convolutional neural network architecture for interstitial lung disease pattern classification.Med Biol Eng Comput. 2020 Apr;58(4):725-737. doi: 10.1007/s11517-019-02111-w. Epub 2020 Jan 22. Med Biol Eng Comput. 2020. PMID: 31965407
-
Convolutional neural networks for computer-aided detection or diagnosis in medical image analysis: An overview.Math Biosci Eng. 2019 Jul 15;16(6):6536-6561. doi: 10.3934/mbe.2019326. Math Biosci Eng. 2019. PMID: 31698575 Review.
-
A review of convolutional neural network based methods for medical image classification.Comput Biol Med. 2025 Feb;185:109507. doi: 10.1016/j.compbiomed.2024.109507. Epub 2024 Dec 3. Comput Biol Med. 2025. PMID: 39631108 Review.
Cited by
-
Older Adult Fall Risk Prediction with Deep Learning and Timed Up and Go (TUG) Test Data.Bioengineering (Basel). 2024 Oct 5;11(10):1000. doi: 10.3390/bioengineering11101000. Bioengineering (Basel). 2024. PMID: 39451376 Free PMC article.
-
An artificial intelligence model for the simulation of visual effects in patients with visual field defects.Ann Transl Med. 2020 Jun;8(11):703. doi: 10.21037/atm.2020.02.162. Ann Transl Med. 2020. PMID: 32617323 Free PMC article.
-
Deep Transfer Learning Based Classification Model for COVID-19 Disease.Ing Rech Biomed. 2022 Apr;43(2):87-92. doi: 10.1016/j.irbm.2020.05.003. Epub 2020 May 20. Ing Rech Biomed. 2022. PMID: 32837678 Free PMC article.
-
Automated detection of dental restorations using deep learning on panoramic radiographs.Dentomaxillofac Radiol. 2022 Dec 1;51(8):20220244. doi: 10.1259/dmfr.20220244. Epub 2022 Sep 12. Dentomaxillofac Radiol. 2022. PMID: 36043433 Free PMC article.
-
An automated snoring sound classification method based on local dual octal pattern and iterative hybrid feature selector.Biomed Signal Process Control. 2021 Jan;63:102173. doi: 10.1016/j.bspc.2020.102173. Epub 2020 Sep 7. Biomed Signal Process Control. 2021. PMID: 32922509 Free PMC article.
References
-
- Deng J., et al. , “ImageNet: A large-scale hierarchical image database,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2009, pp. 248–255.
-
- Russakovsky O., et al. , “ImageNet large scale visual recognition challenge,” ArXiv:1409.0575, 2014.
-
- LeCun Y., Bottou L., Bengio Y., and Haffner P., “Gradient-based learning applied to document recognition,” Proc. IEEE, vol. 86, no. 11, pp. 2278–2324, Nov. 1998.
-
- Krizhevsky A., Sutskever I., and Hinton G. E., “ImageNet classification with deep convolutional neural networks,” Proc. NIPS, pp. 1097–1105, 2012.
-
- Krizhevsky A., Learning multiple layers of features from tiny images, M.S. thesisDept. Comp. Sci.Univ. Toronto, Toronto, Canada: 2009.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous