Noise injection for training artificial neural networks: a comparison with weight decay and early stopping
- PMID: 19928111
- PMCID: PMC2771718
- DOI: 10.1118/1.3213517
Abstract
The purpose of this study was to investigate the effect of a noise injection method on the "overfitting" problem of artificial neural networks (ANNs) in two-class classification tasks. The authors compared ANNs trained with noise injection to ANNs trained with two other methods for avoiding overfitting: weight decay and early stopping. They also evaluated an automatic algorithm for selecting the magnitude of the noise injection. They performed simulation studies of an exclusive-or classification task with training datasets of 50, 100, and 200 cases (half normal and half abnormal) and an independent testing dataset of 2000 cases. They also compared the methods using a breast ultrasound dataset of 1126 cases. For simulated training datasets of 50 cases, the area under the receiver operating characteristic curve (AUC) was greater (by 0.03) when training with noise injection than when training without any regularization, and the improvement was greater than those from weight decay and early stopping (both of 0.02). For training datasets of 100 cases, noise injection and weight decay yielded similar increases in the AUC (0.02), whereas early stopping produced a smaller increase (0.01). For training datasets of 200 cases, the increases in the AUC were negligibly small for all methods (0.005). For the ultrasound dataset, noise injection had a greater average AUC than ANNs trained without regularization and a slightly greater average AUC than ANNs trained with weight decay. These results indicate that training ANNs with noise injection can reduce overfitting to a greater degree than early stopping and to a similar degree as weight decay.
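The comparison described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the network architecture, the noise distribution, or the automatic noise-magnitude selection, so the example assumes zero-mean Gaussian noise added to the training inputs at every epoch, a small one-hidden-layer network, and hand-picked values for `noise_sigma` and `weight_decay`. The helper names (`make_xor`, `train_mlp`, `auc`) and the cluster layout of the simulated data are hypothetical. Early stopping is omitted for brevity.

```python
import numpy as np

def make_xor(n, rng, spread=0.5):
    """Simulated exclusive-or task (cluster layout is an assumption)."""
    centers = np.array([[1.0, 1.0], [-1.0, -1.0], [1.0, -1.0], [-1.0, 1.0]])
    # Half of the cases per class, as in the abstract.
    idx = np.concatenate([rng.integers(0, 2, n // 2),
                          rng.integers(2, 4, n - n // 2)])
    X = centers[idx] + spread * rng.standard_normal((n, 2))
    y = (idx >= 2).astype(float)  # 1 when the two feature signs differ
    return X, y

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_mlp(X, y, rng, hidden=8, epochs=2000, lr=0.5,
              noise_sigma=0.0, weight_decay=0.0):
    """Full-batch gradient descent on a one-hidden-layer tanh network."""
    W1 = 0.5 * rng.standard_normal((X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = 0.5 * rng.standard_normal(hidden)
    b2 = 0.0
    for _ in range(epochs):
        # Noise injection (assumed Gaussian): fresh input noise each epoch.
        Xn = X + noise_sigma * rng.standard_normal(X.shape)
        H = np.tanh(Xn @ W1 + b1)
        p = sigmoid(H @ W2 + b2)
        g = (p - y) / len(y)                    # d(mean cross-entropy)/d(logit)
        gH = np.outer(g, W2) * (1.0 - H ** 2)   # backpropagate through tanh
        # Weight decay: L2 penalty gradient added to the weight gradients.
        gW2 = H.T @ g + weight_decay * W2
        gW1 = Xn.T @ gH + weight_decay * W1
        W2 = W2 - lr * gW2
        b2 = b2 - lr * g.sum()
        W1 = W1 - lr * gW1
        b1 = b1 - lr * gH.sum(axis=0)
    return lambda Z: sigmoid(np.tanh(Z @ W1 + b1) @ W2 + b2)

def auc(scores, y):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos, neg = scores[y == 1], scores[y == 0]
    diff = pos[:, None] - neg[None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size

# Train on 50 cases and test on 2000 independent cases, as in the abstract.
rng = np.random.default_rng(0)
X_train, y_train = make_xor(50, rng)
X_test, y_test = make_xor(2000, rng)
settings = [("no regularization", {}),
            ("noise injection", {"noise_sigma": 0.3}),   # assumed magnitude
            ("weight decay", {"weight_decay": 0.01})]    # assumed coefficient
for name, kwargs in settings:
    model = train_mlp(X_train, y_train, np.random.default_rng(1), **kwargs)
    print(name, round(float(auc(model(X_test), y_test)), 3))
```

The printed AUC values depend on the random seed and on the assumed hyperparameters, so a single run should not be expected to reproduce the differences reported in the abstract; averaging over many simulated training sets would be needed for that.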
Similar articles
- Feedforward backpropagation artificial neural networks for predicting mechanical responses in complex nonlinear structures: A study on a long bone. J Mech Behav Biomed Mater. 2022 Apr;128:105079. doi: 10.1016/j.jmbbm.2022.105079. Epub 2022 Jan 11. PMID: 35114570
- The superior fault tolerance of artificial neural network training with a fault/noise injection-based genetic algorithm. Protein Cell. 2016 Oct;7(10):735-748. doi: 10.1007/s13238-016-0302-5. Epub 2016 Aug 9. PMID: 27502185. Free PMC article.
- Improving quantitative structure-activity relationship models using Artificial Neural Networks trained with dropout. J Comput Aided Mol Des. 2016 Feb;30(2):177-89. doi: 10.1007/s10822-016-9895-2. Epub 2016 Feb 1. PMID: 26830599. Free PMC article.
- Applications of artificial neural networks in medical science. Curr Clin Pharmacol. 2007 Sep;2(3):217-26. doi: 10.2174/157488407781668811. PMID: 18690868. Review.
- A Systematic Literature Review of the Successors of "NeuroEvolution of Augmenting Topologies". Evol Comput. 2021 Spring;29(1):1-73. doi: 10.1162/evco_a_00282. Epub 2020 Nov 5. PMID: 33151100
Cited by
- DCE-Qnet: Deep Network Quantification of Dynamic Contrast Enhanced (DCE) MRI. ArXiv [Preprint]. 2024 May 20:arXiv:2405.12360v1. Update in: MAGMA. 2024 Dec;37(6):1077-1090. doi: 10.1007/s10334-024-01189-0. PMID: 38827459. Free PMC article. Updated. Preprint.
- DCE-Qnet: deep network quantification of dynamic contrast enhanced (DCE) MRI. MAGMA. 2024 Dec;37(6):1077-1090. doi: 10.1007/s10334-024-01189-0. Epub 2024 Aug 8. PMID: 39112813. Free PMC article.
- An end-to-end AI-based framework for automated discovery of rapid CEST/MT MRI acquisition protocols and molecular parameter quantification (AutoCEST). Magn Reson Med. 2022 Jun;87(6):2792-2810. doi: 10.1002/mrm.29173. Epub 2022 Jan 28. PMID: 35092076. Free PMC article.
- Machine and deep learning methods for radiomics. Med Phys. 2020 Jun;47(5):e185-e202. doi: 10.1002/mp.13678. PMID: 32418336. Free PMC article. Review.
- Deep learning with robustness to missing data: A novel approach to the detection of COVID-19. PLoS One. 2021 Jul 30;16(7):e0255301. doi: 10.1371/journal.pone.0255301. eCollection 2021. PMID: 34329354. Free PMC article.