Parameter convergence and learning curves for neural networks
- PMID: 10085428
- DOI: 10.1162/089976699300016647
Abstract
We revisit the oft-studied asymptotic (in sample size) behavior of the parameter or weight estimate returned by any member of a large family of neural network training algorithms. By properly accounting for the characteristic property of neural networks that their empirical and generalization errors possess multiple minima, we rigorously establish conditions under which the parameter estimate converges strongly into the set of minima of the generalization error. Convergence of the parameter estimate to a particular value cannot be guaranteed under our assumptions. We then evaluate the asymptotic distribution of the distance between the parameter estimate and its nearest neighbor among the set of minima of the generalization error. Results on this question have appeared numerous times and generally assert asymptotic normality, the conclusion expected from familiar statistical arguments concerned with maximum likelihood estimators. These conclusions are usually reached on the basis of somewhat informal calculations, although, as we shall see, the situation is delicate. The preceding results then yield learning curves for the generalization and empirical errors, together with bounds on their rates of convergence.
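To make the abstract's claims concrete, here is a minimal numerical sketch (not from the paper; the toy model, parameter names, and hyperparameters are all illustrative assumptions). It trains a one-parameter model y = w**2 * x whose generalization error has two global minima, at w = +1 and w = -1, so the weight estimate can only be expected to converge to the set of minima rather than to a single value. It then prints how the distance from the trained weight to its nearest minimum shrinks as the sample size grows, a crude empirical learning curve.

import numpy as np

rng = np.random.default_rng(0)
MINIMA = np.array([1.0, -1.0])  # both w = +1 and w = -1 minimize the generalization error

def train(n, steps=2000, lr=0.05):
    """Fit w by gradient descent on the empirical squared error of y = w**2 * x."""
    x = rng.normal(size=n)
    y = x + 0.3 * rng.normal(size=n)                # data generated with w**2 = 1
    w = rng.choice(MINIMA) * rng.uniform(0.5, 1.5)  # start inside one of the two basins
    for _ in range(steps):
        resid = w ** 2 * x - y
        w -= lr * np.mean(2.0 * resid * 2.0 * w * x)  # gradient of mean(resid**2) in w
    return w

for n in (50, 200, 800, 3200):
    dists = [np.min(np.abs(train(n) - MINIMA)) for _ in range(20)]
    print(f"n={n:4d}  mean distance to nearest minimum: {np.mean(dists):.4f}")

Under these assumptions the printed distances decay roughly like n**-0.5, consistent in spirit with the asymptotic distribution the abstract discusses for the distance between the parameter estimate and its nearest minimum.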
Similar articles
- Window-based example selection in learning vector quantization. Neural Comput. 2010 Nov;22(11):2924-61. doi: 10.1162/NECO_a_00030. PMID: 20804387
- Upper bound of the expected training error of neural network regression for a Gaussian noise sequence. Neural Netw. 2001 Dec;14(10):1419-29. doi: 10.1016/s0893-6080(01)00122-8. PMID: 11771721
- Algebraic geometrical methods for hierarchical learning machines. Neural Netw. 2001 Oct;14(8):1049-60. doi: 10.1016/s0893-6080(01)00069-7. PMID: 11681750
- Minimization of error functionals over perceptron networks. Neural Comput. 2008 Jan;20(1):252-70. doi: 10.1162/neco.2008.20.1.252. PMID: 18045008
- Constructive training methods for feedforward neural networks with binary weights. Int J Neural Syst. 1996 May;7(2):149-66. doi: 10.1142/s0129065796000129. PMID: 8823625. Review.
Cited by
- Deterministic convergence of chaos injection-based gradient method for training feedforward neural networks. Cogn Neurodyn. 2015 Jun;9(3):331-40. doi: 10.1007/s11571-014-9323-z. Epub 2015 Jan 1. PMID: 25972981. Free PMC article.
- Analysis of the adsorption and retention models for Cd, Cr, Cu, Ni, Pb, and Zn through neural networks: selection of variables and competitive model. Environ Sci Pollut Res Int. 2018 Sep;25(25):25551-25564. doi: 10.1007/s11356-018-2101-4. Epub 2018 Jun 29. PMID: 29959735
- Generalization of learning by synchronous waves: from perceptual organization to invariant organization. Cogn Neurodyn. 2011 Jun;5(2):113-32. doi: 10.1007/s11571-010-9142-9. Epub 2010 Dec 10. PMID: 22654985. Free PMC article.