Dynamics of learning near singularities in layered networks
- PMID: 18045020
- DOI: 10.1162/neco.2007.12-06-414
Dynamics of learning near singularities in layered networks
Abstract
We explicitly analyze the trajectories of learning near singularities in hierarchical networks, such as multilayer perceptrons and radial basis function networks, which include permutation symmetry of hidden nodes, and show their general properties. Such symmetry induces singularities in their parameter space, where the Fisher information matrix degenerates and odd learning behaviors, especially the existence of plateaus in gradient descent learning, arise due to the geometric structure of singularity. We plot dynamic vector fields to demonstrate the universal trajectories of learning near singularities. The singularity induces two types of plateaus, the on-singularity plateau and the near-singularity plateau, depending on the stability of the singularity and the initial parameters of learning. The results presented in this letter are universally applicable to a wide class of hierarchical models. Detailed stability analysis of the dynamics of learning in radial basis function networks and multilayer perceptrons will be presented in separate work.
Similar articles
-
Dynamics of learning near singularities in radial basis function networks.Neural Netw. 2008 Sep;21(7):989-1005. doi: 10.1016/j.neunet.2008.06.017. Epub 2008 Jul 1. Neural Netw. 2008. PMID: 18693082
-
Singularities affect dynamics of learning in neuromanifolds.Neural Comput. 2006 May;18(5):1007-65. doi: 10.1162/089976606776241002. Neural Comput. 2006. PMID: 16595057
-
Dynamics of learning in multilayer perceptrons near singularities.IEEE Trans Neural Netw. 2008 Aug;19(8):1313-28. doi: 10.1109/TNN.2008.2000391. IEEE Trans Neural Netw. 2008. PMID: 18701364
-
Nonlinear complex-valued extensions of Hebbian learning: an essay.Neural Comput. 2005 Apr;17(4):779-838. doi: 10.1162/0899766053429381. Neural Comput. 2005. PMID: 15829090 Review.
-
Radial basis function neural networks for nonlinear Fisher discrimination and Neyman-Pearson classification.Neural Netw. 2003 Jun-Jul;16(5-6):529-35. doi: 10.1016/S0893-6080(03)00086-8. Neural Netw. 2003. PMID: 12850004 Review.
Cited by
-
High-dimensional dynamics of generalization error in neural networks.Neural Netw. 2020 Dec;132:428-446. doi: 10.1016/j.neunet.2020.08.022. Epub 2020 Sep 5. Neural Netw. 2020. PMID: 33022471 Free PMC article.
-
Adaptive stimulus optimization for sensory systems neuroscience.Front Neural Circuits. 2013 Jun 6;7:101. doi: 10.3389/fncir.2013.00101. eCollection 2013. Front Neural Circuits. 2013. PMID: 23761737 Free PMC article. Review.
-
A dynamical systems perspective on the relationship between symbolic and non-symbolic computation.Cogn Neurodyn. 2009 Dec;3(4):415-27. doi: 10.1007/s11571-009-9099-8. Epub 2009 Nov 7. Cogn Neurodyn. 2009. PMID: 19898957 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources