Singularities affect dynamics of learning in neuromanifolds
- PMID: 16595057
- DOI: 10.1162/089976606776241002
Singularities affect dynamics of learning in neuromanifolds
Abstract
The parameter spaces of hierarchical systems such as multilayer perceptrons include singularities due to the symmetry and degeneration of hidden units. A parameter space forms a geometrical manifold, called the neuromanifold in the case of neural networks. Such a model is identified with a statistical model, and a Riemannian metric is given by the Fisher information matrix. However, the matrix degenerates at singularities. Such a singular structure is ubiquitous not only in multilayer perceptrons but also in the gaussian mixture probability densities, ARMA time-series model, and many other cases. The standard statistical paradigm of the Cramér-Rao theorem does not hold, and the singularity gives rise to strange behaviors in parameter estimation, hypothesis testing, Bayesian inference, model selection, and in particular, the dynamics of learning from examples. Prevailing theories so far have not paid much attention to the problem caused by singularity, relying only on ordinary statistical theories developed for regular (nonsingular) models. Only recently have researchers remarked on the effects of singularity, and theories are now being developed. This article gives an overview of the phenomena caused by the singularities of statistical manifolds related to multilayer perceptrons and gaussian mixtures. We demonstrate our recent results on these problems. Simple toy models are also used to show explicit solutions. We explain that the maximum likelihood estimator is no longer subject to the gaussian distribution even asymptotically, because the Fisher information matrix degenerates, that the model selection criteria such as AIC, BIC, and MDL fail to hold in these models, that a smooth Bayesian prior becomes singular in such models, and that the trajectories of dynamics of learning are strongly affected by the singularity, causing plateaus or slow manifolds in the parameter space. The natural gradient method is shown to perform well because it takes the singular geometrical structure into account. The generalization error and the training error are studied in some examples.
Similar articles
-
Dynamics of learning near singularities in layered networks.Neural Comput. 2008 Mar;20(3):813-43. doi: 10.1162/neco.2007.12-06-414. Neural Comput. 2008. PMID: 18045020
-
Singularities in mixture models and upper bounds of stochastic complexity.Neural Netw. 2003 Sep;16(7):1029-38. doi: 10.1016/S0893-6080(03)00005-4. Neural Netw. 2003. PMID: 14692637
-
Dynamics of learning in multilayer perceptrons near singularities.IEEE Trans Neural Netw. 2008 Aug;19(8):1313-28. doi: 10.1109/TNN.2008.2000391. IEEE Trans Neural Netw. 2008. PMID: 18701364
-
Statistical decision theory to relate neurons to behavior in the study of covert visual attention.Vision Res. 2009 Jun;49(10):1097-128. doi: 10.1016/j.visres.2008.12.008. Epub 2009 Jan 10. Vision Res. 2009. PMID: 19138699 Review.
-
A free energy principle for the brain.J Physiol Paris. 2006 Jul-Sep;100(1-3):70-87. doi: 10.1016/j.jphysparis.2006.10.001. Epub 2006 Nov 13. J Physiol Paris. 2006. PMID: 17097864 Review.
Cited by
-
Redundancy in synaptic connections enables neurons to learn optimally.Proc Natl Acad Sci U S A. 2018 Jul 17;115(29):E6871-E6879. doi: 10.1073/pnas.1803274115. Epub 2018 Jul 2. Proc Natl Acad Sci U S A. 2018. PMID: 29967182 Free PMC article.
-
Sensorimotor transformation via sparse coding.Sci Rep. 2015 Apr 29;5:9648. doi: 10.1038/srep09648. Sci Rep. 2015. PMID: 25923980 Free PMC article.
-
Adaptive stimulus optimization for sensory systems neuroscience.Front Neural Circuits. 2013 Jun 6;7:101. doi: 10.3389/fncir.2013.00101. eCollection 2013. Front Neural Circuits. 2013. PMID: 23761737 Free PMC article. Review.
-
Neural Field Continuum Limits and the Structure-Function Partitioning of Cognitive-Emotional Brain Networks.Biology (Basel). 2023 Feb 23;12(3):352. doi: 10.3390/biology12030352. Biology (Basel). 2023. PMID: 36979044 Free PMC article. Review.
LinkOut - more resources
Full Text Sources