Model-based learning using a mixture of mixtures of Gaussian and uniform distributions
- PMID: 22383342
- DOI: 10.1109/TPAMI.2011.199
Model-based learning using a mixture of mixtures of Gaussian and uniform distributions
Abstract
We introduce a mixture model whereby each mixture component is itself a mixture of a multivariate Gaussian distribution and a multivariate uniform distribution. Although this model could be used for model-based clustering (model-based unsupervised learning) or model-based classification (model-based semi-supervised learning), we focus on the more general model-based classification framework. In this setting, we fit our mixture models to data where some of the observations have known group memberships and the goal is to predict the memberships of observations with unknown labels. We also present a density estimation example. A generalized expectation-maximization algorithm is used to estimate the parameters and thereby give classifications in this mixture of mixtures model. To simplify the model and the associated parameter estimation, we suggest holding some parameters fixed-this leads to the introduction of more parsimonious models. A simulation study is performed to illustrate how the model allows for bursts of probability and locally higher tails. Two further simulation studies illustrate how the model performs on data simulated from multivariate Gaussian distributions and on data from multivariate t-distributions. This novel approach is also applied to real data and the performance of our approach under the various restrictions is discussed.
Similar articles
-
Gaussian mixture density modeling, decomposition, and applications.IEEE Trans Image Process. 1996;5(9):1293-302. doi: 10.1109/83.535841. IEEE Trans Image Process. 1996. PMID: 18285218
-
Generalized competitive learning of gaussian mixture models.IEEE Trans Syst Man Cybern B Cybern. 2009 Aug;39(4):901-9. doi: 10.1109/TSMCB.2008.2012119. Epub 2009 Apr 7. IEEE Trans Syst Man Cybern B Cybern. 2009. PMID: 19362913
-
Bayesian pixel classification using spatially variant finite mixtures and the generalized EM algorithm.IEEE Trans Image Process. 1998;7(7):1014-28. doi: 10.1109/83.701161. IEEE Trans Image Process. 1998. PMID: 18276317
-
An introduction to finite mixture distributions.Stat Methods Med Res. 1996 Jun;5(2):107-27. doi: 10.1177/096228029600500202. Stat Methods Med Res. 1996. PMID: 8817794 Review.
-
An overview of heavy-tail extensions of multivariate Gaussian distribution and their relations.J Appl Stat. 2022 Mar 2;49(13):3477-3494. doi: 10.1080/02664763.2022.2044018. eCollection 2022. J Appl Stat. 2022. PMID: 36213771 Free PMC article. Review.
Cited by
-
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data.Genomics Proteomics Bioinformatics. 2024 Jul 3;22(2):qzae014. doi: 10.1093/gpbjnl/qzae014. Genomics Proteomics Bioinformatics. 2024. PMID: 39049508 Free PMC article.
-
A Rough Set Bounded Spatially Constrained Asymmetric Gaussian Mixture Model for Image Segmentation.PLoS One. 2017 Jan 3;12(1):e0168449. doi: 10.1371/journal.pone.0168449. eCollection 2017. PLoS One. 2017. PMID: 28045950 Free PMC article.
-
Detection of pulmonary nodules in CT images based on fuzzy integrated active contour model and hybrid parametric mixture model.Comput Math Methods Med. 2013;2013:515386. doi: 10.1155/2013/515386. Epub 2013 Apr 16. Comput Math Methods Med. 2013. PMID: 23690876 Free PMC article.
-
DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations.BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):476. doi: 10.1186/s12859-016-1334-9. BMC Bioinformatics. 2016. PMID: 28155641 Free PMC article.
-
Identification of biomarkers associated with diagnosis of postmenopausal osteoporosis patients based on bioinformatics and machine learning.Front Genet. 2023 Jul 3;14:1198417. doi: 10.3389/fgene.2023.1198417. eCollection 2023. Front Genet. 2023. PMID: 37465165 Free PMC article.
Publication types
LinkOut - more resources
Full Text Sources