Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024;33(4):1139-1149.
doi: 10.1080/10618600.2024.2304633. Epub 2024 Feb 9.

Functional Mixed Membership Models

Affiliations

Functional Mixed Membership Models

Nicholas Marco et al. J Comput Graph Stat. 2024.

Abstract

Mixed membership models, or partial membership models, are a flexible unsupervised learning method that allows each observation to belong to multiple clusters. In this paper, we propose a Bayesian mixed membership model for functional data. By using the multivariate Karhunen-Loève theorem, we are able to derive a scalable representation of Gaussian processes that maintains data-driven learning of the covariance structure. Within this framework, we establish conditional posterior consistency given a known feature allocation matrix. Compared to previous work on mixed membership models, our proposal allows for increased modeling flexibility, with the benefit of a directly interpretable mean and covariance structure. Our work is motivated by studies in functional brain imaging through electroencephalography (EEG) of children with autism spectrum disorder (ASD). In this context, our work formalizes the clinical notion of "spectrum" in terms of feature membership proportions. Supplementary materials, including proofs, are available online. The R package BayesFMMM is available to fit functional mixed membership models.

Keywords: Bayesian Methods; EEG; Functional Data; Mixed Membership Models; Neuroimaging.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Generative model illustration. (Left panel) Data generated under the functional clustering framework. (Right panel) Data generated under a mixed membership framework.
Fig. 2
Fig. 2
R-MISE values for the latent feature means and cross-covariances, as well as RMSE values for the allocation parameters, evaluated as sample size increases.
Fig. 3
Fig. 3
AIC, BIC, DIC, and the average log-likelihood evaluated for each of the 10 simulated data sets.
Fig. 4
Fig. 4
(Left Panel) Recovered means of model-based functional clustering with 4 clusters. (Right Panel) Alpha frequency patterns for a sample of EEG recordings from the T8 electrode of children (TD and ASD), styled by estimated cluster membership.
Fig. 5
Fig. 5
(Top Panel) Posterior median and 95% credible intervals (pointwise CI in dark gray and simultaneous CI in light gray) of the mean function for each functional feature. (Bottom Panel) Posterior median of each individual’s (feature-1)-membership, stratified by clinical cohort (cohort average (feature-1)-membership denoted by triangles).

References

    1. Azar Y, Fiat A, Karlin A, McSherry F and Saia J (2001), Spectral analysis of data, in ‘Proceedings of the thirty-third annual ACM symposium on Theory of computing’, pp. 619–626.
    1. Bhattacharya A and Dunson DB (2011), ‘Sparse bayesian infinite factor models’, Biometrika pp. 291–306. - PMC - PubMed
    1. Blei D, Ng A and Jordan M (2003), ‘Latent Dirichlet allocation’, JMLR.
    1. Broderick T, Pitman J and Jordan M (2013), ‘Feature allocations, probability functions, and paintboxes’, Bayesian Analysis 8(4), 801–836.
    1. Chen Y, He S, Yang Y and Liang F (2022), ‘Learning topic models: Identifiability and finite-sample analysis’, Journal of the American Statistical Association pp. 1–16. - PubMed

LinkOut - more resources