Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec;75(4):1288-1298.
doi: 10.1111/biom.13064. Epub 2019 May 6.

Fast Bayesian inference in large Gaussian graphical models

Affiliations

Fast Bayesian inference in large Gaussian graphical models

Gwenaël G R Leday et al. Biometrics. 2019 Dec.

Abstract

Despite major methodological developments, Bayesian inference in Gaussian graphical models remains challenging in high dimension due to the tremendous size of the model space. This article proposes a method to infer the marginal and conditional independence structures between variables by multiple testing, which bypasses the exploration of the model space. Specifically, we introduce closed-form Bayes factors under the Gaussian conjugate model to evaluate the null hypotheses of marginal and conditional independence between variables. Their computation for all pairs of variables is shown to be extremely efficient, thereby allowing us to address large problems with thousands of nodes as required by modern applications. Moreover, we derive exact tail probabilities from the null distributions of the Bayes factors. These allow the use of any multiplicity correction procedure to control error rates for incorrect edge inclusion. We demonstrate the proposed approach on various simulated examples as well as on a large gene expression data set from The Cancer Genome Atlas.

Keywords: Bayes factor; Gaussian graphical model; correlation; high-dimensional data; inverse-Wishart distribution.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Running time in seconds (assessed on 3.40 GHz Intel Core i7‐3770 CPU) for each method when p=1000
Figure 2
Figure 2
A, Log‐marginal likelihood of the GC model as a function of α=(δp1)(δ+np1). The vertical and horizontal dotted lines indicates the location of the optimum. B, Degree distribution of the conditional independence graph. GC, Gaussian conjugate
Figure 3
Figure 3
Example of a densely connected gene subgraph identified by the clustering algorithm of Blondel et al. (2008)

References

    1. Baladandayuthapani, V. , Talluri, R. , Ji, Y. , Coombes, K.R. , Lu, Y. , Hennessy, B.T. , Davies, M.A. and Mallick, B.K. (2014). Bayesian sparse graphical models for classification with application to protein expression data. The Annals of Applied Statistics, 8, 1443–1468. - PMC - PubMed
    1. Bhatlekar, S. , Fields, J.Z. and Boman, B.M. (2014). Hox genes and their role in the development of human cancers. Journal of Molecular Medicine, 92, 811–823. - PubMed
    1. Blondel, V.D. , Guillaume, J.‐L. , Lambiotte, R. and Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008, P10008.
    1. Carvalho, C.M. and Scott, J.G. (2009). Objective Bayesian model selection in Gaussian graphical models. Biometrika, 96, 497–512.
    1. Chen, C.F. (1979). Bayesian inference for a normal dispersion matrix and its application to stochastic multiple regression analysis. Journal of the Royal Statistical Society: Series B, 41, 235–248.

Publication types