Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep;59(5):932-947.
doi: 10.1002/bimj.201600090. Epub 2017 Apr 10.

An empirical Bayes approach to network recovery using external knowledge

Affiliations

An empirical Bayes approach to network recovery using external knowledge

Gino B Kpogbezan et al. Biom J. 2017 Sep.

Abstract

Reconstruction of a high-dimensional network may benefit substantially from the inclusion of prior knowledge on the network topology. In the case of gene interaction networks such knowledge may come for instance from pathway repositories like KEGG, or be inferred from data of a pilot study. The Bayesian framework provides a natural means of including such prior knowledge. Based on a Bayesian Simultaneous Equation Model, we develop an appealing Empirical Bayes (EB) procedure that automatically assesses the agreement of the used prior knowledge with the data at hand. We use variational Bayes method for posterior densities approximation and compare its accuracy with that of Gibbs sampling strategy. Our method is computationally fast, and can outperform known competitors. In a simulation study, we show that accurate prior data can greatly improve the reconstruction of the network, but need not harm the reconstruction if wrong. We demonstrate the benefits of the method in an analysis of gene expression data from GEO. In particular, the edges of the recovered network have superior reproducibility (compared to that of competitors) over resampled versions of the data.

Keywords: Empirical Bayes; High-dimensional Bayesian inference; Prior information; Undirected network; Variational approximation.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest

The authors have declared no conflict of interest.

Figures

Figure 1
Figure 1
ROC curves for BSEM (dashed) and BSEMed using perfect prior information (blue), BSEMed using 75% true edges present in the prior (brown), BSEMed using 50% true edges present in the prior (black) and BSEMed using 0% true edges present in the prior (red). Here, p = 100 and n ∈ {50, 200}.
Figure 2
Figure 2
Visualization of BSEMed ’κ¯i,r ’ using perfect prior (b), BSEMed ’κ¯i,r ’ using 50% true edges information (c), BSEM ’κ¯i,r(d) and the true graph (a) in case n = 50 and p = 100.
Figure 3
Figure 3
BSEM vs BSEMed network estimates in lung cancer. Red edges are the overlap edges.
Figure 4
Figure 4
BSEM vs BSEMed network estimates in pancreas cancer. Red edges are the overlap edges.
Figure 5
Figure 5
Venn diagrams displaying the mean overlap of reproduced top-ranking edges, corresponding to the second row of Table 3 (Figure 5.a) and Table 4 (Figure 5.b).
Figure 6
Figure 6
Network in a normal cell vs BSEMed network in lung cancer. Red edges are the overlap edges between prior and posterior networks.
Figure 7
Figure 7
Network in a normal cell vs BSEMed network in pancreas cancer. Red edges are the overlap edges between prior and posterior networks.

References

    1. Allen GI, Liu Z. A Local Poisson Graphical Model for Inferring Networks From Sequencing Data. NanoBioscience, IEEE Transactions on. 2013;12:189–198. - PubMed
    1. Badea L, Herlea V, Dima SO, Dumitrascu T, Popescu I. Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia. Hepatogastroenterology. 2008;55:2016–2027. - PubMed
    1. Carvalho CM, Polson NG, Scott JG. The horseshoe estimator for sparse signals. Biometrika. 2010;97:465–480.
    1. Chen S, Witten DM, Shojaie A. Selection and estimation for mixed graphical models. Biometrika. 2015;102:47–64. - PMC - PubMed
    1. Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics. 2008;9:432–441. - PMC - PubMed

LinkOut - more resources