Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul;56(7):2183-2205.
doi: 10.1016/j.csda.2012.01.010. Epub 2012 Jan 24.

Bayesian Method for Causal Discovery of Latent-Variable Models from a Mixture of Experimental and Observational Data

Affiliations

Bayesian Method for Causal Discovery of Latent-Variable Models from a Mixture of Experimental and Observational Data

Changwon Yoo. Comput Stat Data Anal. 2012 Jul.

Abstract

This paper describes a Bayesian method for learning causal Bayesian networks through networks that contain latent variables from an arbitrary mixture of observational and experimental data. The paper presents Bayesian methods (including a new method) for learning the causal structure and parameters of the underlying causal process that is generating the data, given that the data contain a mixture of observational and experimental cases. These learning methods were applied using as input various mixtures of experimental and observational data that were generated from the ALARM causal Bayesian network. The paper reports how these structure predictions and parameter estimates compare with the true causal structures and parameters as given by the ALARM network. The paper shows that (1) the new method for learning Bayesian network structure from a mixture of data that this paper introduce, Gibbs Volume method, best estimates the probability of the data given the latent variable model and (2) using large data (>10,000 cases), another model, the implicit latent variable method, is asymptotically correct and efficient.

Keywords: Bayesian Method; Bayesian Network; Causal Discovery; Knowledge Discovery from Databases; Latent Variable Modeling.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A causal Bayesian network that represents a hypothetical gene-regulation pathway.
Figure 2
Figure 2
A hypothetical causal Bayesian network structure with manipulation
Figure 3
Figure 3
Twelve hypotheses and six equivalence classes
Figure 4
Figure 4
Scoring E6.
Figure 5
Figure 5
Gibbs Sampling score plot sort by P(Di | S, K) score
Figure 6
Figure 6
Pseudo-code of the Gibbs Volume method given dataset D and number of iterations N.
Figure 7
Figure 7
Average score of dimensionality of eight node pairs vs. number of cases. Obsv# stands for number of observational cases in dataset D(X,Y). Exp#/2 stands for number of experimental cases used each in dataset D(mX,Y) and D(X,mY).

References

    1. Cooper GF. An overview of the representation and discovery of causal relationships using Bayesian networks. In: Glymour C, Cooper GF, editors. Computation, Causation, and Discovery. Cambridge, MA: MIT Press; 1999.
    1. Spirtes P, Glymour C, Scheines R. Causation, prediction, and search. 2 ed. Cambridge, MA: MIT Press; 2000.
    1. Heckerman D, Geiger D, Chickering D. Learning Bayesian networks: The combination of knowledge and statistical data. Machine Learning. 1995;20:197–243.
    1. Heckerman D. Proceedings of the Conference on Uncertainty in Artificial Intelligence. San Francisco, CA: Morgan Kaufmann; 1995. A Bayesian approach to learning causal networks.
    1. Wille A, et al. Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana. Genome Biology. 2004;5(11) R92.1-13. - PMC - PubMed

LinkOut - more resources