Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Aug:2018:1551-1560.
doi: 10.1145/3219819.3220104.

Generalized Score Functions for Causal Discovery

Affiliations

Generalized Score Functions for Causal Discovery

Biwei Huang et al. KDD. 2018 Aug.

Abstract

Discovery of causal relationships from observational data is a fundamental problem. Roughly speaking, there are two types of methods for causal discovery, constraint-based ones and score-based ones. Score-based methods avoid the multiple testing problem and enjoy certain advantages compared to constraint-based ones. However, most of them need strong assumptions on the functional forms of causal mechanisms, as well as on data distributions, which limit their applicability. In practice the precise information of the underlying model class is usually unknown. If the above assumptions are violated, both spurious and missing edges may result. In this paper, we introduce generalized score functions for causal discovery based on the characterization of general (conditional) independence relationships between random variables, without assuming particular model classes. In particular, we exploit regression in RKHS to capture the dependence in a non-parametric way. The resulting causal discovery approach produces asymptotically correct results in rather general cases, which may have nonlinear causal mechanisms, a wide class of data distributions, mixed continuous and discrete data, and multidimensional variables. Experimental results on both synthetic and real-world data demonstrate the efficacy of our proposed approach.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
(a) Scatter plot of the estimated noise Ê1 and Ê3; Ê1 and Ê3 are correlated. (b) Scatter plot of X1 and X2; they are uncorrelated.
Figure 2:
Figure 2:
The F1 score of recovered causal graphs. (a.1) Continuous data with n = 500. (a.2) Continuous data with n = 1000. (b.1) Multi-dimensional data with n = 500. (b.2) Multi-dimensional data with n = 1000. (c.1) Mixed continuous and discrete data with n = 500. (c.2) Mixed continuous and discrete data with n = 1000. The x-axis is the graph density. The y-axis is the F1 score; higher F1 score means higher accuracy.
Figure 3:
Figure 3:
The normalized SHD of recovered causal graphs. The y-axis is the normalized SHD score; the lower SHD score means better accuracy.
Figure 4:
Figure 4:
The F1 score of the recovered causal graphs on the two discrete networks. (a) CHILD network. (b) SACHS network.
Figure 5:
Figure 5:
Recovered causal graph from Archaeology data set. The solid lines are shared edges from the CV likelihood and the marginal likelihood. The dashed edges are recovered only by CV likelihood, and the dotted edges are recovered only by marginal likelihood.

References

    1. Aliferis CF, Statnikov AR, Tsamardinos I, Mani S, and Koutsoukos XD. Local causal and markov blanket induction for causal discovery and feature selection for classification part i: Algorithms and empirical evaluation. Journal of Machine Learning Research, 11:171–234, 2010.
    1. Bach FR and Jordan MI. Learning graphical models with mercer kernels Advances in Neural Information Processing Systems, pages 1009–1016, 2002.
    1. Bakken TE, Dale AM, and Schork NJ. A geographic cline of skull and brain morphology among individuals of european ancestry. Hum Hered, 72(1):35–44, 2011. - PMC - PubMed
    1. Bühlmann P, Peters J, and Ernest J. CAM: Causal additive models, high-dimensional order search and penalized regression. Annals of Statistics, 42(6):2526–2556, 2014.
    1. Buntine W. Theory refinment on bayesian networks Uncertainty in Artificial Intelligence, pages 52–60, 1991.

LinkOut - more resources