Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 5;113(27):7361-8.
doi: 10.1073/pnas.1510493113.

Methods for causal inference from gene perturbation experiments and validation

Affiliations

Methods for causal inference from gene perturbation experiments and validation

Nicolai Meinshausen et al. Proc Natl Acad Sci U S A. .

Abstract

Inferring causal effects from observational and interventional data is a highly desirable but ambitious goal. Many of the computational and statistical methods are plagued by fundamental identifiability issues, instability, and unreliable performance, especially for large-scale systems with many measured variables. We present software and provide some validation of a recently developed methodology based on an invariance principle, called invariant causal prediction (ICP). The ICP method quantifies confidence probabilities for inferring causal structures and thus leads to more reliable and confirmatory statements for causal relations and predictions of external intervention effects. We validate the ICP method and some other procedures using large-scale genome-wide gene perturbation experiments in Saccharomyces cerevisiae The results suggest that prediction and prioritization of future experimental interventions, such as gene deletions, can be improved by using our statistical inference techniques.

Keywords: genome database validation; graphical models; interventional–observational data; invariant causal prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
The activities of two pairs of genes. Observational data are shown as red crosses and interventional data as blue circles. A blue arrow marks the experiment where an intervention on gene YOL067C occurs (activity shown on the x axis in the left panel) and analogously in the right panel. The intervention on the Left is deemed not “strongly successful” [not fulfilling (i) in the definition of an SIE] as the activity of gene YPR089W (y axis) under the intervention is well within its usual range. The intervention on the Right is called strongly successful as the activity of YMR103C (y axis) under the intervention is outside of the previously seen range and likewise for the gene YMR104C on which the intervention occurs (strongly successful area is marked by the green box).
Fig. 2.
Fig. 2.
The average number of successful interventions (y axis) against the number of selected edges (x axis) for various methods.
Fig. 3.
Fig. 3.
The graph of estimated causal relations between the biochemical agents in the ref. data. Blue edges are found by an invariant prediction approach, whereas red edges are found if allowing hidden variables and feedback with invariant prediction. Purple edges are found with invariant prediction whether allowing for hidden variables or not. The solid edges (including the gray edges) are all relations that have been reported in either the consensus network according to ref. , or the newly reported edges in ref. , , or . See also SI Appendix, Table S1.

References

    1. Stekhoven DJ, et al. Causal stability ranking. Bioinformatics. 2012;28(21):2819–2823. - PubMed
    1. Pearl J. Causality: Models, Reasoning and Inference. Cambridge Univ Press; New York: 2000.
    1. Spirtes P, Glymour C, Scheines R. Causation, Prediction, and Search. 2nd Ed MIT Press; Cambridge, MA: 2000.
    1. Bollen KA. Structural Equations with Latent Variables. Wiley; New York: 1989.
    1. Robins J. A new approach to causal inference in mortality studies with a sustained exposure periodapplication to control of the healthy worker survivor effect. Math Model. 1986;7(9):1393–1512.

Publication types