Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2014 May;25(3):418-26.
doi: 10.1097/EDE.0000000000000078.

Causal models and learning from data: integrating causal modeling and statistical estimation

Affiliations
Review

Causal models and learning from data: integrating causal modeling and statistical estimation

Maya L Petersen et al. Epidemiology. 2014 May.

Abstract

The practice of epidemiology requires asking causal questions. Formal frameworks for causal inference developed over the past decades have the potential to improve the rigor of this process. However, the appropriate role for formal causal thinking in applied epidemiology remains a matter of debate. We argue that a formal causal framework can help in designing a statistical analysis that comes as close as possible to answering the motivating causal question, while making clear what assumptions are required to endow the resulting estimates with a causal interpretation. A systematic approach for the integration of causal modeling with statistical estimation is presented. We highlight some common points of confusion that occur when causal modeling techniques are applied in practice and provide a broad overview on the types of questions that a causal framework can help to address. Our aims are to argue for the utility of formal causal thinking, to clarify what causal models can and cannot do, and to provide an accessible introduction to the flexible and powerful tools provided by causal models.

PubMed Disclaimer

Conflict of interest statement

The authors report no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
A general roadmap for approaching causal questions.
FIGURE 2
FIGURE 2
Causal graphs and corresponding structural equations representing alternative data generating processes. (A), Assumes that W may have affected A, both A and W may have affected Y, and W, A, and Y may all share unmeasured common causes. (B), Assumes that subjects were randomly assigned to an exposure arm R with probability 0.5 and that assigned exposure arm R does not affect the outcome Y other than via uptake of the exposure A. (C), Figure 2A after a hypothetical intervention in which all subjects in the target population are assigned exposure level 0 (however this is defined). (D) and (E), Figure 2A augmented with additional assumptions to ensure identifiability of the distribution of Ya (and thus target quantities such as the average treatment effect) given observed data (W,A,Y). (F), Assumes that Z does not affect A or Y, that A shares no common causes with W or Y, and that W shares no common causes with Y or Z. Figures 2A, 2D, and 2E have no testable implications with W,A,Y observed, and thus imply non-parametric statistical models for the distribution of (W,A,Y). In contrast, Figure 2B implies that R is independent of W, and thus implies a semi-parametric statistical model for the distribution of (W,R,A,Y).
FIGURE 3
FIGURE 3
Translating a scientific question into a target causal quantity.
FIGURE 4
FIGURE 4
Interpreting the results of a data analysis: a hierarchy. The use of a statistical model known to contain the true distribution of the observed data and of an estimator that minimizes bias and provides a valid measure of statistical uncertainty helps to ensure that analyses maintain a valid statistical interpretation. Under additional assumptions, this interpretation can be augmented.

Similar articles

Cited by

References

    1. Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10:37–48. - PubMed
    1. Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11:550–560. - PubMed
    1. Pearl J. The Causal Foundations of Structural Equation Modeling. In: Hoyle RH, editor. Handbook of Structural Equation Modeling. New York: Guilford Press; 2012. pp. 68–91.
    1. van der Laan M, Rose S. Targeted Learning: Causal Inference for Observational and Experimental Data. Berlin, Heidelberg, New York: Springer; 2011.
    1. Pearl J. Causality: Models, Reasoning, and Inference. New York: Cambridge University Press; 2000.

Publication types