Tutorial on directed acyclic graphs
- PMID: 34371103
- PMCID: PMC8821727
- DOI: 10.1016/j.jclinepi.2021.08.001
Tutorial on directed acyclic graphs
Abstract
Directed acyclic graphs (DAGs) are an intuitive yet rigorous tool to communicate about causal questions in clinical and epidemiologic research and inform study design and statistical analysis. DAGs are constructed to depict prior knowledge about biological and behavioral systems related to specific causal research questions. DAG components portray who receives treatment or experiences exposures; mechanisms by which treatments and exposures operate; and other factors that influence the outcome of interest or which persons are included in an analysis. Once assembled, DAGs - via a few simple rules - guide the researcher in identifying whether the causal effect of interest can be identified without bias and, if so, what must be done either in study design or data analysis to achieve this. Specifically, DAGs can identify variables that, if controlled for in the design or analysis phase, are sufficient to eliminate confounding and some forms of selection bias. DAGs also help recognize variables that, if controlled for, bias the analysis (e.g., mediators or factors influenced by both exposure and outcome). Finally, DAGs help researchers recognize insidious sources of bias introduced by selection of individuals into studies or failure to completely observe all individuals until study outcomes are reached. DAGs, however, are not infallible, largely owing to limitations in prior knowledge about the system in question. In such instances, several alternative DAGs are plausible, and researchers should assess whether results differ meaningfully across analyses guided by different DAGs and be forthright about uncertainty. DAGs are powerful tools to guide the conduct of clinical research.
Copyright © 2021 Elsevier Inc. All rights reserved.
Figures
Paths are sequences of arrows, of any direction, connecting two variables and may be causal or non-causal.
Paths are causal if each variable causes the subsequent variable (all the arrows point in the same direction).
Paths are non-causal if the arrows do not all point in the same direction. They contain confounders and/or colliders.
Confounding occurs because of common (shared) causes (e.g., C) of E and D. To estimate the effect of E on D, it is necessary to control for such common causes or other variables along the non-causal path. For example, control for either C or G would be adequate to eliminate the confounding due to C. G may be preferable, for example if it is easier to obtain a high-quality measurement of G.
Mediators (e.g., M) are caused by E and, in turn, cause D. They should not be controlled for to estimate the total effect of E on D.
Colliders (e.g., S) are so named because they have two arrows pointing into them. Colliders on a path block that path unless they are conditioned on (e.g., by controlling for them) or a consequence of the collider is conditioned on.
Analyses should not adjust for, stratify on, or in any way condition on descendants of D (e.g., Z).
Instrumental variables (e.g., I) are variables related to the exposure of interest that have no association with the outcome except through the exposure. Instrumental variables analysis (a technique common in the economics literature) can be used to derive effect estimates when there is intractable confounding of E and D.
Effect modifiers (e.g., J) are variables that cause D and modify the effect of other causes of D, such as E. If E and J both cause D, then J modifies the effect of E on D on at least one effect-measure scale (additive or multiplicative).
Comment in
-
A cause should not be automatically taken as an effect modifier of other causes: author's reply.J Clin Epidemiol. 2022 Jun;146:127-128. doi: 10.1016/j.jclinepi.2022.02.014. Epub 2022 Feb 25. J Clin Epidemiol. 2022. PMID: 35219802 No abstract available.
-
A cause should not be automatically taken as an effect modifier of other causes: response to Digitale et al.J Clin Epidemiol. 2022 Jun;146:128-130. doi: 10.1016/j.jclinepi.2022.02.013. Epub 2022 Feb 25. J Clin Epidemiol. 2022. PMID: 35219804 No abstract available.
Dataset use reported in
-
Harm of early dexamethasone for COVID-19 and bias in randomized trials.Eur J Intern Med. 2023 Jan;107:100-101. doi: 10.1016/j.ejim.2022.09.014. Epub 2022 Sep 19. Eur J Intern Med. 2023. PMID: 36150979 Free PMC article. No abstract available.
References
-
- Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiol Camb Mass. 1999. Jan;10(1):37–48. - PubMed
-
[Seminal introduction of DAGs in epidemiologic research.]
-
- Glymour MM. Using causal diagrams to understand common problems in social epidemiology. In: Oakes JM, Kaufman JS, editors. Methods in Social Epidemiology. 2nd ed. San Francisco: Jossey-Bass; 2017. p. 458–92.
-
[Comprehensive and approachable explanation of the practical use of DAGs.]
-
- Hernán MA, Hernández-Díaz S, Werler MM, Mitchell AA. Causal knowledge as a prerequisite for confounding evaluation: An application to birth defects epidemiology. Am J Epidemiol. 2002. Jan 15;155(2):176–84. - PubMed
-
[Landmark convincing argument as to why prior causal knowledge about the ambient biologic or behavioral system, encoded in DAGs, should be used to decide whether a variable is contributing to confounding and therefore must be managed in order to avoid bias. Overturns decades of varying and ill-defined approaches to the evaluation of confounding.]
-
- Didelez V, Sheehan N. Mendelian randomization as an instrumental variable approach to causal inference. Stat Methods Med Res. 2007. Aug 1;16(4):309–30. - PubMed
