Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Jun 1;11(6):a039552.
doi: 10.1101/cshperspect.a039552.

Twins and Causal Inference: Leveraging Nature's Experiment

Affiliations
Review

Twins and Causal Inference: Leveraging Nature's Experiment

Tom A McAdams et al. Cold Spring Harb Perspect Med. .

Abstract

In this review, we discuss how samples comprising monozygotic and dizygotic twin pairs can be used for the purpose of strengthening causal inference by controlling for shared influences on exposure and outcome. We begin by briefly introducing how twin data can be used to inform the biometric decomposition of population variance into genetic, shared environmental, and nonshared environmental influences. We then discuss how extensions to this model can be used to explore whether associations between exposure and outcome survive correction for shared etiology (common causes). We review several analytical approaches that can be applied to twin data for this purpose. These include multivariate structural equation models, cotwin control methods, direction of causation models (cross-sectional and longitudinal), and extended family designs used to assess intergenerational associations. We conclude by highlighting some of the limitations and considerations that researchers should be aware of when using twin data for the purposes of interrogating causal hypotheses.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Path diagram of the univariate twin model. All of our path diagrams follow standard structural equation modeling conventions, whereby measured variables are represented with squares and latent variables are represented with circles. Although not included in the path diagram, the variances of all latent variables are fixed to unity. (a11) Additive genetic effects on phenotype, (c11) shared environmental effects on phenotype, (e11) nonshared environmental effects on phenotype, (A1) additive genetic variance component, (C1) shared environmental variance component, (E1) nonshared environmental variance component. In the classical twin design, the predicted variances of each trait (a112 + c112 + e112) and the predictions for the covariances (a112 + c112 for MZ and 0.5a112 + c112 for DZ pairs) are fitted to the observed variances and covariances to obtain the most likely estimates for the A, C, and E effects.
Figure 2.
Figure 2.
Cholesky decomposition twin model.
Figure 3.
Figure 3.
Modeling causality. In the above models, Byx is the causal effect of X on Y. In model i, Vx denotes the variance of X, and Ry denotes the residual variance of Y (the variance of Y remaining after regressing out the effects of X). In model ii, Vz denotes the variance of Z, and Rx and Ry denote the residual variances of X and Y (variance remaining after regressing out the effect of Z (on X) and Z and X (on Y)). Twin models shown are partial path diagrams for a single individual. All latent factors have a variance of 1 (not shown).
Figure 4.
Figure 4.
A phenotypic autoregressive cross-lagged model and partial path diagrams of a twin autoregressive cross-lagged model.
Figure 5.
Figure 5.
The direction of causation (DoC) twin model.
Figure 6.
Figure 6.
Children of twins structural equation model. (a1) Additive genetic effects on parental phenotype, (c1) shared environmental effects on parental phenotype, (e1) nonshared environmental effects on parental phenotype, (a1′) genetic effects common to parental phenotype and offspring phenotype, (c1′) extended family environment effects on offspring phenotype, (a2) genetic effects specific to offspring phenotype, (c2) shared environmental effects on offspring phenotype (not estimable using cousin data), (e2) nonshared environmental effects on offspring phenotype, (p) phenotypic effect of parent on offspring, (NB) the pathway between A1 and A1′ is fixed to 0.50 because parents and children share 50% of their genome. To avoid overcomplicating path diagrams, variance paths have been omitted but for all latent factors, variance = 1. For A1′ this means that residual variance (after accounting for the path between A1 and A1′) is 0.75.

References

    1. Ahmadzadeh YI, Eley TC, Hannigan LJ, Creswell C, Lichtenstein P, Reiss D, Spotts E, Ganigan J, Neiderhiser JM, Rijsdijk FV, et al. 2020. Parental criticism and adolescent internalising symptoms: associations remain after accounting for shared genetic effects. medRxiv 10.1101/2020.05.07.20084319 - DOI
    1. Ball HA, Sumathipala A, Siribaddana SH, Kovas Y, Glozier N, McGuffin P, Hotopf M. 2009. Genetic and environmental contributions to depression in Sri Lanka. Br J Psychiatry 195: 504–509. 10.1192/bjp.bp.109.063529 - DOI - PMC - PubMed
    1. Carlin JB, Gurrin LC, Sterne JA, Morley R, Dwyer T. 2005. Regression models for twin studies: a critical review. Int J Epidemiol 34: 1089–1099. 10.1093/ije/dyi153 - DOI - PubMed
    1. Carr AB, Einstein R, Lai LY, Martin NG, Starmer GA. 1981. Vitamin C and the common cold using identical twins as controls. Med J Aust 2: 411–412. 10.5694/j.1326-5377.1981.tb101032.x - DOI - PubMed
    1. Cecil CA, Barker ED, Jaffee SR, Viding E. 2012. Association between maladaptive parenting and child self-control over time: cross-lagged study using a monozygotic twin difference design. Br J Psychiatry 201: 291–297. 10.1192/bjp.bp.111.107581 - DOI - PubMed

Publication types

LinkOut - more resources