Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023;88(1):38-49.
doi: 10.1159/000529559. Epub 2023 Apr 26.

Methods and Software to Analyze Gene-Environment Interactions under a Case-Mother-Control-Mother Design with Partially Missing Child Genotype

Affiliations

Methods and Software to Analyze Gene-Environment Interactions under a Case-Mother-Control-Mother Design with Partially Missing Child Genotype

Alexandre Bureau et al. Hum Hered. 2023.

Abstract

Introduction: The case-mother-control-mother design allows to study fetal and maternal genetic factors together with environmental exposures on early life outcomes. Mendelian constraints and conditional independence between child genotype and environmental factors enabled semiparametric likelihood methods to estimate logistic models with greater efficiency than standard logistic regression. Difficulties in child genotype collection require methods handling missing child genotype.

Methods: We review a stratified retrospective likelihood and two semiparametric likelihood approaches: a prospective one and a modified retrospective one, the latter either modeling the maternal genotype as a function of covariates or leaving their joint distribution unspecified (robust version). We also review software implementing these modeling alternatives, compare their statistical properties in a simulation study, and illustrate their application, focusing on gene-environment interactions and partially missing child genotype.

Results: The robust retrospective likelihood provides generally unbiased estimates, with standard errors only slightly larger than when modeling maternal genotype based on exposure. The prospective likelihood encounters maximization problems. In the application to the association of small-for-gestational-age babies with CYP2E1 and drinking water disinfection by-products, the retrospective likelihood allowed a full array of covariates, while the prospective likelihood was limited to few covariates.

Conclusion: We recommend the robust version of the modified retrospective likelihood.

Keywords: Association study; Logistic model; Modified profile likelihood; Mother-child pair; R software; Semiparametric likelihood.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflicts of interest to declare.

Figures

Fig. 1.
Fig. 1.
Bias of the estimates of the model terms under linear dependence between environmental exposure X1 and maternal genotype GM. The models applied are identified in the legend by the R function used to fit them. The percentage of missing child genotype data and the missing data mechanism are indicated below the figure panels; MCAR: missing completely at random; MNAR: missing not at random, under the scenario where GC is missing with 14% probability when GC = 0, 28% when GC = 1 and 52% when GC = 2.
Fig. 2.
Fig. 2.
Empirical standard errors of the estimates of the model terms under linear dependence between environmental exposure X1 and maternal genotype GM. The standard errors of Spmlficmcm exceed 0.25 due to a few outlying values. See Figure 1 for the legend.

References

    1. Infante-Rivard C. Studying genetic predisposition among small-for-gestational-age newborns. Semin Perinatol. 2007;31(4):213–8. 10.1053/j.semperi.2007.05.001. - DOI - PubMed
    1. Shi M, Umbach DM, Vermeulen SH, Weinberg CR. Making the most of case-mother/control-mother studies. Am J Epidemiol. 2008;168(5):541–7. 10.1093/aje/kwn149. - DOI - PMC - PubMed
    1. Chen J, Lin D, Hochner H. Semiparametric maximum likelihood methods for analyzing genetic and environmental effects with case-control mother-child pair data. Biometrics. 2012;68(3):869–77. 10.1111/j.1541-0420.2011.01728.x. - DOI - PubMed
    1. Prentice R, Pyke R. Logistic disease incidence models and case-control studies. Biometrika. 1979;66(3):403–11. 10.1093/biomet/66.3.403. - DOI
    1. Gjerdevik M, Haaland ØA, Romanowska J, Lie RT, Jugessur A, Gjessing HK. Parent-of-origin-environment interactions in case-parent triads with or without independent controls. Ann Hum Genet. 2018;82(2):60–73. 10.1111/ahg.12224. - DOI - PMC - PubMed

Publication types