Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009;68(4):268-77.
doi: 10.1159/000228924. Epub 2009 Jul 22.

A new method to account for missing data in case-parent triad studies

Affiliations

A new method to account for missing data in case-parent triad studies

T L Bergemann et al. Hum Hered. 2009.

Abstract

Background/aims: The case-parent triad design is commonly used in genetic association studies. Generally, samples are drawn from an affected offspring, manifesting a phenotype of interest, as well as from the parents. The trio genotypes may be analyzed using a variety of available methods, but we focus on log-linear models because they test for genetic association and additionally estimate the relative risks of transmission. The models need to be modified to adjust for missing genotypes. Furthermore, instability in the parameter estimates can arise when certain kinds of genotype combinations do not appear in the dataset.

Methods: In this paper, we kill two birds with one stone. We propose a new method to simultaneously account for missing genotype data and genotype combinations with zero counts. This method solves a zero-inflated Poisson (ZIP) regression likelihood. The maximum likelihood estimates yield relative risks and the information matrix gives appropriate variance estimates for inference. A likelihood ratio test determines the significance of genetic association.

Results: We compared the ZIP regression to previously proposed methods in both simulation studies and in a dataset that investigates the risk of orofacial clefts. The ZIP likelihood estimates regression coefficients with less bias than other methods when the minor allele frequency is small.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Comparison of Type I error rates for the three methods discussed in Section 2. The upper panels show results for N = 300 and the lower panels for N = 500. The panels from left to right show results for the EM algorithm, the direct likelihood, and the ZIP likelihood respectively.
Fig. 2
Fig. 2
Comparison of power levels for the three methods discussed in Section 2. The upper panels show results for N = 300 and the lower panels for N = 500. The panels from left to right show results for the EM algorithm, the direct likelihood, and the ZIP likelihood respectively.

Similar articles

Cited by

References

    1. Ahsan H, Hodge SE, Heiman GA, Begg MD, Susser ES. Relative risk for genetic associations: the case-parent triad as a variant of case-cohort design. Int J Epidemiol. 2002;31:669–678. - PubMed
    1. Laird NM, Lange C. Family-based methods for linkage and association analysis. Adv Genet. 2008;60:219–252. - PubMed
    1. Steele JR, Wellemeyer AS, Hansen MJ, Reaman GH, Ross JA. Childhood cancer research network: a North American pediatric cancer registry. Cancer Epidemiol Biomarkers Prevention. 2006;15:1241–1242. - PubMed
    1. Spielman RS, McGinnis RE, Ewens WJ. Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM) Am J Hum Genet. 1993;52:506–516. - PMC - PubMed
    1. Horvath S, Xu X, Laird NM. The family based association test method: strategies for studying general genotype-phenotype associations. Eur J Hum Genet. 2001;9:301–306. - PubMed

Publication types

Substances