Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jan;31(1):18-30.
doi: 10.1002/gepi.20189.

Power calculations for likelihood ratio tests for offspring genotype risks, maternal effects, and parent-of-origin (POO) effects in the presence of missing parental genotypes when unaffected siblings are available

Affiliations

Power calculations for likelihood ratio tests for offspring genotype risks, maternal effects, and parent-of-origin (POO) effects in the presence of missing parental genotypes when unaffected siblings are available

E Rampersaud et al. Genet Epidemiol. 2007 Jan.

Abstract

Genotype-based likelihood-ratio tests (LRT) of association that examine maternal and parent-of-origin effects have been previously developed in the framework of log-linear and conditional logistic regression models. In the situation where parental genotypes are missing, the expectation-maximization (EM) algorithm has been incorporated in the log-linear approach to allow incomplete triads to contribute to the LRT. We present an extension to this model which we call the Combined_LRT that incorporates additional information from the genotypes of unaffected siblings to improve assignment of incompletely typed families to mating type categories, thereby improving inference of missing parental data. Using simulations involving a realistic array of family structures, we demonstrate the validity of the Combined_LRT under the null hypothesis of no association and provide power comparisons under varying levels of missing data and using sibling genotype data. We demonstrate the improved power of the Combined_LRT compared with the family-based association test (FBAT), another widely used association test. Lastly, we apply the Combined_LRT to a candidate gene analysis in Autism families, some of which have missing parental genotypes. We conclude that the proposed log-linear model will be an important tool for future candidate gene studies, for many complex diseases where unaffected siblings can often be ascertained and where epigenetic factors such as imprinting may play a role in disease etiology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Simulation-based power calculations when effects of the child genotype on risk are simulated and modeled. Results for 2000 samples of 200 families with 1 missing parent simulated under additive, dominant and multiplicative models where R1=1.8 and the allele frequency p=0.30. Triads are represented by □ , quads by formula image , quints by ■ .
Figure 2
Figure 2
Plot of estimated power based on 2000 simulations where maternal effects are modeled, shown as a function of the percentage of families where the father is missing genotype data. The model was simulated so that R1=1, R2=1, S1=1.8, S2=2.5, and Im=1. A 2-df LRT was used to test maternal genotype effects. Dashed lines represent power for the fully typed dataset. The triangles are used to plot the results using only completely typed triads and circles are used for the LRT that incorporates the EM algorithm using triads with missing parents. The cross and square are used to plot the power results for the LRT that incorporates the EM algorithm when 1 or 2 unaffected siblings are used, respectively.
Figure 3
Figure 3
Plot of estimated power based on simulations where a POO effect is modeled and as a function of the percentage of families where 1 parent is missing genotype data. The model was simulated so that R1=1.8, R2=2.5, Im=2.5 and S1=S2=1. A 1-df LRT was used in the analysis. The power for the fully typed dataset is represented by dashed lines. The triangles are used to plot the results using only completely typed triads and circles are used for the LRT that incorporates the EM algorithm using only triads. The cross and square are used to plot the power results for the LRT that incorporates the EM algorithm when 1 or 2 unaffected siblings are used, respectively.
Figure 4
Figure 4
Plot of estimated power based on 2000 simulations where a POO effect is modeled and as a function of the percentage of families where both parents are missing genotype data. The model was simulated so that R1=1.8, R2=2.5, Im=2.5 and S1=S2=1. A 1-df LRT was used in the analysis. The power for the fully typed dataset is represented by dashed lines. The triangles are used to plot the results using only completely typed triads, circles are used for the LRT that incorporates the EM algorithm using only triads. The cross and square are used to plot the power results for the LRT that incorporates the EM algorithm when 1 or 2 unaffected siblings are used, respectively.

References

    1. Abel KM. Fetal origins of schizophrenia: testable hypotheses of genetic and environmental influences. Br J Psychiatry. 2004;184:383–385. - PubMed
    1. Chatkupt S, Lucek PR, Koenigsberger MR, Johnson WG. Parental sex effect in spina bifida: a role for genomic imprinting? Am J Med Genet. 1992;44:508–512. - PubMed
    1. Clayton D. A Generalization of the Transmission/Disequilibrium Test for Uncertain-Haplotype Transmission. Am J Hum Genet. 1999;65:1170–1177. - PMC - PubMed
    1. Cordell HJ, Barratt BJ, Clayton DG. Case/pseudocontrol analysis in genetic association studies: A unified framework for detection of genotype and haplotype associations, gene-gene and gene-environment interactions, and parent-of-origin effects. Genet Epidemiol. 2004;26:167–185. - PubMed
    1. Dempster AP, Laird NM, Rubin DB. Maximum Likelihood from incomplete data via the EM algorithm (with discussion). J R Stat Soc B. 1977;B39:1–38.

Publication types

LinkOut - more resources