Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 2;100(2):193-204.
doi: 10.1016/j.ajhg.2016.12.001. Epub 2017 Jan 5.

The Rare-Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer Disease WGS Data

Affiliations

The Rare-Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer Disease WGS Data

Zongxiao He et al. Am J Hum Genet. .

Erratum in

Abstract

Whole-genome and exome sequence data can be cost-effectively generated for the detection of rare-variant (RV) associations in families. Causal variants that aggregate in families usually have larger effect sizes than those found in sporadic cases, so family-based designs can be a more powerful approach than population-based designs. Moreover, some family-based designs are robust to confounding due to population admixture or substructure. We developed a RV extension of the generalized disequilibrium test (GDT) to analyze sequence data obtained from nuclear and extended families. The GDT utilizes genotype differences of all discordant relative pairs to assess associations within a family, and the RV extension combines the single-variant GDT statistic over a genomic region of interest. The RV-GDT has increased power by efficiently incorporating information beyond first-degree relatives and allows for the inclusion of covariates. Using simulated genetic data, we demonstrated that the RV-GDT method has well-controlled type I error rates, even when applied to admixed populations and populations with substructure. It is more powerful than existing family-based RV association methods, particularly for the analysis of extended pedigrees and pedigrees with missing data. We analyzed whole-genome sequence data from families affected by Alzheimer disease to illustrate the application of the RV-GDT. Given the capability of the RV-GDT to adequately control for population admixture or substructure and analyze pedigrees with missing genotype data and its superior power over other family-based methods, it is an effective tool for elucidating the involvement of RVs in the etiology of complex traits.

Keywords: Alzheimer disease; TNK1; exome sequence data; extended and nuclear pedigrees; family-based association analysis; generalized disequilibrium test; missing data; pedigree disequilibrium test; rare variants; whole-genome sequence data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Pedigree Structures Used in the Simulation Studies (A) Discordant nuclear sib-pair: the family contains parents, an affected child, and an unaffected child. (B) Affected sib-pair: the nuclear family contains parents and two affected children. (C) Extended three-generation pedigree.
Figure 2
Figure 2
Power Comparisons of FBAT, RV-PDT, and RV-GDT for Extended Pedigrees with Family Members Missing Genotype Data Genetic variant data were generated for 1,000 extended pedigrees with ExAC non-Finnish European variant information. Different proportions of the rare nonsense, missense, and splice-site variants were deemed to be causal: 50% (A and D), 75% (B and E), and 100% (C and F) with an OR of 2.5. (A–C) Power comparisons when the probability that each founder was missing all genotype data ranged from 0% to 75%. (D–F) Power comparisons when the probability that each parent (founder or non-founder) was missing all genotype data ranged from 0% to 75%.

Similar articles

Cited by

References

    1. Eichler E.E., Flint J., Gibson G., Kong A., Leal S.M., Moore J.H., Nadeau J.H. Missing heritability and strategies for finding the underlying causes of complex disease. Nat. Rev. Genet. 2010;11:446–450. - PMC - PubMed
    1. Cirulli E.T., Goldstein D.B. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat. Rev. Genet. 2010;11:415–425. - PubMed
    1. Ott J., Kamatani Y., Lathrop M. Family-based designs for genome-wide association studies. Nat. Rev. Genet. 2011;12:465–474. - PubMed
    1. Li M., Boehnke M., Abecasis G.R. Efficient study designs for test of genetic association using sibship data and unrelated cases and controls. Am. J. Hum. Genet. 2006;78:778–792. - PMC - PubMed
    1. Mathieson I., McVean G. Differential confounding of rare and common variants in spatially structured populations. Nat. Genet. 2012;44:243–246. - PMC - PubMed

Publication types