Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;7(12):e51589.
doi: 10.1371/journal.pone.0051589. Epub 2012 Dec 17.

Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study

Affiliations

Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study

Ming-Huei Chen et al. PLoS One. 2012.

Abstract

Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Box plots of imputation certainty in FHS imputed samples.
Figure 2
Figure 2. Scatter plot of imputation certainty against MAF.
Figure 3
Figure 3. Scatter plot of MAF in well-genotyped sample and filtered (person_specific imputation certainty greater than 50%) imputed sample.
Figure 4
Figure 4. Scatter plots of –log10(p-value) from 550K GWAS and GWAS using incorporated genotype data.
Figure 5
Figure 5. Example for pedigree splitting and trimming.

References

    1. Li Y, Willer C, Sanna S, Abecasis GR (2009) Genotype Imputation. Annu Rev Genomics Hum Genet 10: 387–406. - PMC - PubMed
    1. Chen WM, Abecasis GR (2007) Family-based association tests for genomewide association scans. Am J Hum Genet 81: 913–926. - PMC - PubMed
    1. Lander ES, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84: 2363–2367. - PMC - PubMed
    1. Elston RC, Stewart J (1971) A general model for the genetic analysis of pedigree data. Hum Hered 21: 523–542. - PubMed
    1. Abecasis GR, Cherny SS, Cookson WO, Cardon LR (2002) Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30: 97–101. - PubMed

Publication types

Substances