Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond

Elizabeth M Blue¹, Lei Sun, Nathan L Tintle, Ellen M Wijsman

Affiliations

PMID: 25112184
PMCID: PMC4135526
DOI: 10.1002/gepi.21821

Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond

Elizabeth M Blue et al. Genet Epidemiol. 2014 Sep.

. 2014 Sep;38 Suppl 1(0 1):S21-8.

doi: 10.1002/gepi.21821.

Authors

Elizabeth M Blue¹, Lei Sun, Nathan L Tintle, Ellen M Wijsman

Affiliation

¹ Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, United States of America.

PMID: 25112184
PMCID: PMC4135526
DOI: 10.1002/gepi.21821

Abstract

When analyzing family data, we dream of perfectly informative data, even whole-genome sequences (WGSs) for all family members. Reality intervenes, and we find that next-generation sequencing (NGS) data have errors and are often too expensive or impossible to collect on everyone. The Genetic Analysis Workshop 18 working groups on quality control and dropping WGSs through families using a genome-wide association framework focused on finding, correcting, and using errors within the available sequence and family data, developing methods to infer and analyze missing sequence data among relatives, and testing for linkage and association with simulated blood pressure. We found that single-nucleotide polymorphisms, NGS data, and imputed data are generally concordant but that errors are particularly likely at rare variants, for homozygous genotypes, within regions with repeated sequences or structural variants, and within sequence data imputed from unrelated individuals. Admixture complicated identification of cryptic relatedness, but information from Mendelian transmission improved error detection and provided an estimate of the de novo mutation rate. Computationally, fast rule-based imputation was accurate but could not cover as many loci or subjects as more computationally demanding probability-based methods. Incorporating population-level data into pedigree-based imputation methods improved results. Observed data outperformed imputed data in association testing, but imputed data were also useful. We discuss the strengths and weaknesses of existing methods and suggest possible future directions, such as improving communication between data collectors and data analysts, establishing thresholds for and improving imputation quality, and incorporating error into imputation and analytical models.

Keywords: de novo mutation; inference; next-generation sequence data; power; type I error.

PubMed Disclaimer

References

1. Abecasis GR, Cherny SS, Cookson WO, Cardon LR. Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet. 2002;30:97–101. - PubMed
1. Awadalla P, Gauthier J, Myers RA, Casals F, Hamdan FF, Griffing AR, Cote M, Henrion E, Spiegelman D, Tarabeux J, et al. Direct Measure of the De Novo Mutation Rate in Autism and Schizophrenia Cohorts. Am J Hum Genet. 2010;87:316–324. - PMC - PubMed
1. Beecham GW, Martin ER, Gilbert JR, Haines JL, Pericak-Vance MA. APOE is not Associated with Alzheimer Disease: a Cautionary tale of Genotype Imputation. Ann Hum Genet. 2010;74:189–194. - PMC - PubMed
1. Blackburn AN, Dean AK, Lehman DM. Imputation in families using a heuristic phasing approach. BMC Proc. in press. - PMC - PubMed
1. Browning BL, Browning SR. A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals. Am J Hum Genet. 2009;84:210–223. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond

Affiliation

Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond

Authors

Affiliation

Abstract

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources