Haplotype and missing data inference in nuclear families
- PMID: 15256514
- PMCID: PMC509272
- DOI: 10.1101/gr.2204604
Abstract
Determining linkage phase from population samples with statistical methods is accurate only within regions of high linkage disequilibrium (LD). Yet affected individuals in genetic mapping studies, including case-control studies, may share sequences identical-by-descent that stretch over tens to hundreds of kilobases, quite possibly across regions of low LD in the population. At the same time, inferring phase from nuclear families may be hampered by missing family members, missing genotypes, and the noninformativity of certain genotype patterns. In this study, we reformulate our previous haplotype reconstruction algorithm, and its associated computer program, to phase parents with information derived from population samples as well as from their offspring. In applications of our algorithm to 100-kb stretches, simulated in accordance with a Wright-Fisher model with typical levels of LD in humans, we find that phase reconstruction for 160 trios with 10% missing data is highly accurate (>90%) over the entire length. Furthermore, our algorithm can estimate allelic status for missing data at high accuracy (>95%). Finally, the input capacity of the program is vast, easily handling thousands of segregating sites in ≥1000 chromosomes.
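To make "the noninformativity of certain genotype patterns" concrete, the following is a minimal sketch of the textbook Mendelian rule for a single biallelic site in a father-mother-child trio. It is not the algorithm described in the abstract, which also draws on population samples. The genotype coding (0, 1, 2 copies of the alternate allele, `None` for missing) and the function names `transmissible` and `phase_site` are assumptions made for illustration.

```python
from itertools import product


def transmissible(genotype):
    """Alleles (0 = reference, 1 = alternate) a parent could pass on.

    Assumed coding: genotype is the alternate-allele count (0, 1, 2),
    or None when the genotype is missing.
    """
    if genotype is None:  # missing genotype: either allele is possible
        return {0, 1}
    return {0: {0}, 1: {0, 1}, 2: {1}}[genotype]


def phase_site(father, mother, child):
    """Return (paternal_allele, maternal_allele) transmitted to the child.

    Returns None when more than one transmission is consistent with the
    observed genotypes (the site is noninformative), and raises ValueError
    when no transmission is consistent (a Mendelian inconsistency).
    """
    consistent = [
        (pa, ma)
        for pa, ma in product(sorted(transmissible(father)),
                              sorted(transmissible(mother)))
        if child is None or pa + ma == child
    ]
    if not consistent:
        raise ValueError("Mendelian inconsistency")
    if len(consistent) == 1:
        return consistent[0]
    return None  # ambiguous: family data alone cannot resolve this site


# Heterozygous father, homozygous-reference mother, heterozygous child:
# the alternate allele must have come from the father.
resolved = phase_site(1, 0, 1)

# All three heterozygous: the classic noninformative pattern.
ambiguous = phase_site(1, 1, 1)

# Missing paternal genotype can still be resolved from mother and child.
from_missing = phase_site(None, 0, 1)
```

Sites that return `None` here are the ones where family data alone leave phase unresolved, which is where information from population samples, as the abstract proposes, would have to fill the gap.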
Copyright 2004 Cold Spring Harbor Laboratory Press