Incorporating genotyping uncertainty in haplotype frequency estimation in pedigree studies
- PMID: 17536211
- DOI: 10.1159/000102990
Incorporating genotyping uncertainty in haplotype frequency estimation in pedigree studies
Abstract
Aims: Haplotype frequency estimation is indispensable in studies of human genetics based on haplotypes since studies based on haplotypes are likely to yield more information than those based on single SNP marker. However, most existing algorithms estimate haplotype frequencies under the assumption that all of the genotype data sets are correct. To date, nearly all large genotype data sets have errors, and studies have demonstrated that even a small quantity of genotyping errors can have enormous impact on haplotype frequency estimation.
Methods: Although the GenoSpectrum (GS)-EM algorithm which estimates haplotype frequencies incorporating genotyping uncertainty has been presented recently [1], it can only be suitable for independent individuals rather than dependent pedigree data. In this paper, we describe a new EM algorithm, called GS-PEM, that calculates maximum likelihood estimates (MLEs) of haplotype frequencies based on all possible multilocus genotypes (GenoSpectrum) of each member of the pedigrees through making use of the dependence information of relatives.
Results and conclusion: We evaluate the performance of the GS-PEM by simulation studies and find that our GS-PEM can reduce the impact induced by the genotyping errors in haplotype frequency estimation.
Copyright 2007 S. Karger AG, Basel.
Similar articles
-
Maximum-likelihood estimation of haplotype frequencies in nuclear families.Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323. Genet Epidemiol. 2004. PMID: 15185400
-
Haplotype inference for population data with genotyping errors.Biom J. 2009 Aug;51(4):644-58. doi: 10.1002/bimj.200800215. Biom J. 2009. PMID: 19688759
-
[The use of the expectation-maximization (EM) algorithm for maximum likelihood estimation of gametic frequencies of multilocus polymorphic codominant systems based on sampled population data].Genetika. 2002 Mar;38(3):407-18. Genetika. 2002. PMID: 11963570 Russian.
-
Algorithms for inferring haplotypes.Genet Epidemiol. 2004 Dec;27(4):334-47. doi: 10.1002/gepi.20024. Genet Epidemiol. 2004. PMID: 15368348 Review.
-
[Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 2007 Dec;24(6):660-5. Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 2007. PMID: 18067078 Review. Chinese.
Cited by
-
Associations between polymorphisms in the myostatin gene with calving difficulty and carcass merit in cattle.J Anim Sci. 2023 Jan 3;101:skad371. doi: 10.1093/jas/skad371. J Anim Sci. 2023. PMID: 37935361 Free PMC article.
-
Estimating the single nucleotide polymorphism genotype misclassification from routine double measurements in a large epidemiologic sample.Am J Epidemiol. 2008 Oct 15;168(8):878-89. doi: 10.1093/aje/kwn208. Epub 2008 Sep 12. Am J Epidemiol. 2008. PMID: 18791193 Free PMC article.
-
Changes in capture availability due to infection can lead to detectable biases in population-level infectious disease parameters.PeerJ. 2024 Feb 29;12:e16910. doi: 10.7717/peerj.16910. eCollection 2024. PeerJ. 2024. PMID: 38436008 Free PMC article.
-
Impact of genotyping errors on the type I error rate and the power of haplotype-based association methods.BMC Genet. 2009 Jan 29;10:3. doi: 10.1186/1471-2156-10-3. BMC Genet. 2009. PMID: 19178712 Free PMC article.
-
Marker genotyping error effects on genomic predictions under different genetic architectures.Mol Genet Genomics. 2021 Jan;296(1):79-89. doi: 10.1007/s00438-020-01728-z. Epub 2020 Sep 29. Mol Genet Genomics. 2021. PMID: 32995954
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials