Accounting for haplotype phase uncertainty in linkage disequilibrium estimation
- PMID: 17968987
- DOI: 10.1002/gepi.20273
Erratum in
- Genet Epidemiol. 2008 Sep;32(6):586-7
Abstract
The characterization of linkage disequilibrium (LD) is applied in a variety of studies, including the identification of molecular determinants of the local recombination rate, the reconstruction of migration and population history, and the assessment of the role of positive selection in adaptation. LD estimates suffer from the phase uncertainty of the haplotypes used in their calculation, which reflects limitations of the algorithms used for haplotype estimation. We introduce an LD calculation method that deals with phase uncertainty by weighting all possible haplotype pairs according to their estimated probabilities as evaluated by PHASE. In contrast to the expectation-maximization (EM) algorithm as implemented in the HAPLOVIEW and GENETICS packages, our method considers haplotypes based on the entire genetic information available for the candidate region. We tested the method using simulated and real genotyping data. The results show that, for all practical purposes, the new method is advantageous in comparison with algorithms that calculate LD using only the most probable haplotype or two-locus haplotypes based on the EM algorithm. The new method deals especially well with low-LD regions, which contribute strongly to phase uncertainty. Altogether, the method is an attractive alternative to standard LD calculation procedures, including those based on the EM algorithm. We implemented the method in the R software environment, together with an interface to the popular haplotype estimation package PHASE.
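The weighting idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' R implementation, and it does not read PHASE output: the input layout (for each individual, a list of candidate haplotype pairs with a probability), the 0/1 allele coding, and the function name `weighted_ld` are assumptions made purely for illustration. Each candidate pair contributes its two haplotypes to the two-locus haplotype counts in proportion to its probability, and D, D' and r² are then computed from the resulting frequencies.

```python
from collections import defaultdict


def weighted_ld(individuals, i, j):
    """Probability-weighted LD between sites i and j (illustrative sketch).

    individuals: list with one entry per individual; each entry is a list of
    (hap_a, hap_b, prob) candidate haplotype pairs, where haplotypes are
    strings of '0'/'1' alleles. This layout is a hypothetical stand-in for
    per-individual pair probabilities such as those estimated by PHASE.
    Returns (D, signed D', r^2) for allele '1' at both sites.
    """
    counts = defaultdict(float)
    total = 0.0
    for pairs in individuals:
        # Renormalise so each individual contributes exactly two haplotypes.
        norm = sum(prob for _, _, prob in pairs)
        for hap_a, hap_b, prob in pairs:
            weight = prob / norm
            for hap in (hap_a, hap_b):
                counts[(hap[i], hap[j])] += weight
                total += weight

    freq = {key: value / total for key, value in counts.items()}
    p11 = freq.get(("1", "1"), 0.0)
    p_a = p11 + freq.get(("1", "0"), 0.0)  # frequency of allele '1' at site i
    p_b = p11 + freq.get(("0", "1"), 0.0)  # frequency of allele '1' at site j

    d = p11 - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else float("nan")

    # Normalise D by its maximum attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else float("nan")
    return d, d_prime, r2


# Two phase-certain homozygotes: only haplotypes '11' and '00' are observed.
certain = [[("11", "11", 1.0)], [("00", "00", 1.0)]]
# One double heterozygote whose two possible phasings are equally probable.
ambiguous = [[("11", "00", 0.5), ("10", "01", 0.5)]]

d_certain, dprime_certain, r2_certain = weighted_ld(certain, 0, 1)
d_ambiguous, _, r2_ambiguous = weighted_ld(ambiguous, 0, 1)
```

In this toy input the phase-certain sample shows complete LD, whereas the equally weighted phasings of the double heterozygote cancel out. Taking only the most probable pair for that individual would instead force one phasing to be chosen arbitrarily.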
Similar articles
- A novel method to express SNP-based genetic heterogeneity, Psi, and its use to measure linkage disequilibrium for multiple SNPs, D(g), and to estimate absolute maximum of haplotype frequency. Genet Epidemiol. 2007 Nov;31(7):709-26. doi: 10.1002/gepi.20235. PMID: 17508358
- Penalized estimation of haplotype frequencies. Bioinformatics. 2008 Jul 15;24(14):1596-602. doi: 10.1093/bioinformatics/btn236. Epub 2008 May 16. PMID: 18487240
- HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination. Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1. PMID: 15231536
- Algorithms for inferring haplotypes. Genet Epidemiol. 2004 Dec;27(4):334-47. doi: 10.1002/gepi.20024. PMID: 15368348 Review.
- [Linkage disequilibrium in the human genome and its exploitation]. Arch Inst Pasteur Tunis. 2005;82(1-4):9-21. PMID: 16929750 Review. French.