Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data
- PMID: 16451556
- PMCID: PMC1866839
- DOI: 10.1186/1471-2156-6-S1-S100
Abstract
We recently described a method for linkage disequilibrium (LD) mapping that uses cladistic analysis of phased single-nucleotide polymorphism (SNP) haplotypes in a logistic regression framework. However, haplotypes are often not available and cannot be deduced with certainty from unphased genotypes. One possible two-stage approach is to infer the phase of multilocus genotype data and analyze the resulting haplotypes as if they were known. Here, haplotypes are inferred using the expectation-maximization (EM) algorithm, and the best-guess phase assignment for each individual is analyzed. However, inferring haplotypes from phase-unknown data is prone to error, and this should be taken into account in the subsequent analysis. An alternative approach is to analyze the phase-unknown multilocus genotypes themselves. Here we present a generalization of the method for phase-known haplotype data to the case of unphased SNP genotypes. Our approach is designed for high-density SNP data, so we opted to analyze the simulated dataset. The marker spacing in the initial screen was too large for our method to be effective, so we used the answers provided to request further data in regions around the disease loci and in null regions. Power to detect the disease loci, accuracy in localizing the true site of the locus, and false-positive error rates are reported for the inferred-haplotype and unphased-genotype methods. For these data, analyzing inferred haplotypes outperforms analysis of genotypes. As expected, our results suggest that when there is little or no LD between a disease locus and the flanking region, there is no chance of detecting the locus unless the disease variant itself is genotyped.
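The two-stage approach mentioned in the abstract (EM estimation of haplotype frequencies, followed by a best-guess phase assignment per individual) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes only two biallelic SNPs, random mating, and genotypes coded as counts (0, 1, or 2) of the "1" allele at each SNP. All function and variable names are hypothetical.

```python
# Four possible haplotypes over two biallelic SNPs (allele 0 or 1 at each site).
HAPLOTYPES = [(0, 0), (0, 1), (1, 0), (1, 1)]


def compatible_pairs(genotype):
    """Unordered haplotype index pairs (i <= j) whose alleles sum to the genotype."""
    pairs = []
    for i, h1 in enumerate(HAPLOTYPES):
        for j in range(i, len(HAPLOTYPES)):
            h2 = HAPLOTYPES[j]
            if all(a + b == g for a, b, g in zip(h1, h2, genotype)):
                pairs.append((i, j))
    return pairs


def pair_probability(pair, freqs):
    """Probability of an unordered haplotype pair under random mating."""
    i, j = pair
    p = freqs[i] * freqs[j]
    return p if i == j else 2.0 * p


def em_haplotype_frequencies(genotypes, n_iter=100, tol=1e-10):
    """Estimate haplotype frequencies from unphased genotypes by EM."""
    freqs = [0.25] * 4  # start from uniform frequencies
    for _ in range(n_iter):
        counts = [0.0] * 4
        # E-step: split each individual over its compatible haplotype pairs.
        for g in genotypes:
            pairs = compatible_pairs(g)
            weights = [pair_probability(p, freqs) for p in pairs]
            total = sum(weights)
            for (i, j), w in zip(pairs, weights):
                counts[i] += w / total
                counts[j] += w / total
        # M-step: expected haplotype counts divided by number of chromosomes.
        new_freqs = [c / (2 * len(genotypes)) for c in counts]
        converged = max(abs(a - b) for a, b in zip(freqs, new_freqs)) < tol
        freqs = new_freqs
        if converged:
            break
    return freqs


def best_guess_phase(genotype, freqs):
    """Most probable haplotype pair for one individual given the frequencies."""
    i, j = max(compatible_pairs(genotype), key=lambda p: pair_probability(p, freqs))
    return HAPLOTYPES[i], HAPLOTYPES[j]


# Toy data: only the double heterozygotes (1, 1) are phase-ambiguous.
genotypes = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
freqs = em_haplotype_frequencies(genotypes)
phase = best_guess_phase((1, 1), freqs)
```

In this toy dataset the unambiguous individuals carry only the 00 and 11 haplotypes, so the EM pushes the ambiguous double heterozygotes toward the 00/11 resolution. Analyzing only the best-guess pair discards the residual phase uncertainty, which is the source of error the abstract cautions about.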