Fine mapping of disease genes via haplotype clustering
- PMID: 16385468
- DOI: 10.1002/gepi.20134
Fine mapping of disease genes via haplotype clustering
Abstract
We propose an algorithm for analysing SNP-based population association studies, which is a development of that introduced by Molitor et al. [2003: Am J Hum Genet 73:1368-1384]. It uses clustering of haplotypes to overcome the major limitations of many current haplotype-based approaches. We define a between-haplotype score that is simple, yet appears to capture much of the information about evolutionary relatedness of the haplotypes in the vicinity of a (unobserved) putative causal locus. Haplotype clusters can then be defined via a putative ancestral haplotype and a cut-off distance. The number of an individual's two haplotypes that lie within the cluster predicts the individual's genotype at the causal locus. This predicted genotype can then be investigated for association with the phenotype of interest. We implement our approach within a Markov-chain Monte Carlo algorithm that, in effect, searches over locations and ancestral haplotypes to identify large, case-rich clusters. The algorithm successfully fine-maps a causal mutation in a test analysis using real data, and achieves almost 98% accuracy in predicting the genotype at the causal locus. A simulation study indicates that the new algorithm is substantially superior to alternative approaches, and it also allows us to identify situations in which multi-point approaches can substantially improve over single-SNP analyses. Our algorithm runs quickly and there is scope for extension to a wide range of disease models and genomic scales.
(c) 2005 Wiley-Liss, Inc.
Similar articles
-
Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes.Genet Epidemiol. 2005 Sep;29(2):91-107. doi: 10.1002/gepi.20080. Genet Epidemiol. 2005. PMID: 15940704
-
Disease association tests by inferring ancestral haplotypes using a hidden markov model.Bioinformatics. 2008 Apr 1;24(7):972-8. doi: 10.1093/bioinformatics/btn071. Epub 2008 Feb 23. Bioinformatics. 2008. PMID: 18296746
-
Accounting for haplotype uncertainty in matched association studies: a comparison of simple and flexible techniques.Genet Epidemiol. 2005 Apr;28(3):261-72. doi: 10.1002/gepi.20061. Genet Epidemiol. 2005. PMID: 15637718
-
Algorithms for inferring haplotypes.Genet Epidemiol. 2004 Dec;27(4):334-47. doi: 10.1002/gepi.20024. Genet Epidemiol. 2004. PMID: 15368348 Review.
-
Finding starting points for Markov chain Monte Carlo analysis of genetic data from large and complex pedigrees.Genet Epidemiol. 2003 Jul;25(1):14-24. doi: 10.1002/gepi.10243. Genet Epidemiol. 2003. PMID: 12813723 Review.
Cited by
-
Characterizing a region on BTA11 affecting β-lactoglobulin content of milk using high-density genotyping and haplotype grouping.BMC Genet. 2017 Feb 22;18(1):17. doi: 10.1186/s12863-017-0483-9. BMC Genet. 2017. PMID: 28222684 Free PMC article.
-
TNFAIP2 Inhibits Early TNFα-Induced NF-x03BA;B Signaling and Decreases Survival in Septic Shock Patients.J Innate Immun. 2016;8(1):57-66. doi: 10.1159/000437330. Epub 2015 Sep 9. J Innate Immun. 2016. PMID: 26347487 Free PMC article.
-
Genome-wide association analyses of quantitative traits: the GAW16 experience.Genet Epidemiol. 2009;33 Suppl 1(Suppl 1):S13-8. doi: 10.1002/gepi.20466. Genet Epidemiol. 2009. PMID: 19924711 Free PMC article.
-
A flexible Bayesian framework for modeling haplotype association with disease, allowing for dominance effects of the underlying causative variants.Am J Hum Genet. 2006 Oct;79(4):679-94. doi: 10.1086/508264. Epub 2006 Aug 31. Am J Hum Genet. 2006. PMID: 16960804 Free PMC article.
-
hapConstructor: automatic construction and testing of haplotypes in a Monte Carlo framework.Bioinformatics. 2008 Sep 15;24(18):2105-7. doi: 10.1093/bioinformatics/btn359. Epub 2008 Jul 23. Bioinformatics. 2008. PMID: 18653522 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Molecular Biology Databases