Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies
- PMID: 15078859
- PMCID: PMC479119
- DOI: 10.1101/gr.1837404
Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies
Abstract
Recent studies have revealed that linkage disequilibrium (LD) patterns vary across the human genome with some regions of high LD interspersed by regions of low LD. A small fraction of SNPs (tag SNPs) is sufficient to capture most of the haplotype structure of the human genome. In this paper, we develop a method to partition haplotypes into blocks and to identify tag SNPs based on genotype data by combining a dynamic programming algorithm for haplotype block partitioning and tag SNP selection based on haplotype data with a variation of the expectation maximization (EM) algorithm for haplotype inference. We assess the effects of using either haplotype or genotype data in haplotype block identification and tag SNP selection as a function of several factors, including sample size, density or number of SNPs studied, allele frequencies, fraction of missing data, and genotyping error rate, using extensive simulations. We find that a modest number of haplotype or genotype samples will result in consistent block partitions and tag SNP selection. The power of association studies based on tag SNPs using genotype data is similar to that using haplotype data.
Figures







Similar articles
-
Haplotype block structure and its applications to association studies: power and study designs.Am J Hum Genet. 2002 Dec;71(6):1386-94. doi: 10.1086/344780. Epub 2002 Nov 18. Am J Hum Genet. 2002. PMID: 12439824 Free PMC article.
-
HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms.Bioinformatics. 2005 Jan 1;21(1):131-4. doi: 10.1093/bioinformatics/bth482. Epub 2004 Aug 27. Bioinformatics. 2005. PMID: 15333454
-
Haplotype and linkage disequilibrium architecture for human cancer-associated genes.Genome Res. 2002 Dec;12(12):1846-53. doi: 10.1101/gr.483802. Genome Res. 2002. PMID: 12466288 Free PMC article.
-
Tag SNP selection for association studies.Genet Epidemiol. 2004 Dec;27(4):365-74. doi: 10.1002/gepi.20028. Genet Epidemiol. 2004. PMID: 15372618 Review.
-
[Analysis and application of SNP and haplotype in the human genome].Yi Chuan Xue Bao. 2005 Aug;32(8):879-89. Yi Chuan Xue Bao. 2005. PMID: 16231744 Review. Chinese.
Cited by
-
Selecting additional tag SNPs for tolerating missing data in genotyping.BMC Bioinformatics. 2005 Nov 1;6:263. doi: 10.1186/1471-2105-6-263. BMC Bioinformatics. 2005. PMID: 16259642 Free PMC article.
-
FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium.BMC Bioinformatics. 2010 Jan 29;11:66. doi: 10.1186/1471-2105-11-66. BMC Bioinformatics. 2010. PMID: 20113476 Free PMC article.
-
Polymorphisms of the IGF1R gene and their genetic effects on chicken early growth and carcass traits.BMC Genet. 2008 Nov 7;9:70. doi: 10.1186/1471-2156-9-70. BMC Genet. 2008. PMID: 18990245 Free PMC article.
-
Personality traits as mediators in the association between SIRT1 rs12415800 polymorphism and depressive symptoms among Chinese college students.Front Psychiatry. 2023 Apr 14;14:1104664. doi: 10.3389/fpsyt.2023.1104664. eCollection 2023. Front Psychiatry. 2023. PMID: 37124257 Free PMC article.
-
An overview of population genetic data simulation.J Comput Biol. 2012 Jan;19(1):42-54. doi: 10.1089/cmb.2010.0188. Epub 2011 Dec 9. J Comput Biol. 2012. PMID: 22149682 Free PMC article. Review.
References
-
- Abecasis, G.R. and Cookson, W.O. 2000. GOLD—Graphical overview of linkage disequilibrium. Bioinformatics 16: 182-183. - PubMed
-
- Cardon, L.R., Ke, X., Lawrence, R., Carter, N., Rogers, J., Stavrides, G., Willey, D., Mullikin, J., Hunt, S., Bentley, D.R., et al. 2003. Towards a fine-scale linkage disequilibrium map of human chromosome 20. Am. J. Hum. Genet. 73 (Suppl): 271. - PubMed
-
- Clark, A.G. 1990. Inference of haplotypes from PCR-amplified samples of diploid populations. Mol. Biol. Evol. 7: 111-122. - PubMed
-
- Daly, M.J., Rioux, J.D., Schaffner, S.F., Hudson, T.J., and Lander, E.S. 2001. High-resolution haplotype structure in the human genome. Nat. Genet. 29: 229-232. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials