Haplotypic structure of the X chromosome in the COGA population sample and the quality of its reconstruction by extant software packages
- PMID: 16451691
- PMCID: PMC1866704
- DOI: 10.1186/1471-2156-6-S1-S77
Haplotypic structure of the X chromosome in the COGA population sample and the quality of its reconstruction by extant software packages
Abstract
Background: The haplotypes of the X chromosome are accessible to direct count in males, whereas the diplotypes of the females may be inferred knowing the haplotype of their sons or fathers. Here, we investigated: 1) the possible large-scale haplotypic structure of the X chromosome in a Caucasian population sample, given the single-nucleotide polymorphism (SNP) maps and genotypes provided by Illumina and Affimetrix for Genetic Analysis Workshop 14, and, 2) the performances of widely used programs in reconstructing haplotypes from population genotypic data, given their known distribution in a sample of unrelated individuals.
Results: All possible unrelated mother-son pairs of Caucasian ancestry (N = 104) were selected from the 143 families of the Collaborative Study on the Genetics of Alcoholism pedigree files, and the diplotypes of the mothers were inferred from the X chromosomes of their sons. The marker set included 313 SNPs at an average density of 0.47 Mb. Linkage disequilibrium between pairs of markers was computed by the parameter D', whereas for measuring multilocus disequilibrium, we developed here an index called D*, and applied it to all possible sliding windows of 5 markers each. Results showed a complex pattern of haplotypic structure, with regions of low linkage disequilibrium separated by regions of high values of D*. The following programs were evaluated for their accuracy in inferring population haplotype frequencies: 1) ARLEQUIN 2.001; 2) PHASE 2.1.1; 3) SNPHAP 1.1; 4) HAPLOBLOCK 1.2; 5) HAPLOTYPER 1.0. Performances were evaluated by Pearson correlation (r) coefficient between the true and the inferred distribution of haplotype frequencies.
Conclusion: The SNP haplotypic structure of the X chromosome is complex, with regions of high haplotype conservation interspersed among regions of higher haplotype diversity. All the tested programs were accurate (r = 1) in reconstructing the distribution of haplotype frequencies in case of high D* values. However, only the program PHASE realized a high correlation coefficient (r > 0.7) in conditions of low linkage disequilibrium.
Figures


Similar articles
-
Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data.BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S100. doi: 10.1186/1471-2156-6-S1-S100. BMC Genet. 2005. PMID: 16451556 Free PMC article.
-
Assessment and implications of linkage disequilibrium in genome-wide single-nucleotide polymorphism and microsatellite panels.Genet Epidemiol. 2005;29 Suppl 1:S72-6. doi: 10.1002/gepi.20112. Genet Epidemiol. 2005. PMID: 16342185
-
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1. Bioinformatics. 2005. PMID: 15231536
-
[Analysis and application of haplotype in forensic medicine].Fa Yi Xue Za Zhi. 2009 Apr;25(2):133-7. Fa Yi Xue Za Zhi. 2009. PMID: 19537256 Review. Chinese.
-
Potential forensic use of a 33 X-InDel panel in the Argentinean population.Int J Legal Med. 2017 Jan;131(1):107-112. doi: 10.1007/s00414-016-1399-z. Epub 2016 Jun 9. Int J Legal Med. 2017. PMID: 27282766 Review.
Cited by
-
Merino and Merino-derived sheep breeds: a genome-wide intercontinental study.Genet Sel Evol. 2015 Aug 14;47(1):64. doi: 10.1186/s12711-015-0139-z. Genet Sel Evol. 2015. PMID: 26272467 Free PMC article.
-
Empirical vs Bayesian approach for estimating haplotypes from genotypes of unrelated individuals.BMC Genet. 2007 Jan 29;8:2. doi: 10.1186/1471-2156-8-2. BMC Genet. 2007. PMID: 17261196 Free PMC article.
-
Linkage analysis identifies a novel locus for restless legs syndrome on chromosome 2q in a South Tyrolean population isolate.Am J Hum Genet. 2006 Oct;79(4):716-23. doi: 10.1086/507875. Epub 2006 Aug 14. Am J Hum Genet. 2006. PMID: 16960808 Free PMC article.
-
Shape-IT: new rapid and accurate algorithm for haplotype inference.BMC Bioinformatics. 2008 Dec 16;9:540. doi: 10.1186/1471-2105-9-540. BMC Bioinformatics. 2008. PMID: 19087329 Free PMC article.
References
-
- Clark AG. Inference of haplotypes from PCR-amplified samples of diploid populations. Mol Biol Evol. 1990;7:111–122. - PubMed
-
- Excoffier L, Slatkin M. Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Mol Biol Evol. 1995;12:921–927. - PubMed
-
- Hawley ME, Kidd KK. HAPLO: a program using the EM algorithm to estimate the frequencies of multi-site haplotypes. J Hered. 1995;86:409–411. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous