Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 Apr;72(4):891-902.
doi: 10.1086/373881. Epub 2003 Feb 27.

On the identification of disease mutations by the analysis of haplotype similarity and goodness of fit

Affiliations
Comparative Study

On the identification of disease mutations by the analysis of haplotype similarity and goodness of fit

Jung-Ying Tzeng et al. Am J Hum Genet. 2003 Apr.

Abstract

The observation that haplotypes from a particular region of the genome differ between affected and unaffected individuals or between chromosomes transmitted to affected individuals versus those not transmitted is sound evidence for a disease-liability mutation in the region. Tests for differentiation of haplotype distributions often take the form of either Pearson's chi(2) statistic or tests based on the similarity among haplotypes in the different populations. In this article, we show that many measures of haplotype similarity can be expressed in the same quadratic form, and we give the general form of the variance. As we describe, these methods can be applied to either phase-known or phase-unknown data. We investigate the performance of Pearson's chi(2) statistic and haplotype similarity tests through use of evolutionary simulations. We show that both approaches can be powerful, but under quite different conditions. Moreover, we show that the power of both approaches can be enhanced by clustering rare haplotypes from the distributions before performing a test.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Distribution of haplotypes in affected (top) and unaffected (bottom) individuals for four populations. In populations I and II, the disease mutation occurred on a common background; in population III, it occurred on a haplotype so rare that it was not sampled in our simulation of the controls; and in population IV, it occurred on a haplotype that is rare in the normal population. In these bar plots, haplotypes that differ by a single mutation tend to appear adjacent to one another.
Figure  2
Figure 2
Performance of GOF and similarity statistics, in terms of power. The dotted, dashed, dot-dashed, and solid lines display the matching, count, length, and GOF statistics, respectively. The results represent nine separate populations (X-axis) ordered by power, which are connected to facilitate visualization. For panels A–C, the disease mutation occurred on a common background; for panel D, the disease mutation occurred on a rare background. Tests are performed using six-marker haplotypes (A), three-marker haplotypes (B and D) and three-marker genotypes (C).

References

    1. Bourgain C, Génin E, Holopainen P, Mustalahti K, Mäki M, Partanen J (2001) Use of closely related affected individuals for the genetic study of complex diseases in founder populations. Am J Hum Genet 68:154–159 - PMC - PubMed
    1. Bourgain C, Génin E, Quesneville H, Clerget-Darpoux F (2000) Search for multifactorial disease susceptibility genes in founder populations. Ann Hum Genet 64:255–265 - PubMed
    1. Cheung VG, Nelson SF (1998) Genomic mismatch scanning identifies human genomic DNA shared identical by descent. Genomics 47:1–7 - PubMed
    1. Clayton DG (1999) A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission. Am J Hum Genet 65:1170–1177 - PMC - PubMed
    1. Culverhouse R, Lin J, Liu KY, Suarez BK (2001) Exploiting linkage disequilibrium in population isolates. Genet Epidemiol 21 Suppl 1:S429–S434 - PubMed

Publication types

Substances