Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Sep;128(3):325-44.
doi: 10.1007/s00439-010-0854-z. Epub 2010 Jul 6.

Genome-wide association filtering using a highly locus-specific transmission/disequilibrium test

Affiliations

Genome-wide association filtering using a highly locus-specific transmission/disequilibrium test

María M Abad-Grau et al. Hum Genet. 2010 Sep.

Abstract

Multimarker transmission/disequilibrium tests (TDTs) are powerful association and linkage tests used to perform genome-wide filtering in the search for disease susceptibility loci. In contrast to case/control studies, they have a low rate of false positives for population stratification and admixture. However, the length of a region found in association with a disease is usually very large because of linkage disequilibrium (LD). Here, we define a multimarker proportional TDT (mTDT ( P )) designed to improve locus specificity in complex diseases that has good power compared to the most powerful multimarker TDTs. The test is a simple generalization of a multimarker TDT in which haplotype frequencies are used to weight the effect that each haplotype has on the whole measure. Two concepts underlie the features of the metric: the 'common disease, common variant' hypothesis and the decrease in LD with chromosomal distance. Because of this decrease, the frequency of haplotypes in strong LD with common disease variants decreases with increasing distance from the disease susceptibility locus. Thus, our haplotype proportional test has higher locus specificity than common multimarker TDTs that assume a uniform distribution of haplotype probabilities. Because of the common variant hypothesis, risk haplotypes at a given locus are relatively frequent and a metric that weights partial results for each haplotype by its frequency will be as powerful as the most powerful multimarker TDTs. Simulations and real data sets demonstrate that the test has good power compared with the best tests but has remarkably higher locus specificity, so that the association rate decreases at a higher rate with distance from a disease susceptibility or disease protective locus.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Association rate based on 100 simulations of 200 family trios as a function of the recombination rate using haplotypes of length 4 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 6 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 2
Fig. 2
Association rate based on 100 simulations of 200 family trios as a function of the recombination rate using haplotypes of length 10 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 4 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 3
Fig. 3
Association rate based on 100 simulations of 200 family trios as a function of the recombination rate using haplotypes of length 10 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 8 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 4
Fig. 4
Association rate based on 100 simulations of 500 family trios as a function of the recombination rate using haplotypes of length 4 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 2 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 5
Fig. 5
Association rate based on 100 simulations of 500 family trios as a function of the recombination rate using haplotypes of length 10 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 1.6 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 6
Fig. 6
Association rate based on 100 simulations of 500 family trios as a function of the recombination rate using haplotypes of length 10 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a relative risk of 2.4 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively
Fig. 7
Fig. 7
Association rate based on 100 simulations of 500 family trios as a function of the recombination rate using haplotypes of length 10 and different genotype models (rows). The first column shows results for one disease susceptibility locus and the second and third show results for two disease loci. A nominal level of α = 0.05 and a haplotype length of 10 were used for all plots. Results for mTDT P, mTDT and mTDT Y are plotted in red, blue and green, respectively. Simple lines show values for frequency of the disease mutation in the interval [0.2, 0.4] while lines with diamonds show results for mutation frequencies in the interval [0.1, 0.2]
Fig. 8
Fig. 8
Comparative TDT maps for loci, a KIAA0350 and b IRF5 data sets using sliding windows of width 6 and offset 1. Rows in gray below each TDT map (colored on a white background) show results for the IHMP data sets as a control test. Results correspond to the following TDTs from top to bottom: mTDT P (red), mTDT (blue), mTDT Y (green), mTDT YP (cyan), mTDT L1 (purple), mTDT L2 (scarlet), mTDT E (violet), mTDT 1T (orange), mTDT 1U (pink), mTDT LC (black) and mTDT SR (brown)

Similar articles

Cited by

References

    1. Abecasis GR, Martin R, Lewitzky S. Estimation of haplotype frequencies from diploid data. Am J Hum Genet. 2001;69:198. doi: 10.1086/321263. - DOI - PMC - PubMed
    1. Abramowitz M, Stegun I. Handbook of mathematical functions. New York: Dover; 1972.
    1. Betensky R, Rabinowitz D. Simple approximations for the maximal transmission/disequilibrium test with a multi-allelic marker. Ann Hum Genet. 2000;64:567–574. doi: 10.1046/j.1469-1809.2000.6460567.x. - DOI - PubMed
    1. Bodner W, Bonilla C. Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet. 2008;40:695–701. doi: 10.1038/ng.f.136. - DOI - PMC - PubMed
    1. Bourgain C, Genin E, Holopainen P, Mustalahti K, Mki M, Partanen J. Maximum identity length contrast: a powerful method for susceptibility gene detection in isolated populations. Am J Hum Genet. 2001;68:154–159. doi: 10.1086/316933. - DOI - PubMed

Substances