Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;7(2):e29613.
doi: 10.1371/journal.pone.0029613. Epub 2012 Feb 17.

Sample reproducibility of genetic association using different multimarker TDTs in genome-wide association studies: characterization and a new approach

Affiliations

Sample reproducibility of genetic association using different multimarker TDTs in genome-wide association studies: characterization and a new approach

Mara M Abad-Grau et al. PLoS One. 2012.

Abstract

Multimarker Transmission/Disequilibrium Tests (TDTs) are very robust association tests to population admixture and structure which may be used to identify susceptibility loci in genome-wide association studies. Multimarker TDTs using several markers may increase power by capturing high-degree associations. However, there is also a risk of spurious associations and power reduction due to the increase in degrees of freedom. In this study we show that associations found by tests built on simple null hypotheses are highly reproducible in a second independent data set regardless the number of markers. As a test exhibiting this feature to its maximum, we introduce the multimarker 2-Groups TDT (mTDT(2G)), a test which under the hypothesis of no linkage, asymptotically follows a χ2 distribution with 1 degree of freedom regardless the number of markers. The statistic requires the division of parental haplotypes into two groups: disease susceptibility and disease protective haplotype groups. We assessed the test behavior by performing an extensive simulation study as well as a real-data study using several data sets of two complex diseases. We show that mTDT(2G) test is highly efficient and it achieves the highest power among all the tests used, even when the null hypothesis is tested in a second independent data set. Therefore, mTDT(2G) turns out to be a very promising multimarker TDT to perform genome-wide searches for disease susceptibility loci that may be used as a preprocessing step in the construction of more accurate genetic models to predict individual susceptibility to complex diseases.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Association rates of and using a second data set to test reproducibility.
Results for 100 simulations of 250 family trios as a function of the recombination rate using the dominant and one-locus genetic model and haplotypes of lengths formula image (left plot), formula image (plot in the middle) and formula image (right plot). A nominal level of formula image and a relative risk of formula image were used for all plots. Results for formula image and formula image are plotted in purple circles and blue triangles respectively. Dashed lines show results for the data subset (125 trios randomly chosen) used to build the model while solid lines show results for a second data subset (the remaining 125 trios) used to test reproducibility.
Figure 2
Figure 2. Association rates under the holdout approach using a second data set to test reproducibility.
Results for 100 simulations of formula image family trios as a function of the recombination rate using the recessive and one-locus genetic model and haplotypes of lengths formula image (left plot), formula image (plot in the middle) and formula image (right plot). A nominal level of formula image and a relative risk of formula image were used for all plots. Results for formula image formula image formula image and formula image i.e. all tests were applied under the holdout approach, are plotted in purple circles, blue triangles, green squares and red diamonds respectively. Dashed lines show results for a data subset of formula image trios randomly chosen while solid lines show results for a second data subset of formula image trios used to test reproducibility of the holdout approach.
Figure 3
Figure 3. Association rates for different proportions of missing haplotypes.
Results for 100 simulations of formula image family trios as a function of the proportion of missing haplotypes using the additive and one-locus genetic model and haplotypes of lengths formula image (left plot), formula image (plot in the middle) and formula image (right plot). A nominal level of formula image and a relative risk of formula image were used for all plots. Results for formula image formula image formula image and formula image i.e. all tests were applied under the holdout approach, are plotted in purple circles, blue triangles, green squares and red diamonds respectively.
Figure 4
Figure 4. Sliding window maps for the -affected data set.
Window size is formula image TDTs used were formula image (red diamonds), formula image (green squares), formula image (purple circles) and formula image (blue triangles).
Figure 5
Figure 5. Sliding window maps for the -affected data set.
Window size is formula image TDTs used were formula image (red diamonds), formula image (green squares), formula image (purple circles) and formula image (blue triangles).
Figure 6
Figure 6. Sliding window maps for the -affected data set.
Window size is formula image TDTs used were formula image (red diamonds), formula image (green squares), formula image (purple circles) and formula image (blue triangles).

Similar articles

Cited by

References

    1. Spielman RS, McGinnis RE, Ewens WJ. Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). American Journal of Human Genetics. 1993;52:506–516. - PMC - PubMed
    1. Zhang S, Sha Q, Chen H, Dong J, Jiang R. Transmission/Disequilibrium test based on haplotype sharing for tightly linked markers. American Journal of Human Genetics. 2003;73:566–79. - PMC - PubMed
    1. BickeBöller H, Clerget-Darpoux F. Statistical properties of the allelic and genotypic transmission/disequilibrium test for multiallelic markers. Genetic Epidemiology. 1995;12:865–70. - PubMed
    1. Sham PC, Curtis D. An extended transmission/disequilibrium test (tdt) for multiallelic marker loci. Annals of Human Genetics. 1995;59:323–336. - PubMed
    1. Yu K, Gu CC, Xiong C, An P, Province M. Global Transmission/Disequilibrium tests based on haplotype sharing in multiple candidate genes. Genetic Epidemiology. 2005;29:223–35. - PubMed

Publication types

Substances