Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Aug;69(2):371-80.
doi: 10.1086/321981. Epub 2001 Jul 5.

A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data

Affiliations

A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data

D Gordon et al. Am J Hum Genet. 2001 Aug.

Abstract

The present study assesses the effects of genotyping errors on the type I error rate of a particular transmission/disequilibrium test (TDT(std)), which assumes that data are errorless, and introduces a new transmission/disequilibrium test (TDT(ae)) that allows for random genotyping errors. We evaluate the type I error rate and power of the TDT(ae) under a variety of simulations and perform a power comparison between the TDT(std) and the TDT(ae), for errorless data. Both the TDT(std) and the TDT(ae) statistics are computed as two times a log-likelihood difference, and both are asymptotically distributed as chi(2) with 1 df. Genotype data for trios are simulated under a null hypothesis and under an alternative (power) hypothesis. For each simulation, errors are introduced randomly via a computer algorithm with different probabilities (called "allelic error rates"). The TDT(std) statistic is computed on all trios that show Mendelian consistency, whereas the TDT(ae) statistic is computed on all trios. The results indicate that TDT(std) shows a significant increase in type I error when applied to data in which inconsistent trios are removed. This type I error increases both with an increase in sample size and with an increase in the allelic error rates. TDT(ae) always maintains correct type I error rates for the simulations considered. Factors affecting the power of the TDT(ae) are discussed. Finally, the power of TDT(std) is at least that of TDT(ae) for simulations with errorless data. Because data are rarely error free, we recommend that researchers use methods, such as the TDT(ae), that allow for errors in genotype data.

PubMed Disclaimer

Figures

Figure  1
Figure 1
Average power for the TDTstd and TDTae statistics, for errorless data sets in which the 1-allele frequency = .25, pd=.2, and %LD = 50, and for which the disease-locus model is a fully penetrant recessive locus. Each average is computed over 250 replicates. The suffixes “-std” and “-ae” in the Significance Level legend indicate power for the TDTstd and TDTae statistics, respectively.

References

Electronic-Database Information

    1. SIMULATE ftp site, ftp://linkage.rockefeller.edu/software/simulate/ (for SIMULATE software)
    1. Statgen Software, http://watson.hgen.pitt.edu/register/soft_doc.html (for FASTSLINK software)

References

    1. Akey JM, Zhang K, Xiong M, Doris P, Jin L (2001) The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures. Am J Hum Genet 68:1447–1456 - PMC - PubMed
    1. Box GEP, Hunter WG, Hunter JS (1978) Statistics for experimenters. John Wiley & Sons, New York
    1. Broman KW (1999) Cleaning genotype data. Genet Epidemiol 17 Suppl 1:S79–S83 - PubMed
    1. Brzustowicz LM, Merette C, Xie X, Townsend T, Gilliam C, Ott J (1993) Molecular and statistical approaches to the detection and correction of errors in genotype databases. Am J Hum Genet 53:1137–1145 - PMC - PubMed
    1. Buetow KH (1991) Influence of aberrant observations on high-resolution linkage analysis outcomes. Am J Hum Genet 49:985–994 - PMC - PubMed

Publication types