A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data
- PMID: 11443542
- PMCID: PMC1235309
- DOI: 10.1086/321981
A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data
Abstract
The present study assesses the effects of genotyping errors on the type I error rate of a particular transmission/disequilibrium test (TDT(std)), which assumes that data are errorless, and introduces a new transmission/disequilibrium test (TDT(ae)) that allows for random genotyping errors. We evaluate the type I error rate and power of the TDT(ae) under a variety of simulations and perform a power comparison between the TDT(std) and the TDT(ae), for errorless data. Both the TDT(std) and the TDT(ae) statistics are computed as two times a log-likelihood difference, and both are asymptotically distributed as chi(2) with 1 df. Genotype data for trios are simulated under a null hypothesis and under an alternative (power) hypothesis. For each simulation, errors are introduced randomly via a computer algorithm with different probabilities (called "allelic error rates"). The TDT(std) statistic is computed on all trios that show Mendelian consistency, whereas the TDT(ae) statistic is computed on all trios. The results indicate that TDT(std) shows a significant increase in type I error when applied to data in which inconsistent trios are removed. This type I error increases both with an increase in sample size and with an increase in the allelic error rates. TDT(ae) always maintains correct type I error rates for the simulations considered. Factors affecting the power of the TDT(ae) are discussed. Finally, the power of TDT(std) is at least that of TDT(ae) for simulations with errorless data. Because data are rarely error free, we recommend that researchers use methods, such as the TDT(ae), that allow for errors in genotype data.
Figures

References
Electronic-Database Information
-
- SIMULATE ftp site, ftp://linkage.rockefeller.edu/software/simulate/ (for SIMULATE software)
-
- Statgen Software, http://watson.hgen.pitt.edu/register/soft_doc.html (for FASTSLINK software)
References
-
- Box GEP, Hunter WG, Hunter JS (1978) Statistics for experimenters. John Wiley & Sons, New York
-
- Broman KW (1999) Cleaning genotype data. Genet Epidemiol 17 Suppl 1:S79–S83 - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous