Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
- PMID: 16451611
- PMCID: PMC1866784
- DOI: 10.1186/1471-2156-6-S1-S150
Abstract
Background: Two factors affect the robustness of the original transmission disequilibrium test (TDT): (i) missing parental genotypes and (ii) undetected genotype errors. Each factor alone is known to inflate the false-positive rate of the original TDT, but no study has considered either their joint impact on false-positive rates or the precision score of TDT statistics in their presence. By precision score, we mean the absolute difference between the disease gene position and the position of markers whose TDT statistic exceeds some threshold.
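The precision score defined above can be illustrated with a minimal sketch. The function name, the marker positions, and the statistics below are hypothetical and are not taken from the study; the threshold 3.84 is the approximate 5% critical value of a chi-square distribution with 1 degree of freedom.

```python
from statistics import median

def precision_scores(marker_positions_cm, statistics, disease_position_cm, threshold):
    """Return |marker position - disease position| (in cM) for every
    marker whose test statistic exceeds the threshold."""
    return [
        abs(pos - disease_position_cm)
        for pos, stat in zip(marker_positions_cm, statistics)
        if stat > threshold
    ]

# Hypothetical data: five markers on one chromosome, disease locus at 41 cM.
positions = [10.0, 25.0, 40.0, 42.0, 60.0]
stats = [1.2, 4.5, 12.0, 7.1, 0.3]

scores = precision_scores(positions, stats, disease_position_cm=41.0, threshold=3.84)
median_score = median(scores)
```

A smaller median score means that the significant markers cluster closer to the disease locus.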
Methods: We apply our transmission disequilibrium test allowing for errors (TDTae) and the original TDT to simulated phenotype data and modified simulated single-nucleotide polymorphism genotype data from the Genetic Analysis Workshop. We modify the genotype data by randomly introducing genotype errors and removing a percentage of the parental genotype data. We compute the empirical distribution of each statistic's precision score for a chromosome harboring a simulated disease locus. We also assess inflation in type I error by studying markers on a chromosome harboring no disease locus.
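The data modification step can be sketched as follows. The abstract does not specify the error model or the missingness mechanism, so everything here is an assumption made for illustration: genotypes are coded as 0, 1, or 2 copies of an allele, an erroneous genotype is drawn uniformly from the other two codes, and each parent is removed independently. The name `perturb_trio` is hypothetical.

```python
import random

GENOTYPES = (0, 1, 2)  # assumed coding: copies of one allele at a SNP

def perturb_trio(trio, error_rate, missing_rate, rng):
    """Return a modified (father, mother, child) genotype trio.

    Assumed model, not the study's: each genotype is replaced by a
    different code with probability error_rate, then each parental
    genotype is set to None (missing) with probability missing_rate.
    """
    def with_error(genotype):
        if rng.random() < error_rate:
            return rng.choice([g for g in GENOTYPES if g != genotype])
        return genotype

    father, mother, child = (with_error(g) for g in trio)
    if rng.random() < missing_rate:
        father = None
    if rng.random() < missing_rate:
        mother = None
    return (father, mother, child)

rng = random.Random(0)  # fixed seed so the sketch is reproducible
modified = perturb_trio((1, 1, 2), error_rate=0.01, missing_rate=0.2, rng=rng)
```

Only parental genotypes are removed in this sketch, which matches the abstract's description; the child's genotype can still carry an error.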
Results: The TDTae shows median precision scores of approximately 13 cM, 2 cM, 0 cM, and 0 cM at the 5%, 1%, 0.1%, and 0.01% significance levels, respectively. By contrast, the original TDT shows median precision scores of approximately 23 cM, 21 cM, 15 cM, and 7 cM at the corresponding significance levels. For null chromosomes, the original TDT falsely rejects the null hypothesis at rates of 28.8%, 14.8%, 5.4%, and 1.7% at the 5%, 1%, 0.1%, and 0.01% significance levels, respectively, while the TDTae maintains the correct false-positive rate.
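An empirical false-positive rate of this kind can be computed as in the sketch below. It uses the standard McNemar form of the original TDT, (b - c)^2 / (b + c), referred to a chi-square distribution with 1 degree of freedom. The TDTae itself is not implemented here, the transmission counts are invented, and treating each null marker as one test is an assumption about how the rates were tallied.

```python
import math

def tdt_statistic(b, c):
    """Original TDT: b and c are the numbers of times heterozygous
    parents transmit and do not transmit a given allele."""
    if b + c == 0:
        return 0.0
    return (b - c) ** 2 / (b + c)

def chi2_1df_pvalue(x):
    """Upper-tail probability of a chi-square variable with 1 df."""
    return math.erfc(math.sqrt(x / 2.0))

def empirical_rejection_rate(p_values, alpha):
    """Fraction of tests rejected at significance level alpha."""
    return sum(p < alpha for p in p_values) / len(p_values)

# Hypothetical (transmitted, not transmitted) counts at four null markers.
counts = [(30, 30), (45, 20), (25, 35), (55, 20)]
p_values = [chi2_1df_pvalue(tdt_statistic(b, c)) for b, c in counts]
rate_at_5_percent = empirical_rejection_rate(p_values, alpha=0.05)
```

If the test is well calibrated, the rejection rate on null markers should stay close to alpha; a rate well above alpha indicates inflation.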
Conclusion: Because missing parental genotypes and undetected genotype errors are unknown to the investigator but are expected to become increasingly prevalent in multilocus datasets, we strongly recommend TDTae methods as a standard procedure, particularly where stricter significance levels are required.