Identification and analysis of error types in high-throughput genotyping

K R Ewen¹, M Bahlo, S A Treloar, D F Levinson, B Mowry, J W Barlow, S J Foote

Affiliations

Affiliation

¹ Australian Genome Research Facility, The Walter and Eliza Hall Institute of Medical Research, Royal Melbourne Hospital, Victoria 3050, Australia. ewen@wehi.edu.au

PMID: 10924406
PMCID: PMC1287531
DOI: 10.1086/303048

Identification and analysis of error types in high-throughput genotyping

K R Ewen et al. Am J Hum Genet. 2000 Sep.

. 2000 Sep;67(3):727-36.

doi: 10.1086/303048. Epub 2000 Aug 2.

Authors

K R Ewen¹, M Bahlo, S A Treloar, D F Levinson, B Mowry, J W Barlow, S J Foote

Affiliation

¹ Australian Genome Research Facility, The Walter and Eliza Hall Institute of Medical Research, Royal Melbourne Hospital, Victoria 3050, Australia. ewen@wehi.edu.au

PMID: 10924406
PMCID: PMC1287531
DOI: 10.1086/303048

Abstract

Although it is clear that errors in genotyping data can lead to severe errors in linkage analysis, there is as yet no consensus strategy for identification of genotyping errors. Strategies include comparison of duplicate samples, independent calling of alleles, and Mendelian-inheritance-error checking. This study aimed to develop a better understanding of error types associated with microsatellite genotyping, as a first step toward development of a rational error-detection strategy. Two microsatellite marker sets (a commercial genomewide set and a custom-designed fine-resolution mapping set) were used to generate 118,420 and 22,500 initial genotypes and 10,088 and 8,328 duplicates, respectively. Mendelian-inheritance errors were identified by PedManager software, and concordance was determined for the duplicate samples. Concordance checking identifies only human errors, whereas Mendelian-inheritance-error checking is capable of detection of additional errors, such as mutations and null alleles. Neither strategy is able to detect all errors. Inheritance checking of the commercial marker data identified that the results contained 0.13% human errors and 0.12% other errors (0.25% total error), whereas concordance checking found 0.16% human errors. Similarly, Mendelian-inheritance-error checking of the custom-set data identified 1.37% errors, compared with 2.38% human errors identified by concordance checking. A greater variety of error types were detected by Mendelian-inheritance-error checking than by duplication of samples or by independent reanalysis of gels. These data suggest that Mendelian-inheritance-error checking is a worthwhile strategy for both types of genotyping data, whereas fine-mapping studies benefit more from concordance checking than do studies using commercial marker data. Maximization of error identification increases the likelihood of linkage when complex diseases are analyzed.

PubMed Disclaimer

Figures

**Figure 1**
Percentage of total errors (A) and total genotypes (B), for each error type.

**Figure 2**
Preferential amplification. Paternal allele 168 (children 2 and 3) and 172 (child 1) did not amplify as well as did the allele 148.

**Figure 3**
Microsatellite mutation to new allele. The child’s maternal allele (124) has mutated to a new allele (126), whereas the paternal allele (118) is normal.

See this image and copyright information in PMC

References

Electronic-Database Information

1. Australian Genome Research Facility, http://www.agrf.org.au/ (for PedManager version 0.9)
1. Division of Statistical Genetics, Department of Human Genetics, University of Pittsburgh, http://watson.hgen.pitt.edu/register/soft_doc.html (for PedCheck)

References

1. Bahlo M, Broman KW (1999) Identification of and adjustment for genotyping errors in data on sibpairs when parental genotypes are unavailable. Am J Hum Genet Suppl 65:A241
1. Bowcock AM, Ruix-Linares A, Tomfohrde J, Minch E, Kidd JR, Cavalli-Sforza LL (1994) High resolution of human evolutionary trees with polymorphic microsatellites. Nature 368:455–457 - PubMed
1. Brownstein MJ, Carpten JD, Smith JR (1996) Modulation of non-templated nucleotide addition by Taq DNA polymerase: primer modifications that facilitate genotyping. Biotechniques 20:1004–1010 - PubMed
1. Dixon MJ, Dixon J, Raskova D, Le Beau MM, Williamson R, Klinger C, Landes GM (1992) Genetic and physical mapping of the Treacher Collins syndrome locus: refinement of the localisation to chromosome 5q 32-33.2.1. Hum Mol Genet 1:249–253 - PubMed
1. Douglas JA, Boehnke M, Lange K (2000) A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data. Am J Hum Genet 66:1287–1297 - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Identification and analysis of error types in high-throughput genotyping

Affiliation

Identification and analysis of error types in high-throughput genotyping

Authors

Affiliation

Abstract

Figures

References

Electronic-Database Information

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources