Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Aug;176(4):2521-7.
doi: 10.1534/genetics.106.063982. Epub 2007 Feb 4.

Genetic mapping in the presence of genotyping errors

Affiliations

Genetic mapping in the presence of genotyping errors

Dustin A Cartwright et al. Genetics. 2007 Aug.

Abstract

Genetic maps are built using the genotypes of many related individuals. Genotyping errors in these data sets can distort genetic maps, especially by inflating the distances. We have extended the traditional likelihood model used for genetic mapping to include the possibility of genotyping errors. Each individual marker is assigned an error rate, which is inferred from the data, just as the genetic distances are. We have developed a software package, called TMAP, which uses this model to find maximum-likelihood maps for phase-known pedigrees. We have tested our methods using a data set in Vitis and on simulated data and confirmed that our method dramatically reduces the inflationary effect caused by increasing the number of markers and leads to more accurate orders.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Graphical representation of the error model. Each node represents an abstract marker, i.e., genotypes for all individuals in the pedigree. The leaf nodes are the known, observed, possibly erroneous markers, and the internal nodes are the inferred, unobserved, error-free markers. Thus, except for the terminal markers, each physical marker corresponds to two nodes, one error free and one observed. Each arc represents separation between two markers, either because of recombination (vertical) or because of errors (horizontal).
F<sc>igure</sc> 2.—
Figure 2.—
Illustration of the two types of permutations used in the marker-ordering algorithm: moves (left) and flips (right). Each square represents a single marker.
F<sc>igure</sc> 3.—
Figure 3.—
Effect on linkage group size of removing every other marker both with and without compensation for errors. Error compensation leads to more consistent genetic distances.
F<sc>igure</sc> 4.—
Figure 4.—
Simulation of the effect of errors on marker ordering. In a linkage group of 19 markers, the 10th marker was simulated with errors, and the markers were ordered, using three different likelihood models. The first uses TMAP with the error model described in this article. The second uses a version of TMAP that assumes a fixed error rate of 2% for every marker. The third does not model any error at all.
F<sc>igure</sc> 5.—
Figure 5.—
Effect of removing one of the two permutation types on the speed of convergence to the correct order.
F<sc>igure</sc> 6.—
Figure 6.—
Distribution of nonzero error rates in the Vitis data set. In addition, 625 markers (67%) had an estimated error rate of exactly 0%.
F<sc>igure</sc> 7.—
Figure 7.—
Comparison of the estimated marker error rates and the size of the 1-unit-down intervals. The 1-unit-down intervals are computed by placing the marker at regular steps along the length of the linkage group and computing the interval where the log10 likelihood is 1 unit less than the maximum. These approximate the 90% confidence intervals for the marker's position.

References

    1. Abkevich, V., N. J. Camp, A. Gutin, J. Farnham, L. Cannon-Albright et al., 2001. A robust multipoint linkage statistic (tlod) for mapping complex trait loci. Genet. Epidemiol. 21(Suppl. 1): S492–S497. - PubMed
    1. Broman, K. W., H. Wu, S. Sen and G. A. Churchill, 2003. R/qtl: QTL mapping in experimental crosses. Bioinformatics 19: 889–890. - PubMed
    1. Castiglione, P., C. Pozzi, M. Heun, V. Terzi, K. J. Müller et al., 1998. An AFLP-based procedure for the efficient mapping of mutations and DNA probes in barley. Genetics 149: 2039–2056. - PMC - PubMed
    1. de Givry, S., M. Bouchez, P. Chabrier, D. Milan and T. Schiex, 2005. CarthaGène: multipopulation integrated genetic and radiation hybrid mapping. Bioinformatics 21: 1703–1704. - PubMed
    1. Douglas, J. A., M. Boehnke and K. Lange, 2000. A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data. Am. J. Hum. Genet. 66: 1287–1297. - PMC - PubMed

Publication types

Substances

LinkOut - more resources