Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Sep 12;363(1505):2841-50.
doi: 10.1098/rstb.2008.0043.

Bayesian inference of species hybrids using multilocus dominant genetic markers

Affiliations

Bayesian inference of species hybrids using multilocus dominant genetic markers

Eric C Anderson. Philos Trans R Soc Lond B Biol Sci. .

Abstract

Neutral genetic markers are useful for identifying species hybrids in natural populations, especially when used in conjunction with statistical methods like the one implemented in the software NEWHYBRIDS. Here, a short description of the extension of NEWHYBRIDS to dominant markers is given. Subsequently, an extensive series of simulations of amplified fragment length polymorphism (AFLP) data is performed to evaluate the prospects for hybrid identification with (possibly non-diagnostic) dominant markers. Distinguishing between F1's and F2's is shown to be difficult, possibly requiring upwards of 100 AFLP markers to be done accurately. Discriminating between pure-bred and non-pure (hybrid) individuals, however, is shown to be much easier, requiring perhaps as few as 10 dominant markers, even from relatively weakly diverged species.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Directed graphs describing the NewHybrids model. Black-shaded nodes represent observed data, grey-shaded nodes represent parameters of prior distributions and unshaded nodes represent unobserved variables. The boxes are ‘plates’ which denote multiple, conditionally independent replicates (indexed by the subscript over loci and i over individuals in the sample) of the enclosed nodes. (a) The model for codominant data. Note that the observed data are the allelic types Yi,,1 and Yi,,2 of the two gene copies at each locus in each individual (hence those nodes are shaded black). (b) The model for dominant data: each dominant locus is modelled as a diallelic locus with a recessive allele and a dominant one. These allele types are no longer observed, so they are latent variables. At each dominant locus, the observed datum is the presence or absence of a band, Yi,dom.
Figure 2
Figure 2
Posterior mean estimates (y-axis) versus true values (x-axis) of the frequency of the recessive r allele from either species gene pool (A or B) across all simulated datasets (both LD and HD conditions) with H=100 and L=100.There are 20 000 points in total, each one corresponding to an allele frequency at a single locus in species A or B.
Figure 3
Figure 3
Mean posterior probability of hybrid category membership. (af) The mean over all simulations with different simulation scenarios denoted by the plot symbols as follows: circles, LD and H=100; triangles, LD and H=1000; pluses, HD and H=100; crosses, HD and H=1000. The x-axis of (af) shows L, the number of markers used. (af) Correspond to different hybrid categories as indicated on each plot. Note that the y-axis scale differs on each plot: (a) A, (b) F1, (c) BC1A, (d) B, (e) F2, (f) BC1B.
Figure 4
Figure 4
ROC curves based on P(Pure) (see text) computed from simulation output for the LD scenario with H=100 and L equal to 10 (long-dashed line), 25 (short dashed line) and 50 (solid line). A separate ROC curve has been plotted for each true hybrid category, i.e. (a) shows the rate at which pure A individuals are allocated to the pure category as a function of the rate at which individuals from the non-pure categories are incorrectly assigned to the pure category; (b) shows the rate that F1's are allocated to the non-pure category as a function of the rate at which pure A and B individuals are incorrectly assigned to the non-pure category, etc. (a) A, (b) F1, (c) BC1A, (d) B, (e) F2, (f) BC1B.

References

    1. Albert V, Jónsson B, Bernatchez L. Natural hybrids in Atlantic eels (Anguilla anguilla, A. rostrata): evidence for successful reproduction and flucuating abundance in space and time. Mol. Ecol. 2006;15:1903–1916. doi: 10.1111/j.1365-294X.2006.02917.x. - DOI - PubMed
    1. Anderson, E. C. 2003 User's guide to the program NewHybrids, version 1.1 beta. Technical report, 7 April, 2003.
    1. Anderson E.C, Thompson E.A. A model-based method for identifying species hybrids using multilocus genetic data. Genetics. 2002;160:1217–1229. - PMC - PubMed
    1. Barton N.H. The dynamics of hybrid zones. Heredity. 1979;43:341–359. doi: 10.1038/hdy.1979.87. - DOI
    1. Barton N.H, Gale K.S. Genetic analysis of hybrid zones. In: Harrison R.G, editor. Hybrid zones and the evolutionary process. Oxford University Press; Oxford, UK: 1993. pp. 13–45.

Substances

LinkOut - more resources