Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Mar 29:6:e4573.
doi: 10.7717/peerj.4573. eCollection 2018.

Simulated Disperser Analysis: determining the number of loci required to genetically identify dispersers

Affiliations

Simulated Disperser Analysis: determining the number of loci required to genetically identify dispersers

Adam P A Cardilini et al. PeerJ. .

Abstract

Empirical genetic datasets used for estimating contemporary dispersal in wild populations and to correctly identify dispersers are rarely tested to determine if they are capable of providing accurate results. Here we test whether a genetic dataset provides sufficient information to accurately identify first-generation dispersers. Using microsatellite data from three wild populations of common starlings (Sturnus vulgaris), we artificially simulated dispersal of a subset of individuals; we term this 'Simulated Disperser Analysis'. We then ran analyses for diminishing numbers of loci, to assess at which point simulated dispersers could no longer be correctly identified. Not surprisingly, the correct identification of dispersers varied significantly depending on the individual chosen to 'disperse', the number of loci used, whether loci had high or low Polymorphic Information Content and the location to which the dispersers were moved. A review of the literature revealed that studies that have implemented first-generation migrant detection to date have used on average 10 microsatellite loci. Our results suggest at least 27 loci are required to accurately identify dispersers in the study system evaluated here. We suggest that future studies use the approach we describe to determine the appropriate number of markers needed to accurately identify dispersers in their study system; the unique nature of natural systems means that the number of markers required for each study system will vary. Future studies can use Simulated Disperser Analysis on pilot data to test marker panels for robustness to contemporary dispersal identification, providing a powerful tool in the efficient and accurate design of studies using genetic data to estimate dispersal.

Keywords: Ecological Genetics; GeneClass2; Migrant; Population Genetics; Power Analysis.

PubMed Disclaimer

Conflict of interest statement

Lee A. Rollins is an Academic Editor for PeerJ.

Figures

Figure 1
Figure 1. Probability of correctly identifying a simulated disperser by the number of loci used.
Genetic assignment tests were used to determine whether three common starlings (Sturnus vulgaris) from three genetically distinct collection localities would be identified as dispersers when their movement to a new location was simulated. Shown is the relationship between the probabilities (y-axis), from a Generalised Linear Mixed Model, of correctly identifying a simulated disperser plotted against an increase in the number of loci used. The top line indicates the response when using loci with the highest Polymorphic Information Content (PIC) first (blue line), and the bottom line indicates the response when using loci with the lowest PIC first (red line). The coloured shading corresponding to each line encompasses the upper and lower confidence intervals of the models.
Figure 2
Figure 2. Heatmap showing the assignment status of each simulated disperser.
Genetic assignment tests were used to determine whether three common starlings (Sturnus vulgaris) from three genetically distinct collection localities (A, Munglinup; B, Mallala; C, Orange) would be identified as dispersers when their movement to a new location was simulated. Each small coloured rectangle represents the results of a genetic assignment test; blue, individual was correctly assigned to its collection locality; red, individual was not recognised as a disperser or resident; orange, individual was identified as a disperser but assigned to the incorrect collection locality. The three panels represent the results for simulated disperser A, C and B. Within each panel the top heat map shows the results when first using loci with the highest Polymorphic Information Content (PIC) and the bottom heat map shows the results when first using loci with the lowest PIC. The grey arrows below each inset show the direction the loci were sorted, from high PIC to low PIC (top) or vice versa (bottom). Each inset is sorted on the y-axis by the location the individual was found and the movement treatment. Each row corresponds to a different treatment. The treatment order of each simulated migrant’s heat map row can be found in Table S4.
Figure 3
Figure 3. Data extracted during a literature review showing the relationship between the number of loci used and FST value.
A literature review was conducted to gather data on studies that used genetic assignment tests in GeneClass2.0 and their global FST. This graph shows the non-significant relationship (GLM, t71 =  − 1.312) between the number of loci used and global FST of 72 datasets. The grey band shows the 95% confidence interval on the fitted values.
Figure 4
Figure 4. Number of papers that cited Piry et al. (2004) and contained the search term ‘microsatellite’ each year between 2005 and 2016.
Figure 5
Figure 5. Recommended workflow indicating the steps necessary for validating genetic assignment tests.

References

    1. Abdelkrim J, Robertson BC, Stanton J-AL, Gemmell NJ. Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. BioTechniques. 2009;46:185–192. doi: 10.2144/000113084. - DOI - PubMed
    1. Bates DM, Maechler M, Bolker B. lme4: Linear mixed-effects models using S4 classes. 2012. https://cran.r-project.org/web/packages/lme4/index.html https://cran.r-project.org/web/packages/lme4/index.html
    1. Bernatchez L, Duchesne P. Individual-based genotype analysis in studies of parentage and population assignment: how many loci, how many alleles? Canadian Journal of Fisheries and Aquatic Sciences. 2000;57:1–12. doi: 10.1139/f99-271. - DOI
    1. Berry O, Tocher MD, Sarre SD. Can assignment tests measure dispersal? Molecular Ecology. 2004;13:551–561. doi: 10.1046/j.1365-294X.2004.2081.x. - DOI - PubMed
    1. Bohonak AJ. Dispersal, gene flow, and population structure. The Quarterly Review of Biology. 1999;74:21–45. doi: 10.1086/392950. - DOI - PubMed

LinkOut - more resources