Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Feb 16;9(5):2996-3010.
doi: 10.1002/ece3.4757. eCollection 2019 Mar.

Incomplete estimates of genetic diversity within species: Implications for DNA barcoding

Affiliations
Review

Incomplete estimates of genetic diversity within species: Implications for DNA barcoding

Jarrett D Phillips et al. Ecol Evol. .

Abstract

DNA barcoding has greatly accelerated the pace of specimen identification to the species level, as well as species delineation. Whereas the application of DNA barcoding to the matching of unknown specimens to known species is straightforward, its use for species delimitation is more controversial, as species discovery hinges critically on present levels of haplotype diversity, as well as patterning of standing genetic variation that exists within and between species. Typical sample sizes for molecular biodiversity assessment using DNA barcodes range from 5 to 10 individuals per species. However, required levels that are necessary to fully gauge haplotype variation at the species level are presumed to be strongly taxon-specific. Importantly, little attention has been paid to determining appropriate specimen sample sizes that are necessary to reveal the majority of intraspecific haplotype variation within any one species. In this paper, we present a brief outline of the current literature and methods on intraspecific sample size estimation for the assessment of COI DNA barcode haplotype sampling completeness. The importance of adequate sample sizes for studies of molecular biodiversity is stressed, with application to a variety of metazoan taxa, through reviewing foundational statistical and population genetic models, with specific application to ray-finned fishes (Chordata: Actinopterygii). Finally, promising avenues for further research in this area are highlighted.

Keywords: DNA barcoding; cytochrome c oxidase subunit I; sampling; species; statistics; sufficiency.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1
Figure 1
Longfin damselfish (Stegastes diencaeus) TCS (Templeton et al., 1992) haplotype network depicting an overall skewed distribution of observed haplotypes. Sizes of circles reflect the number of DNA sequences contained within each vertex. Tick marks indicate the number of mutational differences separating sampled haplotypes. DNA barcode sequence data used in the generation of the network were taken from supplemental material accompanying Phillips et al. (2015). The software PopArt (Leigh & Bryant, 2015) was used to create the haplotype network
Figure 2
Figure 2
Graphical depiction of Phillips et al.'s (2015) sampling model as described in detail within the main text. The x‐axis is meant to depict the number of specimens sampled, whereas the y‐axis is meant to convey the cumulative number of unique haplotypes uncovered for every additional individual that is randomly sampled. N and H refer to specimen and haplotype numbers that are observed for a given species. N* is the total sample size that is needed to capture all H* haplotypes that exist for a species

References

    1. Adcock, C. (1997). Sample size determination: A review. Journal of the Royal Statistical Society: Series D. The Statistician, 46(2), 261–283.
    1. April, J. , Mayden, R. L. , Hanner, R. H. , & Bernatchez, L. (2011). Genetic 734 calibration of species diversity among North America's freshwater fishes. Proceedings of the National Academy of Sciences of the United States of America, 108(26), 10602–10607. 10.1073/pnas.1016437108 - DOI - PMC - PubMed
    1. Athey, T. (2013). Assessing errors in DNA barcode sequence records. Master's thesis, University of Guelph.
    1. Austerlitz, F. , David, O. , Schaeffer, B. , Bleakley, K. , Olteanu, M. , Leblois, R. , … Laredo, C. (2009). DNA barcode analysis: A comparison of phylogenetic and statistical classification methods. BMC Bioinformatics, 10(14), S10 10.1186/1471-2105-10-S14-S10 - DOI - PMC - PubMed
    1. Avise, J. C. , Arnold, J. , Ball, R. M. , Bermingham, E. , Lamb, T. , Neigel, J. E. , … Saunders, N. C. (1987). Intraspecific phylogeography: The mitochondrial DNA bridge between population genetics and systematics. Annual Review of Ecology and Systematics, 18(1), 489–522. 10.1146/annurev.es.18.110187.002421 - DOI

LinkOut - more resources