Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul;29(5):e01914.
doi: 10.1002/eap.1914. Epub 2019 Jun 12.

Categorization of species as native or nonnative using DNA sequence signatures without a complete reference library

Affiliations

Categorization of species as native or nonnative using DNA sequence signatures without a complete reference library

Jeremy C Andersen et al. Ecol Appl. 2019 Jul.

Abstract

New genetic diagnostic approaches have greatly aided efforts to document global biodiversity and improve biosecurity. This is especially true for organismal groups in which species diversity has been underestimated historically due to difficulties associated with sampling, the lack of clear morphological characteristics, and/or limited availability of taxonomic expertise. Among these methods, DNA sequence barcoding (also known as "DNA barcoding") and by extension, meta-barcoding for biological communities, has emerged as one of the most frequently utilized methods for DNA-based species identifications. Unfortunately, the use of DNA barcoding is limited by the availability of complete reference libraries (i.e., a collection of DNA sequences from morphologically identified species), and by the fact that the vast majority of species do not have sequences present in reference databases. Such conditions are critical especially in tropical locations that are simultaneously biodiversity rich and suffer from a lack of exploration and DNA characterization by trained taxonomic specialists. To facilitate efforts to document biodiversity in regions lacking complete reference libraries, we developed a novel statistical approach that categorizes unidentified species as being either likely native or likely nonnative based solely on measures of nucleotide diversity. We demonstrate the utility of this approach by categorizing a large sample of specimens of terrestrial insects and spiders (collected as part of the Moorea BioCode project) using a generalized linear mixed model (GLMM). Using a training data set of known endemic (n = 45) and known introduced species (n = 102), we then estimated the likely native/nonnative status for 4,663 specimens representing an estimated 1,288 species (412 identified species), including both those specimens that were either unidentified or whose endemic/introduced status was uncertain. Using this approach, we were able to increase the number of categorized specimens by a factor of 4.4 (from 794 to 3,497), and the number of categorized species by a factor of 4.8 from (147 to 707) at a rate much greater than chance (77.6% accuracy). The study identifies phylogenetic signatures of both native and nonnative species and suggests several practical applications for this approach including monitoring biodiversity and facilitating biosecurity.

Keywords: DNA barcoding; Moorea BioCode; alien invasive species; biomonitoring; biosecurity; community barcoding; metabarcoding.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Comparison of the biogeographical categorizations based on Ramage (2017) or based on GLMM predictions. Categorizations are presented so that the biogeographic assignment in Ramage (2017) is before the colon and the biogeographic assignment based on the GLMM predictions is presented after the colon. Categorizations follow Table 3, except that two categories for the GLMM predictions (Undet and NA) have been combined into a sing category here labeled “Not Categorized.”

References

    1. Agnarsson, I. , Kuntner M., and Paterson A.. 2007. Taxonomy in a changing world: seeking solutions for a science in crisis. Systematic Biology 56:531–539. - PubMed
    1. Allendorf, F. W. , Hohenlohe P. A., and Luikart G.. 2010. Genomics and the future of conservation genetics. Nature Reviews Genetics 11:697–709. - PubMed
    1. Altschul, S. F. , Gish W., Miller W., Myers E. W., and Lipman D. J.. 1990. Basic local alignment search tool. Journal of Molecular Biology 215:403–410. - PubMed
    1. Ardura, A. , and Planes S.. 2017. Rapid assessment of non‐indigenous species in the era of the eDNA barcoding: A Mediterranean case study. Estuarine Coastal and Shelf Science 188:81–87.
    1. Armstrong, K. F. , and Ball S. L.. 2005. DNA barcodes for biosecurity: invasive species identification. Philosophical Transactions of the Royal Society B: Biological Sciences 360:1813–1823. - PMC - PubMed

Publication types