Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Feb 6:15:44.
doi: 10.1186/1471-2105-15-44.

The HTS barcode checker pipeline, a tool for automated detection of illegally traded species from high-throughput sequencing data

Affiliations

The HTS barcode checker pipeline, a tool for automated detection of illegally traded species from high-throughput sequencing data

Youri Lammers et al. BMC Bioinformatics. .

Abstract

Background: Mixtures of internationally traded organic substances can contain parts of species protected by the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). These mixtures often raise the suspicion of border control and customs offices, which can lead to confiscation, for example in the case of Traditional Chinese medicines (TCMs). High-throughput sequencing of DNA barcoding markers obtained from such samples provides insight into species constituents of mixtures, but manual cross-referencing of results against the CITES appendices is labor intensive. Matching DNA barcodes against NCBI GenBank using BLAST may yield misleading results both as false positives, due to incorrectly annotated sequences, and false negatives, due to spurious taxonomic re-assignment. Incongruence between the taxonomies of CITES and NCBI GenBank can result in erroneous estimates of illegal trade.

Results: The HTS barcode checker pipeline is an application for automated processing of sets of 'next generation' barcode sequences to determine whether these contain DNA barcodes obtained from species listed on the CITES appendices. This analytical pipeline builds upon and extends existing open-source applications for BLAST matching against the NCBI GenBank reference database and for taxonomic name reconciliation. In a single operation, reads are converted into taxonomic identifications matched with names on the CITES appendices. By inclusion of a blacklist and additional names databases, the HTS barcode checker pipeline prevents false positives and resolves taxonomic heterogeneity.

Conclusions: The HTS barcode checker pipeline can detect and correctly identify DNA barcodes of CITES-protected species from reads obtained from TCM samples in just a few minutes. The pipeline facilitates and improves molecular monitoring of trade in endangered species, and can aid in safeguarding these species from extinction in the wild. The HTS barcode checker pipeline is available at https://github.com/naturalis/HTS-barcode-checker.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Steps of the analysis pipeline. Section A shows the process of both controlling the version of the local CITES database as well as updating the database with NCBI taxonomy IDs of CITES-protected taxa. Section B shows the process of running a BLAST search on the user input FASTA file, filtering the output according to minimum BLAST quality settings and blacklisted GenBank entries and flagging CITES protected taxa by comparing the BLAST hits against the local CITES database (and optionally other databases).

References

    1. Appendices I, II and III. [ http://www.cites.org/eng/app/appendices.php]
    1. Coghlan ML, Haile J, Houston J, Murray DC, White NE, Moolhuijzen P, Bellgard MI, Bunce M. Deep Sequencing of Plant and Animal DNA Contained within Traditional Chinese Medicines Reveals Legality Issues and Health Safety Concerns. PLoS Genetics. 2012;8(4) doi:10.1371/journal.pgen.1002657. - PMC - PubMed
    1. Gathier G, van der Niet T, Peelen T, van Vugt RR, Eurlings MCM, Gravendeel B. Forensic Identification of CITES protected Slimming Cactus (Hoodia) using DNA barcoding. J Forensic Sci. 2013;58(6):1467–1471. doi: 10.1111/1556-4029.12184. - DOI - PubMed
    1. Savolainen V, Cowan RS, Vogler AP, Roderick GK, Lane R. Towards writing the encyclopaedia of life: an introduction to DNA barcoding. Roy Soc B Biol Sci. 2005;360:1805–1811. - PMC - PubMed
    1. Hebert PDN, Cywinska A, Ball SL, deWaard JR. Biological identifications through DNA barcodes. Roy Soc B Biol Sci. 2003;270(1512):313–321. doi: 10.1098/rspb.2002.2218. - DOI - PMC - PubMed

MeSH terms

Substances

LinkOut - more resources