Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 5:10:e13790.
doi: 10.7717/peerj.13790. eCollection 2022.

Skimming for barcodes: rapid production of mitochondrial genome and nuclear ribosomal repeat reference markers through shallow shotgun sequencing

Affiliations

Skimming for barcodes: rapid production of mitochondrial genome and nuclear ribosomal repeat reference markers through shallow shotgun sequencing

Mykle L Hoban et al. PeerJ. .

Abstract

DNA barcoding is critical to conservation and biodiversity research, yet public reference databases are incomplete. Existing barcode databases are biased toward cytochrome oxidase subunit I (COI) and frequently lack associated voucher specimens or geospatial metadata, which can hinder reliable species assignments. The emergence of metabarcoding approaches such as environmental DNA (eDNA) has necessitated multiple marker techniques combined with barcode reference databases backed by voucher specimens. Reference barcodes have traditionally been generated by Sanger sequencing, however sequencing multiple markers is costly for large numbers of specimens, requires multiple separate PCR reactions, and limits resulting sequences to targeted regions. High-throughput sequencing techniques such as genome skimming enable assembly of complete mitogenomes, which contain the most commonly used barcoding loci (e.g., COI, 12S, 16S), as well as nuclear ribosomal repeat regions (e.g., ITS1&2, 18S). We evaluated the feasibility of genome skimming to generate barcode references databases for marine fishes by assembling complete mitogenomes and nuclear ribosomal repeats. We tested genome skimming across a taxonomically diverse selection of 12 marine fish species from the collections of the National Museum of Natural History, Smithsonian Institution. We generated two sequencing libraries per species to test the impact of shearing method (enzymatic or mechanical), extraction method (kit-based or automated), and input DNA concentration. We produced complete mitogenomes for all non-chondrichthyans (11/12 species) and assembled nuclear ribosomal repeats (18S-ITS1-5.8S-ITS2-28S) for all taxa. The quality and completeness of mitogenome assemblies was not impacted by shearing method, extraction method or input DNA concentration. Our results reaffirm that genome skimming is an efficient and (at scale) cost-effective method to generate all mitochondrial and common nuclear DNA barcoding loci for multiple species simultaneously, which has great potential to scale for future projects and facilitate completing barcode reference databases for marine fishes.

Keywords: Collections; DNA barcoding; Fishes; Genome skimming; Metabarcoding; Mitochondrial genomes; Museum.

PubMed Disclaimer

Conflict of interest statement

The authors declare there are no competing interests.

Figures

Figure 1
Figure 1. Species included in this MiSeq-based pilot study.
(A) Gymnura altavela, Spiny Butterfly Ray, length unknown. (B) Gymnothorax fimbriatus, Fimbriated moray, USNM 395396, 850 mm TL. (C) Gymnothorax undulatus, Undulated moray, USNM 442319, 132 mm TL. (D) Saurida nebulosa, Clouded Lizardfish, USNM 442473, 56.2 mm SL. (E) Brosme brosme, Cusk, length unknown. (F) Myripristis vittata, Whitetip Soldierfish, USNM 411102, 120.1 mm SL. (G) Neoniphon sammara, Sammara Squirrelfish, USNM 442483, 130 mm SL. (H) Tylosurus crocodilus, Houndfish, USNM 442362, 13.6 mm SL. (I) Scomberoides lysan, Doublespotted Queenfish, USNM 442297, 22.3 mm SL. (J) Forcipiger flavissimus, Longnose Butterflyfish, USNM 411089, 129.1 mm SL. (K) Ostracion whitleyi, Whitley’s Boxfish, USNM 411029, 81.2 mm SL. (L) Canthigaster amboinensis, Ambon Toby, USNM 442417, 64 mm SL. All photographs except A and E are the individuals for which we sequenced the mitogenome. Photographs A and E by Donald D. Flescher, NOAA; photographs B, F, J, and K by Jeff Williams, NMNH; and photographs C, D, G, H, I, and L by Diane Pitassy NMNH.
Figure 2
Figure 2. Assembled and annotated mitogenome of Canthigaster amboinensis, Ambon Toby, USNM 442417, 64 mm SL.
Photograph by Diane Pitassy, NMNH.
Figure 3
Figure 3. Results of phylogenetic analysis of 52 fish mitogenomes.
Tree shown is the result of the Bayesian analysis confirming that 12 focal taxa (shown in red) are correctly placed among confamilials in corresponding families. Node support values <95% are shown for nodes at family- and genus-level splits. Bayesian posterior probability is as labeled, and ML bootstrap support is indicated by the color of the node symbols. Unlabeled family- and genus-level nodes had 100% posterior probability and bootstrap support.

Similar articles

Cited by

References

    1. Adamowicz SJ, Boatwright JS, Chain F, Fisher BL, Hogg ID, Leese F, Lijtmaer DA, Mwale M, Naaum AM, Pochon X, Steinke D, Wilson J-J, Wood S, Xu J, Xu S, Zhou X, Van der Bank M. Trends in DNA barcoding and metabarcoding. Genome. 2019;62:v–viii. doi: 10.1139/gen-2019-0054. - DOI - PubMed
    1. Alexander JB, Bunce M, White N, Wilkinson SP, Adam AAS, Berry T, Stat M, Thomas L, Newman SJ, Dugal L, Richards ZT. Development of a multi-assay approach for monitoring coral diversity using eDNA metabarcoding. Coral Reefs. 2020;39:159–171. doi: 10.1007/s00338-019-01875-9. - DOI
    1. Alsos IG, Lavergne S, Merkel MKF, Boleda M, Lammers Y, Alberti A, Pouchon C, Denoeud F, Pitelkova I, Puşcaş M, Roquet C, Hurdu B-I, Thuiller W, Zimmermann NE, Hollingsworth PM, Coissac E. The treasure vault can be opened: large-scale genome skimming works well using herbarium and silica gel dried material. Plants. 2020;9:432. doi: 10.3390/plants9040432. - DOI - PMC - PubMed
    1. Berry TE, Osterrieder SK, Murray DC, Coghlan ML, Richardson AJ, Grealy AK, Stat M, Bejder L, Bunce M. DNA metabarcoding for diet analysis and biodiversity: A case study using the endangered Australian sea lion (Neophoca cinerea) Ecology and Evolution. 2017;7:5435–5453. doi: 10.1002/ece3.3123. - DOI - PMC - PubMed
    1. Besnard G, Christin P-A, Malé P-JG, Coissac E, Ralimanana H, Vorontsova MS. Phylogenomics and taxonomy of Lecomtelleae (Poaceae), an isolated panicoid lineage from Madagascar. Annals of Botany. 2013;112:1057–1066. doi: 10.1093/aob/mct174. - DOI - PMC - PubMed

Publication types

LinkOut - more resources