Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 5;7(1):19-29.
doi: 10.1534/g3.116.036145.

Using Next-Generation Sequencing for DNA Barcoding: Capturing Allelic Variation in ITS2

Affiliations

Using Next-Generation Sequencing for DNA Barcoding: Capturing Allelic Variation in ITS2

Jana Batovska et al. G3 (Bethesda). .

Abstract

Internal Transcribed Spacer 2 (ITS2) is a popular DNA barcoding marker; however, in some animal species it is hypervariable and therefore difficult to sequence with traditional methods. With next-generation sequencing (NGS) it is possible to sequence all gene variants despite the presence of single nucleotide polymorphisms (SNPs), insertions/deletions (indels), homopolymeric regions, and microsatellites. Our aim was to compare the performance of Sanger sequencing and NGS amplicon sequencing in characterizing ITS2 in 26 mosquito species represented by 88 samples. The suitability of ITS2 as a DNA barcoding marker for mosquitoes, and its allelic diversity in individuals and species, was also assessed. Compared to Sanger sequencing, NGS was able to characterize the ITS2 region to a greater extent, with resolution within and between individuals and species that was previously not possible. A total of 382 unique sequences (alleles) were generated from the 88 mosquito specimens, demonstrating the diversity present that has been overlooked by traditional sequencing methods. Multiple indels and microsatellites were present in the ITS2 alleles, which were often specific to species or genera, causing variation in sequence length. As a barcoding marker, ITS2 was able to separate all of the species, apart from members of the Culex pipiens complex, providing the same resolution as the commonly used Cytochrome Oxidase I (COI). The ability to cost-effectively sequence hypervariable markers makes NGS an invaluable tool with many applications in the DNA barcoding field, and provides insights into the limitations of previous studies and techniques.

Keywords: Culicidae; NGS; amplicon sequencing; indels; microsatellites.

PubMed Disclaimer

Figures

Figure 1
Figure 1
ITS2 sequence information. (A) Comparison of mean Sanger sequence length post-trimming (red) and mean NGS allele length (blue) between species with standard error shown. (B) Number of alleles per individual shown as a histogram. (C) Number of alleles per species shown as a histogram. NGS, next-generation sequencing.
Figure 2
Figure 2
Comparison of sequences generated by Sanger and NGS technology in ITS2 regions containing different types of polymorphisms observed in this study. (A) Single nucleotide polymorphisms, (B) homopolymer region, (C) dinucleotide repeat, (D) insertion/deletion, and (E) combination. The multiple upper sequences show alleles derived from NGS data, while the lower chromatogram and single accompanying sequence were produced through Sanger sequencing. B and C are examples of microsatellites. NGS, next-generation sequencing.
Figure 3
Figure 3
The location of microsatellites within 26 ITS2 sequences (shown from 100 bp), each representative of a species. The size of the light blue labels are proportionate to repeat length, whereas the number inside the label is representative of the repeat type. Microsatellites within all 382 alleles can be found in Figure S2.
Figure 4
Figure 4
A summarized neighbor-joining tree, with bootstrap support values (%), based on p-distance comparisons between 382 ITS2 contig sequences from 88 mosquito samples. Tribal groups are indicated by brackets.
Figure 5
Figure 5
Distribution of percentage difference (p-distances) for ITS2 in different taxonomic categories. Comparison of conspecific and congeneric differences of (A) all 88 samples, and (B) all samples excluding those from the Dobrotworskyius genus. (C) Differences between genera in 88 samples, with peaks corresponding to the differences within and between mosquito tribes, and between mosquito subfamilies. Brackets indicate the range of p-distances for each group.

References

    1. Adey A., Morrison H. G., Asan X., Xun J. O., Kitzman, et al. , 2010. Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition. Genome Biol. 11: 1–17. - PMC - PubMed
    1. Álvarez I., Wendel J. F., 2003. Ribosomal ITS sequences and plant phylogenetic inference. Mol. Phylogenet. Evol. 29: 417–434. - PubMed
    1. Arif C., Daniels C., Bayer T., Banguera-Hinestroza E., Barbrook A., et al. , 2014. Assessing Symbiodinium diversity in scleractinian corals via next-generation sequencing-based genotyping of the ITS2 rDNA region. Mol. Ecol. 23: 4418–4433. - PMC - PubMed
    1. Armstrong K. F., Ball S. L., 2005. DNA barcodes for biosecurity: invasive species identification. Philos. Trans. R. Soc. Lond. B Biol. Sci. 360: 1813–1823. - PMC - PubMed
    1. Aspen S., Crabtree M. B., Savage H. M., 2003. Polymerase chain reaction assay identifies Culex nigripalpus: part of an assay for molecular identification of the common Culex (Culex) mosquitoes of the eastern United States. J. Am. Mosq. Control Assoc. 19: 115–120. - PubMed

Publication types

Substances

LinkOut - more resources