Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 26:13:e19054.
doi: 10.7717/peerj.19054. eCollection 2025.

Optimal input DNA thresholds for genome skimming in marine crustacean zooplankton

Affiliations

Optimal input DNA thresholds for genome skimming in marine crustacean zooplankton

Junya Hirai. PeerJ. .

Abstract

Crustacean zooplanktons are key secondary and tertiary producers in marine ecosystems, yet their genomic resources remain poorly understood. To advance biodiversity research on crustacean zooplankton, this study evaluated the effectiveness of genome skimming, a method that assembles genetic regions, including mitogenome, from shotgun genome sequencing data. Because the small amount of DNA available is a limitation in zooplankton genetics, different input DNA amounts (1 pg-10 ng) were prepared for library construction for genome skimming using two large species: Euphausia pacifica (Euphausiacea) and Calanus glacialis (Copepoda). Additionally, de novo assembly was used to obtain long contigs from short reads because reference-guided assembly can not be applied to all crustacean zooplankton. Evaluation of the raw sequence reads showed increased proportions of high-quality and distinct reads (low duplication levels) for large DNA inputs. By contrast, low sequence quality and high sequence duplication were observed for ≤ 10 pg DNA samples, owing to increased DNA amplification cycles. Complete mitogenomes, including all 37 genes, were successfully retrieved for ≥ 10 pg (E. pacifica) and ≥ 100 pg (C. glacialis) of DNA. Despite the large estimated genome sizes of these zooplankton species, only ≥ 1 and ≥ 3 M reads were sufficient for mitogenome assembly for E. pacifica and C. glacialis, respectively. Nuclear ribosomal repeats and histone 3 were identified in the assembled contigs. As obtaining sufficient DNA amounts (≥ 100 pg) is feasible even from small crustacean zooplankton, genome skimming is a powerful approach for robust phylogenetics and population genetics in marine zooplankton.

Keywords: Copepoda; Euphausiacea; Genome skimming; Mitogenome; Zooplankton.

PubMed Disclaimer

Conflict of interest statement

The author declares there are no competing interests.

Figures

Figure 1
Figure 1. Workflow for genome skimming in this study.
Figure 2
Figure 2. Quality of sequence reads.
(A) Proportions of high-quality sequence reads after the quality-filtering step. (B) Proportions of sequence reads that remained if duplicated reads are removed (distinct reads). The numbers of reads used for analyses are listed in Table S2.
Figure 3
Figure 3. Mitochondrial genome sequences for different DNA inputs.
(A) Euphausia pacifica. (B) Calanus glacialis. Note that no >1,500 bp mitochondrial contigs were obtained for 10 and 1 pg samples of C. glacialis. Note that start and end points are different between reference sequences and assembled contigs.
Figure 4
Figure 4. The longest mitochondrial contig in different numbers of sequence reads for assembly.
The samples with 100 pg input DNA are used for Euphausia pacifica and Calanus glacialis.

Similar articles

References

    1. Arab MA, zu Siederdissen CH, Tout K, Sahyoun AH, Stadler PF, Bernt M. Accurate annotation of protein-coding genes in mitochondrial genomes. Molecular Phylogenetics and Evolution. 2017;106:209–216. doi: 10.1016/j.ympev.2016.09.024. - DOI - PubMed
    1. Baker ADC, Boden BP, Brinton E. Practical guide to the euphausiids of the world. London: British Museum (Natural History); 1990.
    1. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. - DOI - PMC - PubMed
    1. Blanco-Bercial L, Cornils A, Copley N, Bucklin A. DNA barcoding of marine copepods: assessment of analytical approaches to species identification. PLOS Current. 2014;6 doi: 10.1371/currents.tol.cdf8b74881f87e3b01d56b43791626d2. ecurrents.tol.cdf8b74881f87e3b01d56b43791626d2. - DOI - PMC - PubMed
    1. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. - DOI - PMC - PubMed

LinkOut - more resources