Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 30;7(4):e0043022.
doi: 10.1128/msystems.00430-22. Epub 2022 Jul 14.

Comprehensive Evaluation of RNA and DNA Viromic Methods Based on Species Richness and Abundance Analyses Using Marmot Rectal Samples

Affiliations

Comprehensive Evaluation of RNA and DNA Viromic Methods Based on Species Richness and Abundance Analyses Using Marmot Rectal Samples

Yue Sun et al. mSystems. .

Abstract

Viral metagenomics is the most powerful tool to profile viromic composition for a given sample. Different viromic methods, including amplification-free ones, have been developed, but choosing them for different purposes requires comprehensive benchmarks. Here, we assessed the performance of four routinely used methods, i.e., multiple displacement amplification (MDA), direct metagenomic sequencing (MTG), sequence-independent single-primer amplification (SIA), and metatranscriptomic sequencing (MTT), using marmot rectal samples as the templates spiked with five known viruses of different genome types. The obtained clean data were differently contaminated by host and bacterial genomes, resulting in MDA having the most, with ~72.1%, but MTT had only ~7.5% data, useful for follow-up viromic analysis. MDA showed a broader spectrum with higher efficiency to profile the DNA virome, and MTT captured almost all RNA viruses with extraordinary sensitivity; hence, they are advisable in richness-based viromic studies. MTG was weak in capturing single-stranded DNA viruses, and SIA could detect both RNA and DNA viruses but with high randomness. Due to biases to certain types of viruses, the four methods caused different alterations to species abundance compared to the initial virus composition. SIA and MDA introduced greater stochastic errors to relative abundances of species, genus, and family taxa, whereas the two amplification-free methods were more tolerant toward such errors and thus are recommendable in abundance-based analyses. In addition, genus taxon is a compromising analytic level that ensures technically supported and biologically and/or ecologically meaningful viromic conclusions. IMPORTANCE Viral metagenomics can be roughly divided into species richness-based studies and species abundance-based analyses. Viromic methods with different principles have been developed, but rational selection of these techniques according to different purposes requires comprehensive understanding of their properties. By assessing the four most widely used methods using template samples, we found that multiple displacement amplification (MDA) and metatranscriptomic sequencing (MTT) are advisable for species richness-based viromic studies, as they show excellent efficiency to detect DNA and RNA viruses. Meanwhile, metagenomic sequencing (MTG) and MTT are more compatible with stochastic errors of methods introduced into relative abundance of viromic taxa and hence are rational choices in species abundance-based analyses. This study also highlights that MTG needs to tackle host genome contamination and ameliorate the capacity to detect single-stranded DNA viruses in the future, and the MTT method requires an improvement in bacterial rRNA depletion prior to library preparation.

Keywords: performance comparison; species abundance; species richness; stochastic error; taxonomic rank; viral metagenomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

FIG 1
FIG 1
Bacterial and host contamination in high-throughput sequencing (HTS) clean data. (A to C) The alignment rates of small subunit (SSU) (A) and large subunit (LSU) (B) rRNA and bacterial marker (BM) (C) of these data were evaluated using ViromeQC 1.0. (D) These HTS data are composed of host genome, bacterial contamination (classified using Kraken2), and unclassified reads. MDA, multiple displacement amplification; MTG, direct metagenomic sequencing; SIA, sequence-independent single-primer amplification; MTT, meta-transcriptomic sequencing.
FIG 2
FIG 2
Virus operational taxonomic unit (vOTU) accumulation curves of MDA (A), MTG (B), SIA (C), and MTT (D) viromic techniques in repeated sequencing (RPS) (referring to the bottom axes) and ultradeep sequencing (UDS) (referring to the top axes).
FIG 3
FIG 3
Principal coordinates analysis (PCoA) analyses revealed that libraries (triangles) aggregated into four clusters corresponding to MDA, MTG, SIA, and MTT viromic techniques that show different preferences to vOTUs (filled circles) of double-stranded DNA (dsDNA) (A), single-stranded DNA (ssDNA) (B), circular single-stranded DNA (cssDNA) (C), reverse transcribing RNA (rtRNA) (D), double-stranded RNA (dsRNA) (E), single-stranded RNA (and ssRNA) (F). The circled libraries are rebuilt MTG and MTT ones that detach from the initial ones due to batch effects.
FIG 4
FIG 4
Stochastic errors of the four methods cause different variation to relative abundance (A) of taxa at the vOTU, genus, and family levels, which further affect Shannon (B) and Simpson (C) α-diversity indices and Bray-Curtis distances (D) between libraries. CV, coefficient of variation; RA, relative abundance. *, P < 0.05; **, P < 0.01; ***, P < 0.001; ****, P < 0.0001.
FIG 5
FIG 5
The variations in the relative abundance of vOTU, genus, and family taxa are negatively correlated with relative abundance.
FIG 6
FIG 6
Read numbers (left column), the complete genome coverage (middle column), and the linear relationship (right column) between the read abundance and the viral concentration (gene copies/μL) of five spiking viruses in the four viromic techniques. The colors in each panel represent the different viruses, as shown in the key at the bottom of the figure. RPKM, reads per kilobase per million mapped reads; PCV2, porcine circovirus 2; PPV1, porcine parvovirus 1; PRV, porcine pseudorabies virus; RVA, group A rotavirus; RABV, rabies virus.

Similar articles

Cited by

References

    1. Zhang Y-Z, Shi M, Holmes EC. 2018. Using metagenomics to characterize an expanding virosphere. Cell 172:1168–1172. doi:10.1016/j.cell.2018.02.043. - DOI - PubMed
    1. Mokili JL, Rohwer F, Dutilh BE. 2012. Metagenomics and future perspectives in virus discovery. Curr Opin Virol 2:63–77. doi:10.1016/j.coviro.2011.12.004. - DOI - PMC - PubMed
    1. Zhang Y-Z, Chen Y-M, Wang W, Qin X-C, Holmes EC. 2019. Expanding the RNA virosphere by unbiased metagenomics. Annu Rev Virol 6:119–139. doi:10.1146/annurev-virology-092818-015851. - DOI - PubMed
    1. Shkoporov AN, Hill C. 2019. Bacteriophages of the human gut: the “Known Unknown” of the microbiome. Cell Host Microbe 25:195–209. doi:10.1016/j.chom.2019.01.017. - DOI - PubMed
    1. Jain R, Rivera MC, Lake JA. 1999. Horizontal gene transfer among genomes: the complexity hypothesis. Proc Natl Acad Sci USA 96:3801–3806. doi:10.1073/pnas.96.7.3801. - DOI - PMC - PubMed

Publication types

LinkOut - more resources