VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses
- PMID: 33522966
- PMCID: PMC7852108
- DOI: 10.1186/s40168-020-00990-y
VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses
Abstract
Background: Viruses are a significant player in many biosphere and human ecosystems, but most signals remain "hidden" in metagenomic/metatranscriptomic sequence datasets due to the lack of universal gene markers, database representatives, and insufficiently advanced identification tools.
Results: Here, we introduce VirSorter2, a DNA and RNA virus identification tool that leverages genome-informed database advances across a collection of customized automatic classifiers to improve the accuracy and range of virus sequence detection. When benchmarked against genomes from both isolated and uncultivated viruses, VirSorter2 uniquely performed consistently with high accuracy (F1-score > 0.8) across viral diversity, while all other tools under-detected viruses outside of the group most represented in reference databases (i.e., those in the order Caudovirales). Among the tools evaluated, VirSorter2 was also uniquely able to minimize errors associated with atypical cellular sequences including eukaryotic genomes and plasmids. Finally, as the virosphere exploration unravels novel viral sequences, VirSorter2's modular design makes it inherently able to expand to new types of viruses via the design of new classifiers to maintain maximal sensitivity and specificity.
Conclusion: With multi-classifier and modular design, VirSorter2 demonstrates higher overall accuracy across major viral groups and will advance our knowledge of virus evolution, diversity, and virus-microbe interaction in various ecosystems. Source code of VirSorter2 is freely available ( https://bitbucket.org/MAVERICLab/virsorter2 ), and VirSorter2 is also available both on bioconda and as an iVirus app on CyVerse ( https://de.cyverse.org/de ). Video abstract.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures





Similar articles
-
Phytovirome Analysis of Wild Plant Populations: Comparison of Double-Stranded RNA and Virion-Associated Nucleic Acid Metagenomic Approaches.J Virol. 2019 Dec 12;94(1):e01462-19. doi: 10.1128/JVI.01462-19. Print 2019 Dec 12. J Virol. 2019. PMID: 31597769 Free PMC article.
-
Unveiling Crucivirus Diversity by Mining Metagenomic Data.mBio. 2020 Sep 1;11(5):e01410-20. doi: 10.1128/mBio.01410-20. mBio. 2020. PMID: 32873755 Free PMC article.
-
Benchmarking informatics approaches for virus discovery: caution is needed when combining in silico identification methods.mSystems. 2024 Mar 19;9(3):e0110523. doi: 10.1128/msystems.01105-23. Epub 2024 Feb 20. mSystems. 2024. PMID: 38376167 Free PMC article.
-
Metagenomic characterization of viral communities in corals: mining biological signal from methodological noise.Environ Microbiol. 2015 Oct;17(10):3440-9. doi: 10.1111/1462-2920.12803. Epub 2015 Mar 27. Environ Microbiol. 2015. PMID: 25708646 Review.
-
Bats as Viral Reservoirs.Annu Rev Virol. 2016 Sep 29;3(1):77-99. doi: 10.1146/annurev-virology-110615-042203. Epub 2016 Aug 22. Annu Rev Virol. 2016. PMID: 27578437 Review.
Cited by
-
Phage-plasmids promote recombination and emergence of phages and plasmids.Nat Commun. 2024 Feb 20;15(1):1545. doi: 10.1038/s41467-024-45757-3. Nat Commun. 2024. PMID: 38378896 Free PMC article.
-
Global diversity and ecological functions of viruses inhabiting oil reservoirs.Nat Commun. 2024 Aug 8;15(1):6789. doi: 10.1038/s41467-024-51101-6. Nat Commun. 2024. PMID: 39117673 Free PMC article.
-
Community ecology and functional potential of bacteria, archaea, eukarya and viruses in Guerrero Negro microbial mat.Sci Rep. 2024 Jan 31;14(1):2561. doi: 10.1038/s41598-024-52626-y. Sci Rep. 2024. PMID: 38297006 Free PMC article.
-
Fecal microbiota transplantation alters gut phage communities in a clinical trial for obesity.Microbiome. 2024 Jul 6;12(1):122. doi: 10.1186/s40168-024-01833-w. Microbiome. 2024. PMID: 38970126 Free PMC article. Clinical Trial.
-
Global diversity and distribution of prophages are lineage-specific within the Ralstonia solanacearum species complex.BMC Genomics. 2022 Oct 6;23(1):689. doi: 10.1186/s12864-022-08909-7. BMC Genomics. 2022. PMID: 36199029 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources