TAR-VIR: a pipeline for TARgeted VIRal strain reconstruction from metagenomic data
- PMID: 31164077
- PMCID: PMC6549370
- DOI: 10.1186/s12859-019-2878-2
TAR-VIR: a pipeline for TARgeted VIRal strain reconstruction from metagenomic data
Abstract
Background: Strain-level RNA virus characterization is essential for developing prevention and treatment strategies. Viral metagenomic data, which can contain sequences of both known and novel viruses, provide new opportunities for characterizing RNA viruses. Although there are a number of pipelines for analyzing viruses in metagenomic data, they have different limitations. First, viruses that lack closely related reference genomes cannot be detected with high sensitivity. Second, strain-level analysis is usually missing.
Results: In this study, we developed a hybrid pipeline named TAR-VIR that reconstructs viral strains without relying on complete or high-quality reference genomes. It is optimized for identifying RNA viruses from metagenomic data by combining an effective read classification method and our in-house strain-level de novo assembly tool. TAR-VIR was tested on both simulated and real viral metagenomic data sets. The results demonstrated that TAR-VIR competes favorably with other tested tools.
Conclusion: TAR-VIR can be used standalone for viral strain reconstruction from metagenomic data. Or, its read recruiting stage can be used with other de novo assembly tools for superior viral functional and taxonomic analyses. The source code and the documentation of TAR-VIR are available at https://github.com/chjiao/TAR-VIR .
Keywords: RNA virus; Read classification; Strain assembly; Viral metagenomics.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures



Similar articles
-
CHEER: HierarCHical taxonomic classification for viral mEtagEnomic data via deep leaRning.Methods. 2021 May;189:95-103. doi: 10.1016/j.ymeth.2020.05.018. Epub 2020 May 23. Methods. 2021. PMID: 32454212 Free PMC article.
-
De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline.BMC Bioinformatics. 2017 Apr 26;18(1):223. doi: 10.1186/s12859-017-1630-z. BMC Bioinformatics. 2017. PMID: 28446139 Free PMC article.
-
A binning tool to reconstruct viral haplotypes from assembled contigs.BMC Bioinformatics. 2019 Nov 4;20(1):544. doi: 10.1186/s12859-019-3138-1. BMC Bioinformatics. 2019. PMID: 31684876 Free PMC article.
-
Metagenomic characterization of viral communities in corals: mining biological signal from methodological noise.Environ Microbiol. 2015 Oct;17(10):3440-9. doi: 10.1111/1462-2920.12803. Epub 2015 Mar 27. Environ Microbiol. 2015. PMID: 25708646 Review.
-
Overview of Virus Metagenomic Classification Methods and Their Biological Applications.Front Microbiol. 2018 Apr 23;9:749. doi: 10.3389/fmicb.2018.00749. eCollection 2018. Front Microbiol. 2018. PMID: 29740407 Free PMC article. Review.
Cited by
-
Agnostic Sequencing for Detection of Viral Pathogens.Clin Microbiol Rev. 2023 Mar 23;36(1):e0011922. doi: 10.1128/cmr.00119-22. Epub 2023 Feb 27. Clin Microbiol Rev. 2023. PMID: 36847515 Free PMC article. Review.
-
VirStrain: a strain identification tool for RNA viruses.Genome Biol. 2022 Jan 31;23(1):38. doi: 10.1186/s13059-022-02609-x. Genome Biol. 2022. PMID: 35101081 Free PMC article.
-
Current trends in RNA virus detection through metatranscriptome sequencing data.FEBS Open Bio. 2023 Jun;13(6):992-1000. doi: 10.1002/2211-5463.13626. Epub 2023 May 20. FEBS Open Bio. 2023. PMID: 37163224 Free PMC article. Review.
-
UnCoVar: a reproducible and scalable workflow for transparent and robust virus variant calling and lineage assignment using SARS-CoV-2 as an example.BMC Genomics. 2024 Jun 28;25(1):647. doi: 10.1186/s12864-024-10539-0. BMC Genomics. 2024. PMID: 38943066 Free PMC article.
-
Setu: a pipeline for the robust assembly of SARS-CoV-2 genomes.Microbiol Resour Announc. 2024 Jul 18;13(7):e0023724. doi: 10.1128/mra.00237-24. Epub 2024 Jun 7. Microbiol Resour Announc. 2024. PMID: 38847537 Free PMC article.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases