Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Jun 15;86(2):e0000421.
doi: 10.1128/mmbr.00004-21. Epub 2022 Mar 21.

Computational Tools for the Analysis of Uncultivated Phage Genomes

Affiliations
Review

Computational Tools for the Analysis of Uncultivated Phage Genomes

Juan Sebastián Andrade-Martínez et al. Microbiol Mol Biol Rev. .

Abstract

Over a century of bacteriophage research has uncovered a plethora of fundamental aspects of their biology, ecology, and evolution. Furthermore, the introduction of community-level studies through metagenomics has revealed unprecedented insights on the impact that phages have on a range of ecological and physiological processes. It was not until the introduction of viral metagenomics that we began to grasp the astonishing breadth of genetic diversity encompassed by phage genomes. Novel phage genomes have been reported from a diverse range of biomes at an increasing rate, which has prompted the development of computational tools that support the multilevel characterization of these novel phages based solely on their genome sequences. The impact of these technologies has been so large that, together with MAGs (Metagenomic Assembled Genomes), we now have UViGs (Uncultivated Viral Genomes), which are now officially recognized by the International Committee for the Taxonomy of Viruses (ICTV), and new taxonomic groups can now be created based exclusively on genomic sequence information. Even though the available tools have immensely contributed to our knowledge of phage diversity and ecology, the ongoing surge in software programs makes it challenging to keep up with them and the purpose each one is designed for. Therefore, in this review, we describe a comprehensive set of currently available computational tools designed for the characterization of phage genome sequences, focusing on five specific analyses: (i) assembly and identification of phage and prophage sequences, (ii) phage genome annotation, (iii) phage taxonomic classification, (iv) phage-host interaction analysis, and (v) phage microdiversity.

Keywords: computational analysis; microdiversity; phage and prophage identification; phage annotation; phage metagenomics; phage taxonomy; phage-host interaction; uncultivated viruses; viromes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

FIG 1
FIG 1
Proposed workflow for the analysis of phage metagenomic data. Raw metagenomic reads are first filtered for contaminants and then assembled into contigs and binned. While in principle the identification of phage and/or prophage contigs could be omitted if the researcher knows that the reads are enriched for viruses, we suggest performing it as an additional way to filter contaminant nonviral bins. Phage or prophage contigs can then be subjected to genome annotation, taxonomic classification, microdiversity analysis, and host-association analysis. Moreover, while they are different analyses, genome annotation and taxonomic classification are usually done conjointly. While one can carry out microdiversity and/or phage-host analysis without prior annotation and/or taxonomic classification, the former two are usually done before either of the latter.

Similar articles

Cited by

References

    1. Garza DR, Dutilh BE. 2015. From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems. Cell Mol Life Sci 72:4287–4308. doi: 10.1007/s00018-015-2004-1. - DOI - PMC - PubMed
    1. Fancello L, Raoult D, Desnues C. 2012. Computational tools for viral metagenomics and their application in clinical research. Virology 434:162–174. doi: 10.1016/j.virol.2012.09.025. - DOI - PMC - PubMed
    1. Simmonds P, Adams MJ, Benkő M, Breitbart M, Brister JR, Carstens EB, Davison AJ, Delwart E, Gorbalenya AE, Harrach B, Hull R, King AMQ, Koonin EV, Krupovic M, Kuhn JH, Lefkowitz EJ, Nibert ML, Orton R, Roossinck MJ, Sabanadzovic S, Sullivan MB, Suttle CA, Tesh RB, van der Vlugt RA, Varsani A, Zerbini FM. 2017. Consensus statement: virus taxonomy in the age of metagenomics. Nat Rev Microbiol 15:161–168. doi: 10.1038/nrmicro.2016.177. - DOI - PubMed
    1. Edwards RA, Rohwer F. 2005. Viral metagenomics. Nat Rev Microbiol 3:504–510. doi: 10.1038/nrmicro1163. - DOI - PubMed
    1. Cook R, Brown N, Redgwell T, Rihtman B, Barnes M, Clokie M, Stekel DJ, Hobman J, Jones MA, Millard A. 2021. INfrastructure for a PHAge REference Database: identification of large-scale biases in the current collection of cultured phage genomes. Phage (New Rochelle) 2:214–223. doi: 10.1089/phage.2021.0007. - DOI - PMC - PubMed

Publication types

LinkOut - more resources