Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 27;16(4):513.
doi: 10.3390/v16040513.

Genome Analysis of Epsilon CrAss-like Phages

Affiliations

Genome Analysis of Epsilon CrAss-like Phages

Igor V Babkin et al. Viruses. .

Abstract

CrAss-like phages play an important role in maintaining ecological balance in the human intestinal microbiome. However, their genetic diversity and lifestyle are still insufficiently studied. In this study, a novel CrAssE-Sib phage genome belonging to the epsilon crAss-like phage genomes was found. Comparative analysis indicated that epsilon crAss-like phages are divided into two putative genera, which were proposed to be named Epsilonunovirus and Epsilonduovirus; CrAssE-Sib belongs to the former. The crAssE-Sib genome contains a diversity-generating retroelement (DGR) cassette with all essential elements, including the reverse transcriptase (RT) and receptor binding protein (RBP) genes. However, this RT contains the GxxxSP motif in its fourth domain instead of the usual GxxxSQ motif found in all known phage and bacterial DGRs. RBP encoded by CrAssE-Sib and other Epsilonunoviruses has an unusual structure, and no similar phage proteins were found. In addition, crAssE-Sib and other Epsilonunoviruses encode conserved prophage repressor and anti-repressors that could be involved in lysogenic-to-lytic cycle switches. Notably, DNA primase sequences of epsilon crAss-like phages are not included in the monophyletic group formed by the DNA primases of all other crAss-like phages. Therefore, epsilon crAss-like phage substantially differ from other crAss-like phages, indicating the need to classify these phages into a separate family.

Keywords: crAss-like phages; diversity-generating retroelements; genome; repressors/anti-repressors; reverse transcriptase; virus taxonomy.

PubMed Disclaimer

Conflict of interest statement

All co-authors have seen and agree with the contents of the manuscript and the order of authors, and there is no financial interest to report. All co-authors declare that they have no conflicts of interest.

Figures

Figure 1
Figure 1
The whole genome map of crAssE-Sib phage. ORFs are colored according to their proposed function: DNA replication—blue; head assembly and structural—green; transcription—black; reverse transcriptase—red; other ORFs are colored with magenta.
Figure 2
Figure 2
ViPTree analysis of the crAssE-Sib phage. Dendrogram plotted by ViPTree version 3.7 using crAssE-Sib and phages with SG values > 0.001. The crAssE-Sib phage is marked with a blue asterisk; phi14:2 phage is marked with a black triangle; phage sequences that were downloaded from the NCBI GenBank manually are marked with red phylogenetic branches.
Figure 3
Figure 3
Bidirectional clustering heatmap visualizing VIRIDIC-generated similarity matrix for crAssE-Sib and related phage genomes. The crAssE-Sib phage is marked with a blue asterisk.
Figure 4
Figure 4
Comparative genome alignment of the crAssE-Sib and close phage genomes. Analysis was performed using VipTree software v. 3.7. The percentage of sequence similarity is indicated in color; the color scale is shown at the top.
Figure 5
Figure 5
Maximum likelihood phylogenetic tree of the crAssE-Sib terminase large subunit generated using IQ-tree software v. 2.0.5. The crAssE-Sib phage is marked with a black circle. Nodes with 95% statistical significance calculated from 1000 ultrafast bootstrap (UFBOOT) replicates are marked with asterisks. The scale bar represents the number of substitutions per site.
Figure 6
Figure 6
Maximum likelihood phylogenetic tree of the crAssE-Sib primase generated using IQ-tree software. The crAssE-Sib phage is marked with a black circle. Nodes with 95% statistical significance calculated from 1000 ultrafast bootstrap (UFBOOT) replicates are marked with asterisks. The scale bar represents the number of substitutions per site.
Figure 7
Figure 7
Maximum likelihood phylogenetic tree of the crAssE-Sib RNA polymerase generated using IQ-tree software. The crAssE-Sib phage is marked with a black circle. Nodes with 95% statistical significance calculated from 1000 ultrafast bootstrap (UFBOOT) replicates are marked with asterisks. The scale bar represents the number of substitutions per site.
Figure 8
Figure 8
Schematic structure of DGR cassettes found in the genomes of crAssE-Sib and other members of the Epsilonunovirus genus. Schematic structure of DGR cassettes found in the genomes of crAssE-Sib and other members of the Epsilonunovirus genus. DGRs contain the receptor binding protein (RBP—green arrow) and reverse transcriptase (RT—red arrow) genes as well as template repeat (TR—light blue) with IMH* sequence (orange) and variable repeat (VR—blue) with initiation of the mutagenic homing (IMH—yellow) sequence. RT is fragmented or absent in many Epsilonunoviruses. Empty arrows indicate hypothetical proteins.
Figure 9
Figure 9
Multiple sequence alignment representations of aa sequences of the VRs region of crAssE-Sib and related phages.
Figure 10
Figure 10
Comparison between 3D model of crAssE-Sib receptor binding protein (RBP) and experimental structures of TaqVP protein from Thermus aquaticus (pdb id 5VF4) and Mtd protein of the BPP-1 phage (pdb id 1YU0). VR regions are in green; magenta alpha helices show similar orientation of the molecules. All molecules are on the same scale.
Figure 11
Figure 11
Maximum likelihood phylogenetic tree of the crAssE-Sib RT generated using IQ-tree software. The crAssE-Sib phage is marked with a black circle. Nodes with 95% statistical significance are marked with asterisks calculated from 1000 ultrafast bootstrap (UFBOOT) replicates. The scale bar represents the number of substitutions per site.

References

    1. Dutilh B.E., Cassman N., McNair K., Sanchez S.E., Silva G.G.Z., Boling L., Barr J.J., Speth D.R., Seguritan V., Aziz R.K., et al. A Highly Abundant Bacteriophage Discovered in the Unknown Sequences of Human Faecal Metagenomes. Nat. Commun. 2014;5:4498. doi: 10.1038/ncomms5498. - DOI - PMC - PubMed
    1. Yutin N., Makarova K.S., Gussow A.B., Krupovic M., Segall A., Edwards R.A., Koonin E.V. Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut. Nat. Microbiol. 2018;3:38–46. doi: 10.1038/s41564-017-0053-y. - DOI - PMC - PubMed
    1. Guerin E., Shkoporov A., Stockdale S.R., Clooney A.G., Ryan F.J., Sutton T.D.S., Draper L.A., Gonzalez-Tortuero E., Ross R.P., Hill C. Biology and taxonomy of crAss-like Bacteriophages, the most abundant virus in the human gut. Cell Host Microbe. 2018;24:653–664. doi: 10.1016/j.chom.2018.10.002. - DOI - PubMed
    1. Smith L., Goldobina E., Govi B., Shkoporov A.N. Bacteriophages of the Order Crassvirales: What Do We Currently Know about This Keystone Component of the Human Gut Virome? Biomolecules. 2023;13:584. doi: 10.3390/biom13040584. - DOI - PMC - PubMed
    1. Shkoporov A.N., Khokhlova E.V., Fitzgerald C.B., Stockdale S.R., Draper L.A., Ross R.P., Hill C. ΦCrAss001 Represents the Most Abundant Bacteriophage Family in the Human Gut and Infects Bacteroides intestinalis. Nat. Commun. 2018;9:4781. doi: 10.1038/s41467-018-07225-7. - DOI - PMC - PubMed

Publication types

LinkOut - more resources