Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 18:11:775402.
doi: 10.3389/fcimb.2021.775402. eCollection 2021.

Streptococcus pneumoniae: a Plethora of Temperate Bacteriophages With a Role in Host Genome Rearrangement

Affiliations

Streptococcus pneumoniae: a Plethora of Temperate Bacteriophages With a Role in Host Genome Rearrangement

Antonio J Martín-Galiano et al. Front Cell Infect Microbiol. .

Abstract

Bacteriophages (phages) are viruses that infect bacteria. They are the most abundant biological entity on Earth (current estimates suggest there to be perhaps 1031 particles) and are found nearly everywhere. Temperate phages can integrate into the chromosome of their host, and prophages have been found in abundance in sequenced bacterial genomes. Prophages may modulate the virulence of their host in different ways, e.g., by the secretion of phage-encoded toxins or by mediating bacterial infectivity. Some 70% of Streptococcus pneumoniae (the pneumococcus)-a frequent cause of otitis media, pneumonia, bacteremia and meningitis-isolates harbor one or more prophages. In the present study, over 4000 S. pneumoniae genomes were examined for the presence of prophages, and nearly 90% were found to contain at least one prophage, either defective (47%) or present in full (43%). More than 7000 complete putative integrases, either of the tyrosine (6243) or serine (957) families, and 1210 full-sized endolysins (among them 1180 enzymes corresponding to 318 amino acid-long N-acetylmuramoyl-L-alanine amidases [LytAPPH]) were found. Based on their integration site, 26 different pneumococcal prophage groups were documented. Prophages coding for tRNAs, putative virulence factors and different methyltransferases were also detected. The members of one group of diverse prophages (PPH090) were found to integrate into the 3' end of the host lytASpn gene encoding the major S. pneumoniae autolysin without disrupting it. The great similarity of the lytASpn and lytAPPH genes (85-92% identity) allowed them to recombine, via an apparent integrase-independent mechanism, to produce different DNA rearrangements within the pneumococcal chromosome. This study provides a complete dataset that can be used to further analyze pneumococcal prophages, their evolutionary relationships, and their role in the pathogenesis of pneumococcal disease.

Keywords: Streptococcus pneumoniae; endolysin; genomic rearrangements; integrase; lytic enzymes; prophage; tRNAs; virulence factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Sequence similarities between PPHs genomes. (A) Venn diagrams showing equivalent or very similar pneumococcal prophages. Two PPH genomes were considered ‘equivalent’ when they overlapped for ≥90% of their length and showed ≥90% sequence identity (shown on a white background). When two genomes overlapped between 80 and 89% of their length, and nucleotide identities were ≥90%, the corresponding PPHs were considered as ‘very similar’. They are shown on a gray background. (B). wGRR similarity heatmap depicting the amino acid identity shared between pairs of prophage sequences (either full-length or defective prophages). Different colors were applied to clusters showing distance thresholds > 4. For agglomerative hierarchical clustering, distances are shown in the X axis. The dendrogram and heatmap python tools of the scipy.cluster.hierarchy library were used to depict final agglomerative hierarchical clustering and heatmap plots, respectively. Green and pink arrows point to new clusters or singletons respectively. The correspondence between PPHs and the cluster organization of pneumococcal prophages previously reported (Brueggemann et al., 2017; Rezaei Javan et al., 2019) is indicated at the upper part of this panel. See Table S4 for a proposed correlation between previously described pneumococcal prophages and PPH groups.
Figure 2
Figure 2
A proposal for phage integration into the S. pneumoniae genome after the recombination of lytASpn and lytA PPH. (A) The circular phage genome (red) and the pneumococcal chromosome (dotted line) are shown. Genes are drawn as arrows indicating the direction of transcription. An ‘X’ indicates the recombination event. The pneumococcal genes plyA and dinF that flank the lytASpn gene are shown for reference (Morales et al., 2015). The asterisk (*) indicates that the phage and bacterial lytA genes were probably chimeric because they contain sequences of both phage and bacterial origin. (B) Novel genomic arrangement of this plyA—lytA island in the pneumococcal strain SMRU257. The location of a gene (AD529_RS01375) encoding a large protein (3628 aa residues; WP_142367134) is shown. For comparison, the corresponding DNA region of the control D39 strain is shown at the bottom.
Figure 3
Figure 3
Organization of the DNA region located between the 3’ ends of the int and lytASpn * genes. Genes are not drawn to scale. Four prophages of the PPH090 family were examined. The intergenic region is longer in PPH090_4 (belonging to the serine Int family) than in the other PPHs with tyrosine Ints. The percentage nucleotide identities are indicated. The approximate positions of the attP core sequences are shown. Two of the prophages belong to the PPH015 group and the corresponding whole regions are 97.4% identical. Orange rectangles represent the 51 bp-long perfectly conserved sequence located immediately 3’ from lytA PPH and lytASpn *.
Figure 4
Figure 4
Sequence signatures of lytASpn * and lytA PPH_090* alleles. (A) Bacterial and prophage lytA genes of strains harboring PPH090 group phages were aligned and the positions where the bacterial (on one side) and phage (on the other) genes diverged were marked. For simplicity, the highly polymorphic region characteristic of the lytA gene (positions 421–480) (Morales et al., 2010) was not included in the alignment. The nucleotide positions (taking 1 as the first nucleotide of the ATG initiation codon) should be read vertically. (B) Distribution of polymorphic sites in bacterial and prophage alleles according to the alignment shown in panel (A) The vertical arrow points to the data at the polymorphic position 951. The total length of the lytA gene is 957 bp.
Figure 5
Figure 5
Schematic representation of several DNA rearrangements in the 3’ region of lytASpn *. Genes are represented as arrows indicating the direction of transcription and are numbered as those of the S. pneumoniae D39 strain (SPD_). Narrow open arrows indicate frameshifted open reading frames. The lytASpn gene (SPD_RS09250) is represented by a striped arrow. Panels (A–E) respectively show the rearrangements of genes located in the vicinity of SPD_RS00120 (coding for a nucleoside deaminase), SPD_RS00125 (ffs encoding the signal recognition particle sRNA), SPD_RS07415 (whiA), SPD_RS09110 (coding for a single-stranded DNA-binding protein), or within SPD_RS09885 (encoding ComGC, the major subunit of the competence pilus). Green and red arrows respectively indicate genes translocated or inverted with respect to their position in S. pneumoniae D39. In arrangement D2, an additional recombination event caused the inversion of several genes (indicated by hatched arrows). A gray square indicates the region located 3’ of lytASpn shown in detail in Figure 6 .
Figure 6
Figure 6
Diagram of the region located immediately downstream of the termination codon of lytASpn * in 14 pneumococcal strains with differently rearranged genomes. The region corresponds to that indicated with a gray square in Figure 4 . DNA regions sharing ≥95% nucleotide identity are shown with the same color and shadowing. The type of arrangement (A to E) is indicated in parentheses to the right of the name of each strain. The different core attachment sites are shown as diamonds. The deep blue rectangles represent the conserved 51 bp-long sequence mentioned in the text. The red rectangles correspond to the reverse complement of positions 24037–24121 of the S. pneumoniae D39 genome (between SPD_RS00120 encoding a nucleoside deaminase and SPD_RS00115 coding for an adenylosuccinate synthase). The purple rectangle corresponds to positions 1,712,485–1,712,386 (between SPD_RS09110 and SPD_RS09115 in the D39 chromosome). For additional information see Tables S6 and S8 . At the bottom, the nucleotide sequence surrounding the initiation codon of SPD_RS09110 in strains D39 and SMRU1319 (arrangement D2) is shown. The correct initiation codon is shown in red, bold font, and underlined. Another in-frame initiation codon is not underlined and apparently lacks a potential ribosome-binding site (RBS) (Acc. No. CRIC01000015). The predicted RBS of SPD_RS09110 is shown in pink lettering. The 14 bp-long repeat potentially responsible for a chromosomal inversion in strain SMRU1319 is inserted in a gray box. The att core sequence of PPH085 is underlined.

Similar articles

Cited by

References

    1. Achaz G., Coissac E., Netter P., Rocha E. P. C. (2003). Associations Between Inverted Repeats and the Structural Evolution of Bacterial Genomes. Genetics 164, 1279–1289. doi: 10.1093/genetics/164.4.1279 - DOI - PMC - PubMed
    1. Afzal M., Shafeeq S., Henriques-Normark B., Kuipers O. P. (2015). UlaR Activates Expression of the Ula Operon in Streptococcus Pneumoniae in the Presence of Ascorbic Acid. Microbiology 161, 41–49. doi: 10.1099/mic.0.083899-0 - DOI - PubMed
    1. Andam C. P., Worby C. J., Gierke R., McGee L., Pilishvili T., Hanage W. P. (2017). Penicillin Resistance of Nonvaccine Type Pneumococcus Before and After PCV13 Introduction, United States. Emerg. Infect. Dis. 23, 1012–1015. doi: 10.3201/eid2306.161331 - DOI - PMC - PubMed
    1. Arndt D., Grant J. R., Marcu A., Sajed T., Pon A., Liang Y., et al. (2016). PHASTER: A Better, Faster Version of the PHAST Phage Search Tool. Nucleic Acids Res. 44, W16–W21. doi: 10.1093/nar/gkw387 - DOI - PMC - PubMed
    1. Bailly-Bechet M., Vergassola M., Rocha E. (2007). Causes for the Intriguing Presence of tRNAs in Phages. Genome Res. 17, 1486–1495. doi: 10.1101/gr.6649807 - DOI - PMC - PubMed

Publication types

MeSH terms

Substances