Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 14:10:e67113.
doi: 10.7554/eLife.67113.

The role of interspecies recombination in the evolution of antibiotic-resistant pneumococci

Collaborators, Affiliations

The role of interspecies recombination in the evolution of antibiotic-resistant pneumococci

Joshua C D'Aeth et al. Elife. .

Abstract

Multidrug-resistant Streptococcus pneumoniae emerge through the modification of core genome loci by interspecies homologous recombinations, and acquisition of gene cassettes. Both occurred in the otherwise contrasting histories of the antibiotic-resistant S. pneumoniae lineages PMEN3 and PMEN9. A single PMEN3 clade spread globally, evading vaccine-induced immunity through frequent serotype switching, whereas locally circulating PMEN9 clades independently gained resistance. Both lineages repeatedly integrated Tn916-type and Tn1207.1-type elements, conferring tetracycline and macrolide resistance, respectively, through homologous recombination importing sequences originating in other species. A species-wide dataset found over 100 instances of such interspecific acquisitions of resistance cassettes and flanking homologous arms. Phylodynamic analysis of the most commonly sampled Tn1207.1-type insertion in PMEN9, originating from a commensal and disrupting a competence gene, suggested its expansion across Germany was driven by a high ratio of macrolide-to-β-lactam consumption. Hence, selection from antibiotic consumption was sufficient for these atypically large recombinations to overcome species boundaries across the pneumococcal chromosome.

Keywords: AMR; epidemiology; genetics; genomics; infectious disease; microbiology; recombination; streptococcus pneumoniae.

PubMed Disclaimer

Conflict of interest statement

JD, Mv, LM, Hd, PT, JS, SL, RG, RS, KK, WH, RB, BB, SB No competing interests declared, NC has consulted for Antigen Discovery Inc Has received an investigator-initiated award from GlaxoSmithKline.

Figures

Figure 1.
Figure 1.. Phylogenomic analysis of PMEN3 lineage.
(A) Maximum likelihood phylogeny generated from the nonrecombinant regions of the whole-genome alignment of 669 isolates from the PMEN3 lineage. Branches are colored by clade, as identified in the key. Units for the scale bar are the number of point mutations along a branch. (B) Bars highlighting the country of origin; serotype; resistance to penicillin, trimethoprim and sulfamethoxazole, and the presence of the mobile genetic elements (MGEs) Tn1207.1 and Tn916 among isolates. The abbreviated resistance phenotypes are resistant (R), intermediate-level resistant (I), and susceptible (S). Bars map across to isolates on the phylogeny. (C) Simplified genome annotation of the PMEN3 reference isolate RMV4. The highlighted regions correspond to peaks of recombination event frequency. Blue bars represent individual genes annotated within the assembly. (D) Distribution of recombination events across the PMEN3 lineage. In the upper half of the graph, red bars indicate recombination events occurring on internal nodes in the tree, which were subsequently inherited by multiple descendant isolates. These bars align with isolates in the phylogeny in section A and map to regions in the genome annotated in section C. Blue bars indicate recombination events on terminal branches of the tree, identified in only one isolate. In the bottom half of the graph, the line represents the frequency of recombination events along the genome’s length.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Root-to-tip analysis of PMEN3 lineage.
(A) The 663-isolate phylogeny of PMEN3 with node tips colored by date of isolation. The tree is topologically identical to that in Figure 1, except that six isolates for which the dates of isolation were unknown were omitted. (B) Linear regression of root-to-tip distance against sampling date for isolates.
Figure 1—figure supplement 2.
Figure 1—figure supplement 2.. Serotype switching events across the PMEN3 lineage.
Colored bars represent the recombination events associated with switches in serotype inferred from the phylogeny. The bars map to the genome annotation below, with the length of the bar indicating the size of the recombination event. The cps and surrounding loci are annotated at the bottom. Some recombination events span the pbp1a and pbp2x genes, as well as the cps locus itself. The black bars represent recombinations across the cps loci that were not associated with a serotype switch.
Figure 2.
Figure 2.. Phylogenomic analysis of PMEN9 lineage.
(A) Maximum likelihood phylogeny generated from the nonrecombinant regions of a whole-genome alignment of 575 isolates from the PMEN9 lineage. Branches are colored by clade, as identified in the key. Units for the scale bar are the number of point mutations along a branch. (B) Bars highlighting the country of origin; serotype; resistance to penicillin, trimethoprim, and sulfamethoxazole, and the presence of the mobile genetic elements (MGEs) Tn1207.1 and Tn916 among isolates. The abbreviated resistance phenotypes are resistant (R), intermediate-level resistant (I), and susceptible (S). Bars map across to isolates on the phylogeny. (C) Simplified annotated genome of the PMEN9 reference isolate INV200. The highlighted regions correspond to peaks of recombination event frequency. Blue bars represent individual genes annotated within the assembly. (D) Distribution of recombination events across the PMEN9 lineage. In the upper half of the graph, red bars indicate recombination events occurring on internal nodes in the tree, which were subsequently inherited by multiple descendant isolates. These bars are aligned with isolates in the phylogeny in section A and map to regions in the genome annotated in section C. Blue bars indicate recombination events on terminal nodes of the tree, occurring in only one isolate. In the bottom half of the graph, the line represents the frequency of recombination events along the genome’s length.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Root-to-tip analysis of PMEN9 lineage.
(A) The 529-isolate phylogeny of PMEN9, with node tips colored by date of isolation. The tree is topologically identical to that in Figure 2, except that isolates for which the dates of isolate were unknown, and a divergent clade of isolates of serotype 7C and 19A, were omitted. (B) Linear regression of root-to-tip distance against sampling date for isolates.
Figure 2—figure supplement 2.
Figure 2—figure supplement 2.. Serotype switching events across the PMEN9 lineage.
Colored bars represent the recombination events associated with switches in serotype. The bars map to the genome annotation below, with the length of the bar indicating the size of the recombination event. The cps and surrounding loci are annotated at the bottom. Some recombination events span the pbp1a and pbp2x genes, as well as the cps locus itself. The black bars represent recombinations across the cps locus that were not associated with a serotype switch.
Figure 3.
Figure 3.. Summary of recombination differences between the PMEN3 and PMEN9 lineages.
(A) Plotting the SNP density of recombination events relative to their length across both lineages. Dots represent individual recombination events and are colored by the lineage in which they were inferred. (B) Overlaid histograms of the SNP densities of recombination events across PMEN3 and PMEN9. (C) Overlaid histograms of the length, in bases, of each recombination event across PMEN3 and PMEN9.
Figure 4.
Figure 4.. Emergence of resistant lineages within PMEN3 through time.
(A) Time-calibrated phylogeny of PMEN3. Branches are colored by inferred resistance phenotype. Pie charts present at nodes represent the inferred probability of each phenotype by an ancestral reconstruction. Blue bars across the nodes represent the 95% credible interval for the age of the node. (B) The reconstructed absolute number of branches per resistant phenotype through time. (C) The proportion of total branches over time reconstructed as having each of the resistance phenotypes.
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. Comparisons of category agreement across different penicillin resistance breakpoints.
The minimum inhibitory concentrations (MICs) of each isolate in the PMEN collection were predicted using a random forest (RF) model trained on a 4342-isolate CDC dataset. These MICs were then assigned to resistance phenotypes using two different sets of breakpoints. The 903 PMEN lineage isolates with available MIC data were used as a test dataset to validate the predictions. Bars represent the category agreement (the number of categories correctly predicted across the testing or training dataset) for the two breakpoint sets.
Figure 4—figure supplement 2.
Figure 4—figure supplement 2.. Histograms of recorded minimum inhibitory concentration (MIC) values for penicillin across the PMEN3 and PMEN9 lineages.
(A) MIC values for penicillin for the primarily-resistant clades ST156 and 19A within the PMEN3 lineage (N = 313 for the ST156 clade and N = 25 for the 19A clade). (B) MIC for penicillin across both PMEN3 and PMEN9 (N = 438 for PMEN3 and N = 465 for PMEN9). (C) MIC values for penicillin for the primarily resistant USA and South African clades within the PMEN9 lineage (N = 68 for the USA clade and N = 64 for the South African clade).
Figure 4—figure supplement 3.
Figure 4—figure supplement 3.. Origin of pbp genes for penicillin-resistant isolates.
Scatterplot and violin plot depicting the distribution of γ scores for pbp gene sequences within the GPS collection, where these genes were present within an homologous recombination associated with a gain of resistance to penicillin. Dots represent the individual observations of γ scores, and the violin plot summarizes their overall distribution.
Figure 4—figure supplement 4.
Figure 4—figure supplement 4.. Analysis of the origin of the murM gene across the PMEN3 and PMEN9 lineages.
(A.1) The whole-genome phylogeny of the PMEN3 lineage, as shown in Figure 1, with branches colored by clade of interest, as described in the main text. (A.2) Bars indicating the penicillin resistance category of isolates, and the lineage inferred for the murM gene of each isolate. (A.3) Representation of the murM gene. (A.4) This panel shows the inferred lineage for each base of the murM sequence across the PMEN3 phylogeny. Solid unbroken horizontal bars indicate sequences belonging to a particular lineage, as indicated by the color. Changes in colour across a bar indicate different murM segments were inferred to originate in different lineages, suggesting a mosaic allele generated by recombination. (B.1) Recombination-corrected whole-genome phylogeny of the PMEN9 lineage, as shown in Figure 2, with branches colored by clade of interest. (B.2) Bars indicating the penicillin resistance category of isolates and the overall lineage inferred for the murM gene of each isolate. (B.3) Representation of the murM gene. (B.4) This panel shows the inferred lineage for each base of each murM sequence across the PMEN9 phylogeny, as in panel (A.4).
Figure 5.
Figure 5.. Expansion of a macrolide-resistant clade in Germany prior to vaccine introduction.
(A) The ratio of macrolide-to-β-lactam consumption in Germany. (B) The change in Ne through time inferred by Skygrowth, with the red line showing the results of the analysis that did not include covariates, and the blue line showing the results of the analysis that incorporated the macrolide-to-β-lactam ratio into the phylodynamic reconstruction. Shaded regions represent the 95% credible intervals. (C) The reconstruction of the growth rate of Ne through time. The red line represents the result of model fitting without covariates, and the blue line when the macrolide-to-β-lactam ratio data were incorporated. Shaded regions represent the 95% credible interval for the reconstruction.
Figure 5—figure supplement 1.
Figure 5—figure supplement 1.. Ratio of macrolide-to-β-lactam consumption in Europe.
The ratios of macrolide use, in DDD, for β-lactam use for six major European countries between 1997 and 2010.
Figure 5—figure supplement 2.
Figure 5—figure supplement 2.. Root-to-tip analysis of 162 German isolates within PMEN9.
(A) The 162-isolate phylogeny with node tips colored by date of isolation. (B) Linear regression of root-to-tip distance against sampling date for isolates.
Figure 5—figure supplement 3.
Figure 5—figure supplement 3.. Skygrowth analysis incorporating β-lactam consumption data.
(A) The rate of β-lactam consumption in Germany in DDD. (B) The change in Ne through time inferred by Sygrowth, with the red line showing the results when no covariates were incorporated, and the blue line showing the equivalent results when β-lactam consumption data were incorporated into the reconstruction. Shaded regions represent the 95% credible intervals. (C) The reconstruction of the growth rate of Ne through time. The red line represents the result of model fitting without covariates, and the blue line shows the results when β-lactam consumption data were incorporated. Shaded regions represent the 95% credible intervals for the reconstructions.
Figure 5—figure supplement 4.
Figure 5—figure supplement 4.. Skygrowth analysis incorporating macrolide consumption data.
(A) The rate of macrolide consumption in Germany in DDD. (B) The change in Ne through time inferred by Skygrowth, with the red line representative of when no covariates were incorporated into the reconstruction and the blue line when the macrolide conusmption data were incorporated into the reconstruction. Shaded regions represent the 95% credible intervals. (C) The reconstruction of the growth rate of Ne through time. The red line represents the result of model fitting without covariates, and the blue line shows the results when the macrolide consumption data were incorporated. Shaded regions represents the 95% credible interval for the reconstructions.
Figure 5—figure supplement 5.
Figure 5—figure supplement 5.. Insertion of Tn1207.1 within PMEN9 reference genome.
Comparison of the Tn1207.1 element insertion, highlighted in pink within the INV200 genome, with the orthologous unmodified locus in sample 2456_01. The red bands between the genomes represent sequence matches identified by BLASTN. These bars are shaded by the percentage identity between the sequences. The intact comEC gene is colored cyan within 2456_01, where the fragments of this gene generated by the Tn1207.1 insertion are colored brown in INV200. Arrows along the INV200 genome mark the start and end of the recombination event inferred to have imported Tn1207.1 by the phylogenetic analyses.
Figure 6.
Figure 6.. Detected insertion points of Tn1207.1-type elements within S. pneumoniae.
Genome of the reference S. pneumoniae RMV4 isolate (ENA accession code: ERS1681526) annotated with genes that Tn1207.1-type elements have inserted either into, or adjacent to, among the collection. Only genes present within this mobile genetic element (MGE)-free reference are annotated. Gray bars represent coding sequences (CDS): lighter gray bars represent CDS annotated on the forward strand of the genome, darker gray bars represent those on the reverse strand of the genome. The inner heat map represents the number of isolates that have hits inserted into, or adjacent to, the annotated genes. The color scale is logarithmically transformed.
Figure 6—figure supplement 1.
Figure 6—figure supplement 1.. Insertion of a Tn1207.1-type element as Mega within tag.
Comparison of a Tn1207.1-type element, a Mega cassette, highlighted in pink within the 1018_00 genome, with the orthologous unmodified locus in INV200. Data are shown as described in Figure 5—figure supplement 5.
Figure 7.
Figure 7.. Detected insertion points of Tn916-type elements within S. pneumoniae.
Annotated genome of the reference S. pneumoniae RMV4 isolate (ENA accession number: ERS1681526) with genes where Tn916-type elements have inserted either into, or adjacent to, among the collection. Only genes present within this mobile genetic element (MGE)-free reference are annotated. Gray bars represent coding sequences (CDS): lighter gray bars indicate CDS encoded by the forward strand of the genome, darker gray bars indicate CDS encoded by the reverse strand of the genome. The inner heat map represents the number of isolates that have hits inserted into, or adjacent to, each of the annotated genes. The color scale is logarithmically transformed.
Figure 7—figure supplement 1.
Figure 7—figure supplement 1.. Insertion of a Tn916-type element downstream of recJ.
Comparison of the Tn916-type element insertion, highlighted in pink within the PT2807 genome, with the orthologous unmodified locus in the RMV4 sample. Data shown are as described in Figure 5—figure supplement 5.
Figure 7—figure supplement 2.
Figure 7—figure supplement 2.. Insertion of a Tn916-type element downstream of gmuF.
Comparison of the Tn916-type element insertion, highlighted in pink within the LMG423 genome, with the orthologous unmodified locus in the RMV4 sample. Data shown are as described in Figure 5—figure supplement 5.
Figure 7—figure supplement 3.
Figure 7—figure supplement 3.. Insertion of a Tn916-type element upstream of gidB.
Comparison of the Tn916-type element insertion, highlighted in pink within the SPN11900 genome, with the orthologous unmodified locus in the RMV4 sample. Data shown are as described in Figure 5—figure supplement 5.
Figure 7—figure supplement 4.
Figure 7—figure supplement 4.. Insertion of a Tn916-type element upstream of rplL.
Comparison of the Tn916-type element insertion, highlighted in pink within the SPN11878 genome, with the orthologous unmodified locus in the RMV4 sample. Data shown are as described in Figure 5—figure supplement 5.
Figure 7—figure supplement 5.
Figure 7—figure supplement 5.. Insertion of a Tn916-type element upstream of rplL.
Comparison of the Tn916-type element insertion, highlighted in pink within the GPS_SI_790_P genome, and the orthologous unmodified locus in the RMV4 sample with no insertion. Data are as described in Figure 5—figure supplement 5.
Figure 7—figure supplement 6.
Figure 7—figure supplement 6.. Insertion of a Tn916-type element upstream of rplL.
Comparison of the Tn916-type element insertion, highlighted in pink within the GPS_IL_9193 genome, with the orthologous unmodified locus in the RMV4 sample with no insertion. Data are as described in Figure 5—figure supplement 5.
Figure 8.
Figure 8.. Comparison of length and SNP density of recombination events.
(A) Scatter plot of the SNP density and size of homologous recombinations, with events classified based on whether they imported Tn916-type or Tn1207.10-type mobile genetic elements (MGEs). Blue contour lines summarize the density of all points. (B) Overlaid histogram comparing the SNP density of recombination events that imported MGEs relative to those that did not. (C) Overlaid histogram comparing the length of recombinations that imported MGEs relative to those that did not.
Figure 9.
Figure 9.. Identification of likely sources of Tn916-type and Tn1207.1-type elements.
Flanking regions upstream and downstream from mobile genetic element (MGE) insertion sites were compared to a reference streptococcal database. Lines represent the proportion of matches, across all element acquisitions reconstructed as having occurred through homologous recombination, that correspond to the four species present in the reference streptococcal database. The dashed unmodified locus lines represent data from the orthologous regions of isolates without the MGE insertion. These proportions were calculated at 500 bp increments over 7.5 kb-long flanking regions either side of the insertion site.
Figure 9—figure supplement 1.
Figure 9—figure supplement 1.. Flanking region origin for Tn1207.1-type element tag insertions.
(A) The median γ score of upstream flanking regions of the Tn1207.1-type element insertions into the tag gene, highlighted in coral. The median γ score for the orthologous regions in isolates without the mobile genetic element (MGE), not expected to be modified by homologous recombination, are highlighted in cyan. Shaded regions represent the interquartile range (IQR) of the γ score. (B) The γ score for regions extracted downstream of the insertion. Shaded regions represent the IQR of the γ score. (C) The distribution of γ scores across both flanks, upstream and downstream, of the Tn1207.1-type element tag insertions. Data are shown separately for isolates with this particular insert, and those without Tn1207.1-type elements integrated in this orthologous region.
Figure 10.
Figure 10.. Overview of pipeline used to analyze the acquisition of Tn916-type and Tn1207.1-type elements.
Red boxes represent data input into the pipeline, blue boxes the individual analysis steps within the pipeline, and purple the pipeline’s output.

References

    1. Adrian PV, Klugman KP. Mutations in the dihydrofolate reductase gene of trimethoprim-resistant isolates of Streptococcus pneumoniae. Antimicrobial Agents and Chemotherapy. 1997;41:2406–2413. doi: 10.1128/AAC.41.11.2406. - DOI - PMC - PubMed
    1. Apagyi KJ, Fraser C, Croucher NJ. Transformation asymmetry and the evolution of the bacterial accessory genome. Molecular Biology and Evolution. 2018;35:575–581. doi: 10.1093/molbev/msx309. - DOI - PMC - PubMed
    1. Appelbaum PC. Worldwide development of antibiotic resistance in pneumococci. European Journal of Clinical Microbiology. 1987;8:367–377. doi: 10.1007/BF02013089. - DOI - PubMed
    1. Baker KS, Dallman TJ, Field N, Childs T, Mitchell H, Day M, Weill F-X, Lefèvre S, Tourdjman M, Hughes G, Jenkins C, Thomson N. Horizontal antimicrobial resistance transfer drives epidemics of multiple Shigella species. Nature Communications. 2018;9:1462. doi: 10.1038/s41467-018-03949-8. - DOI - PMC - PubMed
    1. Bergé M, Moscoso M, Prudhomme M, Martin B, Claverys JP. Uptake of transforming DNA in Gram-positive Bacteria: a view from Streptococcus pneumoniae. Molecular Microbiology. 2002;45:411–421. doi: 10.1046/j.1365-2958.2002.03013.x. - DOI - PubMed

Publication types