Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 21;8(1):e0046922.
doi: 10.1128/msphere.00469-22. Epub 2022 Dec 12.

Genomic Characterization of Skin and Soft Tissue Streptococcus pyogenes Isolates from a Low-Income and a High-Income Setting

Affiliations

Genomic Characterization of Skin and Soft Tissue Streptococcus pyogenes Isolates from a Low-Income and a High-Income Setting

Saikou Y Bah et al. mSphere. .

Abstract

Streptococcus pyogenes is a leading cause of human morbidity and mortality, especially in resource-limited settings. The development of a vaccine against S. pyogenes is a global health priority to reduce the burden of postinfection rheumatic heart disease. To support this, molecular characterization of circulating S. pyogenes isolates is needed. We performed whole-genome analyses of S. pyogenes isolates from skin and soft tissue infections in Sukuta, The Gambia, a low-income country (LIC) in West Africa where there is a high burden of such infections. To act as a comparator to these LIC isolates, skin infection isolates from Sheffield, United Kingdom (a high-income country [HIC]), were also sequenced. The LIC isolates from The Gambia were genetically more diverse (46 emm types in 107 isolates) than the HIC isolates from Sheffield (23 emm types in 142 isolates), with only 7 overlapping emm types. Other molecular markers were shared, including a high prevalence of the skin infection-associated emm pattern D and the variable fibronectin-collagen-T antigen (FCT) types FCT-3 and FCT-4. Fewer of the Gambian LIC isolates carried prophage-associated superantigens (64%) and DNases (26%) than did the Sheffield HIC isolates (99% and 95%, respectively). We also identified streptococcin genes unique to 36% of the Gambian LIC isolates and a higher prevalence (48%) of glucuronic acid utilization pathway genes in the Gambian LIC isolates than in the Sheffield HIC isolates (26%). Comparison to a wider collection of HIC and LIC isolate genomes supported our findings of differing emm diversity and prevalence of bacterial factors. Our study provides insight into the genetics of LIC isolates and how they compare to HIC isolates. IMPORTANCE The global burden of rheumatic heart disease (RHD) has triggered a World Health Organization response to drive forward development of a vaccine against the causative human pathogen Streptococcus pyogenes. This burden stems primarily from low- and middle-income settings where there are high levels of S. pyogenes skin and soft tissue infections, which can lead to RHD. Our study provides much needed whole-genome-based molecular characterization of isolates causing skin infections in Sukuta, The Gambia, a low-income country (LIC) in West Africa where infection and RHD rates are high. Although we identified a greater level of diversity in these LIC isolates than in isolates from Sheffield, United Kingdom (a high-income country), there were some shared features. There were also some features that differed by geographical region, warranting further investigation into their contribution to infection. Our study has also contributed data essential for the development of a vaccine that would target geographically relevant strains.

Keywords: FCT region; bacterial genomics; emm types; group A Streptococcus; skin infection; vaccine.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

FIG 1
FIG 1
Distribution of emm type, pattern, and cluster differs by site. (A) Frequency of emm types found in the wider LIC genome collection and Gambian isolate genomes combined (LIC; orange) compared to the that found in the wider HIC genome collection and Sheffield isolate genomes combined (HIC; black) (B) An emm pattern of A-C, D, or E was assigned to 98/107 Gambian LIC isolates (patterns for the remaining 9 were undetermined) and all 142 Sheffield HIC isolates. The same emm patterns were also assigned to 766/781 (15 were undetermined) LIC isolates from Gambia, Kenya, and Fiji (All LIC) and all 4,820 HIC isolates from Sheffield, BSAC (England/Wales), PHE (United Kingdom), and ABCs (United States) (All HIC). (C) An emm cluster was also assigned to 98/107 Gambian LIC isolates (the remaining 9 were excluded) and all 142 Sheffield HIC isolates, as well as to all LIC and all HIC isolates. Pie charts represent the percentage of isolates associated with each pattern/cluster.
FIG 2
FIG 2
Phylogenetic analysis of the 107 Gambian LIC isolate genomes. A maximum likelihood phylogeny was constructed from the core gene alignment (1,382,412 bp) using RAxML (51) with 100 bootstraps. Isolates clustered by emm type except for those indicated, whereby two lineages were represented by a single emm genotype: star, emm25; filled square, emm65; open square, emm85; open circle, emm89; filled circle, emm209. Also shown is the presence (black)/absence (white) of the superantigen genes speA, speC, speG, speH to speM, speQ, speR, ssa, and smeZ and DNase genes spd1 and spd3; four other DNase genes (sda1, sda2, sdn, and spd4) were tested for but were not found in any isolate. In all cases except one (red dot), speA was located within the prophage-like element Φ10394.2. One isolate had a gene that appeared to be a fusion of 5′ speK and 3′ speM (red triangles). Antimicrobial resistance genes (AMR) tetM, tetL, dfrG, and dfrK were also identified in some isolates (white, absent; black, present). The positivity for serum opacity factor (sof) is also shown, although for one emm55 isolate, this gene would produce a truncated variant of serum opacity factor (gray). The scale bar represents substitutions per site. emm types are colored for easy visualization, and type numbers are also given.
FIG 3
FIG 3
Phylogenetic analysis of the 142 Sheffield HIC isolate genomes. A maximum likelihood phylogenetic tree was generated with the core gene alignment (1,185,813 bp) by using RAxML (51) with 100 bootstraps. All isolates clustered by emm type. The presence (black)/absence (white) of superantigen genes speA, speC, speG, speH to speM, speQ, speR, ssa, and smeZ and DNase genes sda1, sda2, sdn, spd1, spd3, and spd4 is indicated. Antimicrobial resistance genes (AMR) tetM, ermA, and ermB were also identified in some isolates (white, absent; black, present). The positivity for serum opacity factor (sof) is also shown, but in all emm12 types, this gene would produce a truncated variation of serum opacity factor (gray). The scale bar represents substitutions per site. emm types are colored for easy visualization, and type numbers are also given.
FIG 4
FIG 4
Comparison of superantigen and DNase gene carriage in LIC isolates and HIC isolates. The proportions of the Gambian LIC isolates (gray bars) and the Sheffield HIC isolates (white bars) as well as the wider LIC isolates (Kenya, Fiji, Gambia; n = 781) (orange bars) and the wider HIC isolates (BSAC, PHE, ABCs, Sheffield; n = 4,820) (black bars) carrying the respective genes were determined by BLAST analysis and mapping.
FIG 5
FIG 5
Arrangement of genes in the Mga regulon of Gambian LIC and Sheffield HIC isolates. The genes within the mga regulon for each isolate were determined, and Mga regulon type I to VI was assigned. The majority of emm types in both the Sheffield HIC isolates and the Gambian LIC isolates were type I, with the M-like protein genes mrp and enn flanking the M protein gene emm. The previously assigned emm pattern A-C, D, or E (based on the emm type) is also given. The streptococcal inhibitor of complement (sic) gene was identified only in Sheffield HIC emm1 isolates, and the distantly related to sic (drs) gene was found only in Sheffield HIC emm12 isolates. The gene pgs encodes Pgs, a 15.5-kDa protein of unknown function (27).
FIG 6
FIG 6
FCT arrangement patterns identified in Gambian LIC and Sheffield HIC isolates. FCT regions were extracted from de novo assemblies, and the FCT type was assigned based on the predicted function and order of genes within the extracted region. The emm types of isolates with each FCT type are shown for Sheffield HIC and Gambian LIC isolates. A new FCT region was identified (FCT10) as similar to that of FCT5 but with an additional gene encoding for a fibronectin binding protein after the sortase genes. For all emm types, there was at least one isolate with a designated FCT type in a single contiguous region. The only exception to this was emm118 (*), where the FCT was estimated to be FCT4 and the new FCT10 for each of the two isolates as the FCT region was split over two contigs. In FCT1, transposases were found in Sheffield HIC emm6 and emm108, and in FCT9, transposases were found in Sheffield HIC emm75 and Gambian LIC emm4. fbp, fibronectin binding protein; cwp, cell wall protein; ap, ancillary protein; trp, transposase.

References

    1. Walker MJ, Barnett TC, McArthur JD, Cole JN, Gillen CM, Henningham A, Sriprakash KS, Sanderson-Smith ML, Nizet V. 2014. Disease manifestations and pathogenic mechanisms of group A Streptococcus. Clin Microbiol Rev 27:264–301. doi:10.1128/CMR.00101-13. - DOI - PMC - PubMed
    1. Watkins DA, Johnson CO, Colquhoun SM, Karthikeyan G, Beaton A, Bukhman G, Forouzanfar MH, Longenecker CT, Mayosi BM, Mensah GA, Nascimento BR, Ribeiro ALP, Sable CA, Steer AC, Naghavi M, Mokdad AH, Murray CJL, Vos T, Carapetis JR, Roth GA. 2017. Global, regional, and national burden of rheumatic heart disease, 1990-2015. N Engl J Med 377:713–722. doi:10.1056/NEJMoa1603693. - DOI - PubMed
    1. Vekemans J, Gouvea-Reis F, Kim JH, Excler J-L, Smeesters PR, O'Brien KL, Van Beneden CA, Steer AC, Carapetis JR, Kaslow DC. 2019. The path to group A Streptococcus vaccines: World Health Organization research and development technology roadmap and preferred product characteristics. Clin Infect Dis 69:877–883. doi:10.1093/cid/ciy1143. - DOI - PMC - PubMed
    1. Dale JB, Penfound TA, Chiang EY, Walton WJ. 2011. New 30-valent M protein-based vaccine evokes cross-opsonic antibodies against non-vaccine serotypes of group A streptococci. Vaccine 29:8175–8178. doi:10.1016/j.vaccine.2011.09.005. - DOI - PMC - PubMed
    1. Pastural É, McNeil SA, MacKinnon-Cameron D, Ye L, Langley JM, Stewart R, Martin LH, Hurley GJ, Salehi S, Penfound TA, Halperin S, Dale JB. 2020. Safety and immunogenicity of a 30-valent M protein-based group a streptococcal vaccine in healthy adult volunteers: a randomized, controlled phase I study. Vaccine 38:1384–1392. doi:10.1016/j.vaccine.2019.12.005. - DOI - PubMed

Publication types

Substances