Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jun;52(6):1871-6.
doi: 10.1128/JCM.00029-14. Epub 2014 Mar 19.

Deriving group A Streptococcus typing information from short-read whole-genome sequencing data

Affiliations

Deriving group A Streptococcus typing information from short-read whole-genome sequencing data

Taryn B T Athey et al. J Clin Microbiol. 2014 Jun.

Abstract

Typing of group A Streptococcus (GAS) is crucial for infection control and epidemiology. While whole-genome sequencing (WGS) is revolutionizing the way that bacterial organisms are typed, it is necessary to provide backward compatibility with currently used typing schemas to facilitate comparisons and understanding of epidemiological trends. Here, we sequenced the genomes of 191 GAS isolates representing 42 different emm types and used bioinformatics tools to derive commonly used GAS typing information directly from the short-read WGS data. We show that emm typing and multilocus sequence typing can be achieved rapidly and efficiently using this approach, which also permits the determination of the presence or absence of genes associated with GAS tissue tropism. We also report on how the WGS data analysis was instrumental in identifying ambiguities present in the commonly used emm type database hosted by the U.S. Centers for Disease Control and Prevention.

PubMed Disclaimer

Figures

FIG 1
FIG 1
Schematics showing emm chromosomal arrangements in different emm GAS types. (A) Many emm types have an emm chromosomal arrangement in which gene emm is found downstream of mga and upstream of scpA. These emm types do not possess emm-like genes. However, some emm types may have other genes between emm and scpA (generically represented by the triangle). All GAS strains in our collection belonging to emm types with this type of emm chromosomal arrangement (listed to the right of scpA) were successfully assigned an emm type using the SRST2 WGS data mapping-based approach. (B) Several GAS emm types contain emm-like genes (mrp and/or enn) in addition to gene emm. We were able to correctly derive emm type information using the same WGS mapping-based approach for several GAS emm types (listed to the right of enn). However, the mapping-based strategy failed in cases where the CDC emm database contained sequences found also in emm-like genes. These emm types (listed below the red box in the gene schematics) were confounded by SRST2 with “emm” types whose sequences actually match either mrp or enn (listed below the green and blue boxes, respectively, in the schematics). The correct emm type could, however, be determined after de novo assembly of WGS data and BLAST and positional analysis. Dotted lines link emm sequences found in the CDC database that were identified in the same strain. Red arrows indicate annealing positions of primers 1 and 2 used in Sanger-based emm typing.
FIG 2
FIG 2
Schematics showing emm chromosomal arrangements in different strains. (A) Strain NGAS320, found to be nontypeable by Sanger sequencing (top) and strain NGAS300, representative of other emm83 GAS strains in our collection (bottom). Although nontypeable by Sanger-based emm typing, NGAS320 was identified as an emm83 by WGS-based emm typing. Further sequence analysis discovered a deletion of 1,390 bp in this strain that knocked out the annealing site of primer 1 used in traditional Sanger-based emm typing. The deletion was confirmed by PCR amplification using primers annealing to the 3′ end of mga and the 5′ end of scpA (indicated by the blue arrows). Red arrows indicate annealing positions of primers 1 and 2 used in Sanger-based emm typing. (B) Strain NGAS128. This GAS has an emm14 gene and was correctly typed by both Sanger-based and WGS-based emm typing. However, the enn gene of this strain possesses sequences identical to those found in the CDC emm database for emm51 (indicated by the blue box).

References

    1. Olsen RJ, Musser JM. 2010. Molecular pathogenesis of necrotizing fasciitis. Annu. Rev. Pathol. 5:1–31. 10.1146/annurev-pathol-121808-102135 - DOI - PubMed
    1. Lancefield RC. 1928. The antigenic complex of Streptococcus haemolyticus: I. Demonstration of a type-specific substance in extracts of Streptococcus haemolyticus. J. Exp. Med. 47:91–103 - PMC - PubMed
    1. Scott JR, Pulliam WM, Hollingshead SK, Fischetti VA. 1985. Relationship of M protein genes in group A streptococci. Proc. Natl. Acad. Sci. U. S. A. 82:1822–1826. 10.1073/pnas.82.6.1822 - DOI - PMC - PubMed
    1. Manjula BN, Acharya AS, Fairwell T, Fischetti VA. 1986. Antigenic domains of the streptococcal Pep M5 protein. Localization of epitopes crossreactive with type 6 M protein and identification of a hypervariable region of the M molecule. J. Exp. Med. 163:129–138 - PMC - PubMed
    1. Facklam R, Beall B, Efstratiou A, Fischetti V, Johnson D, Kaplan E, Kriz P, Lovgren M, Martin D, Schwartz B, Totolian A, Bessen D, Hollingshead S, Rubin F, Scott J, Tyrrell G. 1999. emm typing and validation of provisional M types for group A streptococci. Emerg. Infect. Dis. 5:247–253. 10.3201/eid0502.990209 - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources