Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 8;49(D1):D644-D650.
doi: 10.1093/nar/gkaa821.

BacWGSTdb 2.0: a one-stop repository for bacterial whole-genome sequence typing and source tracking

Affiliations

BacWGSTdb 2.0: a one-stop repository for bacterial whole-genome sequence typing and source tracking

Ye Feng et al. Nucleic Acids Res. .

Abstract

An increasing prevalence of hospital acquired infections and foodborne illnesses caused by pathogenic and multidrug-resistant bacteria has stimulated a pressing need for benchtop computational techniques to rapidly and accurately classify bacteria from genomic sequence data, and based on that, to trace the source of infection. BacWGSTdb (http://bacdb.org/BacWGSTdb) is a free publicly accessible database we have developed for bacterial whole-genome sequence typing and source tracking. This database incorporates extensive resources for bacterial genome sequencing data and the corresponding metadata, combined with specialized bioinformatics tools that enable the systematic characterization of the bacterial isolates recovered from infections. Here, we present BacWGSTdb 2.0, which encompasses several major updates, including (i) the integration of the core genome multi-locus sequence typing (cgMLST) approach, which is highly scalable and appropriate for typing isolates belonging to different lineages; (ii) the addition of a multiple genome analysis module that can process dozens of user uploaded sequences in a batch mode; (iii) a new source tracking module for comparing user uploaded plasmid sequences to those deposited in the public databases; (iv) the number of species encompassed in BacWGSTdb 2.0 has increased from 9 to 20, which represents bacterial pathogens of medical importance; (v) a newly designed, user-friendly interface and a set of visualization tools for providing a convenient platform for users are also included. Overall, the updated BacWGSTdb 2.0 bears great utility in continuing to provide users, including epidemiologists, clinicians and bench scientists, with a one-stop solution to bacterial genome sequence analysis.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of the content and function modules of BacWGSTdb 2.0. ‘Infrastructure’ lists the public database and tools integrated in BacWGSTdb 2.0. ‘Browse’ functions to visualize and compare the genetic relationships among isolates deposited in BacWGSTdb 2.0. ‘Tools’ functions for whole genome sequence typing and source tracking based on user uploaded sequence(s). ‘Species’ represents 20 bacterial species currently supported by BacWGSTdb 2.0.
Figure 2.
Figure 2.
Screenshots of the updated web interface of the Single Genome Analysis module and the detailed outputs using a Salmonella enterica isolate as an example. After uploading the preassembled genomic sequence and setting the appropriate parameters, the analytical results return and can subsequently be classified into three sections: conventional seven-locus MLST, identification of AMR and virulence genes, and source tracking of plasmids and bacteria. In particular, the phylogenetic analysis revealed that the query isolate shows high degree of genetic relatedness in the database, suggesting that these isolates might have originated from the same source. The entire analysis process takes 3–5 min.
Figure 3.
Figure 3.
Screenshots of the updated web interface of the Multiple Genome Analysis module and the detailed outputs using nine S. enterica isolates as an example. The Results page lists for each of the uploaded genomes the conventional seven-locus MLST results and the predictions of AMR and virulence genes. In addition, the phylogenetic trees based on the SNP and cgMLST approaches both reveal that the query isolates did not involve an outbreak event, since they differ from one another by over 100 SNPs or cgMLST loci. The entire analysis process takes 5–8 min.
Figure 4.
Figure 4.
Screenshots of the updated Browse interface. Users can sort and select isolates based on their various attributes. For the selected isolates, phylogenetic trees following both the SNP and cgMLST approaches are provided.

Similar articles

Cited by

References

    1. Brachman P.S. Infectious diseases–past, present, and future. Int. J. Epidemiol. 2003; 32:684–686. - PubMed
    1. Hernando-Amado S., Coque T.M., Baquero F., Martinez J.L.. Defining and combating antibiotic resistance from One Health and Global Health perspectives. Nat. Microbiol. 2019; 4:1432–1442. - PubMed
    1. Alexander D.C., Hao W., Gilmour M.W., Zittermann S., Sarabia A., Melano R.G., Peralta A., Lombos M., Warren K., Amatnieks Y. et al. .. Escherichia coli O104:H4 infections and international travel. Emerg. Infect. Dis. 2012; 18:473–476. - PMC - PubMed
    1. Scott P., Deye G., Srinivasan A., Murray C., Moran K., Hulten E., Fishbain J., Craft D., Riddell S., Lindler L. et al. .. An outbreak of multidrug-resistant Acinetobacter baumannii-calcoaceticus complex infection in the US military health care system associated with military operations in Iraq. Clin. Infect. Dis. 2007; 44:1577–1584. - PubMed
    1. Quainoo S., Coolen J.P.M., van Hijum S., Huynen M.A., Melchers W.J.G., van Schaik W., Wertheim H.F.L.. Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis. Clin. Microbiol. Rev. 2017; 30:1015–1063. - PMC - PubMed

Publication types