Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May;8(5):mgen000818.
doi: 10.1099/mgen.0.000818.

Enabling genomic island prediction and comparison in multiple genomes to investigate bacterial evolution and outbreaks

Affiliations

Enabling genomic island prediction and comparison in multiple genomes to investigate bacterial evolution and outbreaks

Claire Bertelli et al. Microb Genom. 2022 May.

Abstract

Outbreaks of virulent and/or drug-resistant bacteria have a significant impact on human health and major economic consequences. Genomic islands (GIs; defined as clusters of genes of probable horizontal origin) are of high interest because they disproportionately encode virulence factors, some antimicrobial-resistance (AMR) genes, and other adaptations of medical or environmental interest. While microbial genome sequencing has become rapid and inexpensive, current computational methods for GI analysis are not amenable for rapid, accurate, user-friendly and scalable comparative analysis of sets of related genomes. To help fill this gap, we have developed IslandCompare, an open-source computational pipeline for GI prediction and comparison across several to hundreds of bacterial genomes. A dynamic and interactive visualization strategy displays a bacterial core-genome phylogeny, with bacterial genomes linearly displayed at the phylogenetic tree leaves. Genomes are overlaid with GI predictions and AMR determinants from the Comprehensive Antibiotic Resistance Database (CARD), and regions of similarity between the genomes are also displayed. GI predictions are performed using Sigi-HMM and IslandPath-DIMOB, the two most precise GI prediction tools based on nucleotide composition biases, as well as a novel blast-based consistency step to improve cross-genome prediction consistency. GIs across genomes sharing sequence similarity are grouped into clusters, further aiding comparative analysis and visualization of acquisition and loss of mobile GIs in specific sub-clades. IslandCompare is an open-source software that is containerized for local use, plus available via a user-friendly, web-based interface to allow direct use by bioinformaticians, biologists and clinicians (at https://islandcompare.ca).

Keywords: antimicrobial resistance; comparative genomics; genomic islands; interactive visualization; web server.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
The IslandCompare workflow. IslandCompare integrates three parallel workflows for the prediction and comparison of GIs, the phyletic visualization of genomes, and the annotation and highlighting of genes with potentially interesting functions such as AMR genes. All results are stored in a standard gff3 format that is used either for interactive visualization in the IslandCompare user-friendly web interface, with images available for download, or for the export of data to conduct further GI analyses.
Fig. 2.
Fig. 2.
Comparative visualization of four P. aeruginosa genomes highlighting GIs and AMR determinants. (a) A phylogeny (left) indicates the relationship between the isolates in the analysis, with zoom-in functionality available. (a – right, b) GIs are represented as coloured blocks placed on a linear representation of the genome (linear white bars indicate genomes, with alignments between genomes shown in grey), with GIs coloured by (a) cluster or (b) prediction method. (c) The cluster view allows users to explore gene content within a given GI.
Fig. 3.
Fig. 3.
Counts of predicted GIs for each cluster and proportion predicted by the blast-based consistency module for a dataset of 166 L . monocytogenes genomes. Only clusters with more than one GI sequence predicted in the dataset are represented here (see Fig. S4a, b for all clusters).
Fig. 4.
Fig. 4.
GI prediction metrics across 86 genomes with known positive and negative GI regions. Predictions were made on the same set of genomes annotated by either NCBI or Prokka. All analyses were run for the GI results both with and without the blast-based consistency module results included.
Fig. 5.
Fig. 5.
Evaluation of a range of parameters for GI clustering in IslandCompare for datasets of (a) 166 L . monocytogenes genomes and (b) 40 P . aeruginosa genomes. Based on this analysis, a k-mer size of 16 and inflation value of 5.0 were selected for the Mash and MCL steps in the IslandCompare clustering pipeline, respectively.

Similar articles

Cited by

References

    1. Langille MGI, Laird MR, Hsiao WWL, Chiu TA, Eisen JA, et al. MicrobeDB: a locally maintainable database of microbial genomic sequences. Bioinformatics. 2012;28:1947–1948. doi: 10.1093/bioinformatics/bts273. - DOI - PMC - PubMed
    1. Freschi L, Bertelli C, Jeukens J, Moore MP, Kukavica-Ibrulj I, et al. Genomic characterisation of an international Pseudomonas aeruginosa reference panel indicates that the two major groups draw upon distinct mobile gene pools. FEMS Microbiol Lett. 2018;365:fny120. doi: 10.1093/femsle/fny120. - DOI - PubMed
    1. Hingston P, Chen J, Dhillon BK, Laing C, Bertelli C, et al. Genotypes associated with Listeria monocytogenes isolates displaying impaired or enhanced tolerances to cold, salt, acid, or desiccation stress. Front Microbiol. 2017;8:369. doi: 10.3389/fmicb.2017.00369. - DOI - PMC - PubMed
    1. Winsor GL, Griffiths EJ, Lo R, Dhillon BK, Shay JA, et al. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database. Nucleic Acids Res. 2016;44:D646–D653. doi: 10.1093/nar/gkv1227. - DOI - PMC - PubMed
    1. Dobrindt U, Hochhut B, Hentschel U, Hacker J. Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol. 2004;2:414–424. doi: 10.1038/nrmicro884. - DOI - PubMed

Publication types