A Novel Method to Predict Genomic Islands Based on Mean Shift Clustering Algorithm
- PMID: 26731657
- PMCID: PMC4711805
- DOI: 10.1371/journal.pone.0146352
Abstract
Genomic Islands (GIs) are regions of bacterial genomes that are acquired from other organisms through horizontal transfer. These regions are often responsible for important acquired adaptations of the bacteria, with great impact on their evolution and behavior. These adaptations are usually associated with pathogenicity, antibiotic resistance, degradation, and metabolism, so identifying such regions is of medical and industrial interest. For this reason, different approaches to genomic island prediction have been proposed. However, none of them is capable of precisely predicting the complete repertoire of GIs in a genome. The difficulty arises because the performance of different algorithms changes with the variety of nucleotide distributions found in different species. In this paper, we present a novel method to predict GIs that is built upon the mean shift clustering algorithm. It does not require any information regarding the number of clusters, and the bandwidth parameter is calculated automatically using a heuristic approach. The method was implemented in a new user-friendly tool named MSGIP (Mean Shift Genomic Island Predictor). Genomes of bacteria with GIs discussed in other papers were used to evaluate the proposed method. The application of this tool revealed the same GIs predicted by other methods as well as novel islands that had not been predicted before. A detailed investigation of the features related to typical GI elements inserted in these new regions confirmed the effectiveness of the method. Stand-alone and user-friendly versions of this new methodology are available at http://msgip.integrativebioinformatics.me.
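The core idea in the abstract, clustering without a preset number of clusters and with an automatically chosen bandwidth, can be illustrated with a minimal sketch. It is not the MSGIP implementation: the abstract does not say which genomic features or which bandwidth heuristic the tool uses, so the sketch assumes per-window GC content as the only feature, uses Silverman's rule of thumb as a stand-in heuristic, and treats the largest cluster as host background. The values in `GC_WINDOWS` and the helper names `estimate_bandwidth`, `mean_shift_1d`, and `candidate_regions` are hypothetical.

```python
import math
import statistics

# Hypothetical per-window GC fractions for a toy genome (not data from the paper).
GC_WINDOWS = [0.51, 0.49, 0.50, 0.52, 0.48, 0.50, 0.33, 0.31, 0.32,
              0.50, 0.49, 0.51, 0.50, 0.52, 0.48, 0.50]


def estimate_bandwidth(values):
    """Silverman's rule of thumb; an assumed stand-in for MSGIP's own heuristic."""
    return 1.06 * statistics.pstdev(values) * len(values) ** (-1 / 5)


def mean_shift_1d(values, bandwidth, tol=1e-6, max_iter=500):
    """Cluster 1-D values by moving each point uphill to a density mode."""
    modes, labels = [], []
    for x in values:
        for _ in range(max_iter):
            # Gaussian-weighted mean of all data points around the current position.
            weights = [math.exp(-0.5 * ((x - v) / bandwidth) ** 2) for v in values]
            shifted = sum(w * v for w, v in zip(weights, values)) / sum(weights)
            converged = abs(shifted - x) < tol
            x = shifted
            if converged:
                break
        # Points that converge within one bandwidth of a known mode share its label.
        for idx, mode in enumerate(modes):
            if abs(mode - x) < bandwidth:
                labels.append(idx)
                break
        else:
            modes.append(x)
            labels.append(len(modes) - 1)
    return modes, labels


def candidate_regions(labels):
    """Merge consecutive windows outside the largest cluster into (start, end) index pairs."""
    background = max(set(labels), key=labels.count)  # assumption: largest cluster is the host
    regions, start = [], None
    for i, label in enumerate(labels):
        if label != background and start is None:
            start = i
        elif label == background and start is not None:
            regions.append((start, i - 1))
            start = None
    if start is not None:
        regions.append((start, len(labels) - 1))
    return regions


bandwidth = estimate_bandwidth(GC_WINDOWS)
modes, labels = mean_shift_1d(GC_WINDOWS, bandwidth)
print(f"bandwidth={bandwidth:.3f}, clusters={len(modes)}")
print("candidate island windows:", candidate_regions(labels))
```

On this toy input the number of clusters is never supplied: two modes emerge from the data, and the low-GC windows at indices 6 through 8 are reported as one candidate region. A real analysis would likely need richer features and a validated bandwidth choice.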
Similar articles
- INDeGenIUS, a new method for high-throughput identification of specialized functional islands in completely sequenced organisms. J Biosci. 2010 Sep;35(3):351-64. doi: 10.1007/s12038-010-0040-4. PMID: 20826944
- GI-Cluster: Detecting genomic islands via consensus clustering on multiple features. J Bioinform Comput Biol. 2018 Jun;16(3):1840010. doi: 10.1142/S0219720018400103. Epub 2018 Feb 4. PMID: 29566638
- GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome. J Bioinform Comput Biol. 2016 Feb;14(1):1640003. doi: 10.1142/S0219720016400035. PMID: 26907990
- Detecting genomic islands using bioinformatics approaches. Nat Rev Microbiol. 2010 May;8(5):373-82. doi: 10.1038/nrmicro2350. PMID: 20395967 Review.
- Identification and characterization of pathogenicity and other genomic islands using base composition analyses. Future Microbiol. 2006 Oct;1(3):309-16. doi: 10.2217/17460913.1.3.309. PMID: 17661643 Review.
Cited by
- Deciphering pathogenicity and antibiotic resistance islands in methicillin-resistant Staphylococcus aureus genomes. Open Biol. 2017 Dec;7(12):170094. doi: 10.1098/rsob.170094. PMID: 29263245 Free PMC article.
- MeShClust v3.0: high-quality clustering of DNA sequences using the mean shift algorithm and alignment-free identity scores. BMC Genomics. 2022 Jun 6;23(1):423. doi: 10.1186/s12864-022-08619-0. PMID: 35668366 Free PMC article.
- Improved genomic island predictions with IslandPath-DIMOB. Bioinformatics. 2018 Jul 1;34(13):2161-2167. doi: 10.1093/bioinformatics/bty095. PMID: 29905770 Free PMC article.
- MeShClust: an intelligent tool for clustering DNA sequences. Nucleic Acids Res. 2018 Aug 21;46(14):e83. doi: 10.1093/nar/gky315. PMID: 29718317 Free PMC article.
- Comparative Analysis of Genomic Island Prediction Tools. Front Genet. 2018 Dec 12;9:619. doi: 10.3389/fgene.2018.00619. eCollection 2018. PMID: 30631340 Free PMC article.