Review and comparative analysis of machine learning-based phage virion protein identification methods
- PMID: 32135196
- DOI: 10.1016/j.bbapap.2020.140406
Abstract
Phage virion protein (PVP) identification plays a key role in elucidating relationships between phages and hosts. Moreover, PVP identification can facilitate the design of related biochemical entities. Recently, several machine learning approaches have emerged for this purpose and have shown their potential. In this study, the proposed PVP identifiers are systematically reviewed, and the related algorithms and tools are comprehensively analyzed. We summarize the common framework of these PVP identifiers and construct our own novel identifiers based upon that framework. Furthermore, we focus on a performance comparison of all PVP identifiers using a training dataset and an independent dataset. Highlighting the pros and cons of these identifiers demonstrates that g-gap DPC (dipeptide composition) features are capable of representing the characteristics of PVPs. Moreover, SVM (support vector machine) proves to be the more effective classifier for distinguishing PVPs from non-PVPs.
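To make the g-gap DPC feature mentioned above more concrete, the following is a minimal sketch of one common way such a descriptor is computed: for a gap g, count every residue pair separated by g intervening positions and normalize the counts into a 400-dimensional frequency vector. The function name `g_gap_dpc`, the 20-letter alphabet constant, and the choice to skip pairs containing non-standard residues are assumptions made for illustration; they are not taken from the reviewed tools, whose exact normalization may differ.

```python
from itertools import product

# Assumption: the 20 standard amino acids, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
INDEX = {dp: i for i, dp in enumerate(DIPEPTIDES)}


def g_gap_dpc(sequence, g=0):
    """Return a 400-dimensional g-gap dipeptide composition vector.

    A pair is formed from residues at positions i and i + g + 1, so g = 0
    gives the ordinary (adjacent) dipeptide composition.
    """
    if g < 0:
        raise ValueError("g must be non-negative")
    seq = sequence.upper()
    counts = [0] * len(DIPEPTIDES)
    total = 0
    for i in range(len(seq) - g - 1):
        pair = seq[i] + seq[i + g + 1]
        idx = INDEX.get(pair)
        # Assumption: pairs with non-standard residues (e.g. X) are skipped.
        if idx is not None:
            counts[idx] += 1
            total += 1
    if total == 0:
        # Sequence too short (or no valid pairs): return an all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [c / total for c in counts]


if __name__ == "__main__":
    vec = g_gap_dpc("ACAC", g=1)
    print(len(vec), vec[INDEX["AA"]], vec[INDEX["CC"]])
```

Vectors of this kind, computed for one or several values of g, could then be passed to any standard classifier such as an SVM; which gap values and kernel settings work best is an empirical question that this sketch does not address.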
Keywords: G-gap DPC; Machine learning; Phage virion proteins; Support vector machine.
Copyright © 2020 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.