Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

Jing Xu¹, Fuyi Li², André Leier³, Dongxu Xiang¹, Hsin-Hui Shen⁴, Tatiana T Marquez Lago⁵, Jian Li⁶, Dong-Jun Yu⁷, Jiangning Song⁸

Affiliations

¹ Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia.
² Department of Microbiology and Immunology, the Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia.
³ Department of Genetics, UAB School of Medicine, USA.
⁴ Department of Biochemistry & Molecular Biology and Department of Materials Science & Engineering, Monash University, Australia.
⁵ Departments of Genetics and Microbiology, UAB School of Medicine, USA.
⁶ Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Australia.
⁷ School of Computer Science and Engineering, Nanjing University of Science and Technology, China.
⁸ Monash Biomedicine Discovery Institute, Monash University, Australia.

PMID: 33774670
DOI: 10.1093/bib/bbab083

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

Jing Xu et al. Brief Bioinform. 2021.

. 2021 Sep 2;22(5):bbab083.

doi: 10.1093/bib/bbab083.

Authors

Jing Xu¹, Fuyi Li², André Leier³, Dongxu Xiang¹, Hsin-Hui Shen⁴, Tatiana T Marquez Lago⁵, Jian Li⁶, Dong-Jun Yu⁷, Jiangning Song⁸

Affiliations

¹ Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia.
² Department of Microbiology and Immunology, the Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia.
³ Department of Genetics, UAB School of Medicine, USA.
⁴ Department of Biochemistry & Molecular Biology and Department of Materials Science & Engineering, Monash University, Australia.
⁵ Departments of Genetics and Microbiology, UAB School of Medicine, USA.
⁶ Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Australia.
⁷ School of Computer Science and Engineering, Nanjing University of Science and Technology, China.
⁸ Monash Biomedicine Discovery Institute, Monash University, Australia.

PMID: 33774670
DOI: 10.1093/bib/bbab083

Abstract

Antimicrobial peptides (AMPs) are a unique and diverse group of molecules that play a crucial role in a myriad of biological processes and cellular functions. AMP-related studies have become increasingly popular in recent years due to antimicrobial resistance, which is becoming an emerging global concern. Systematic experimental identification of AMPs faces many difficulties due to the limitations of current methods. Given its significance, more than 30 computational methods have been developed for accurate prediction of AMPs. These approaches show high diversity in their data set size, data quality, core algorithms, feature extraction, feature selection techniques and evaluation strategies. Here, we provide a comprehensive survey on a variety of current approaches for AMP identification and point at the differences between these methods. In addition, we evaluate the predictive performance of the surveyed tools based on an independent test data set containing 1536 AMPs and 1536 non-AMPs. Furthermore, we construct six validation data sets based on six different common AMP databases and compare different computational methods based on these data sets. The results indicate that amPEPpy achieves the best predictive performance and outperforms the other compared methods. As the predictive performances are affected by the different data sets used by different methods, we additionally perform the 5-fold cross-validation test to benchmark different traditional machine learning methods on the same data set. These cross-validation results indicate that random forest, support vector machine and eXtreme Gradient Boosting achieve comparatively better performances than other machine learning methods and are often the algorithms of choice of multiple AMP prediction tools.

Keywords: antimicrobial peptides; bioinformatics; deep learning; feature engineering; machine learning; predictors.

PubMed Disclaimer

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
- Ovid Technologies, Inc.
- Silverchair Information Systems
Other Literature Sources
- scite Smart Citations
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

Affiliations

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

Authors

Affiliations

Abstract

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials