Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 7;12(4):725.
doi: 10.3390/antibiotics12040725.

CalcAMP: A New Machine Learning Model for the Accurate Prediction of Antimicrobial Activity of Peptides

Affiliations

CalcAMP: A New Machine Learning Model for the Accurate Prediction of Antimicrobial Activity of Peptides

Colin Bournez et al. Antibiotics (Basel). .

Abstract

To combat infection by microorganisms host organisms possess a primary arsenal via the innate immune system. Among them are defense peptides with the ability to target a wide range of pathogenic organisms, including bacteria, viruses, parasites, and fungi. Here, we present the development of a novel machine learning model capable of predicting the activity of antimicrobial peptides (AMPs), CalcAMP. AMPs, in particular short ones (<35 amino acids), can become an effective solution to face the multi-drug resistance issue arising worldwide. Whereas finding potent AMPs through classical wet-lab techniques is still a long and expensive process, a machine learning model can be useful to help researchers to rapidly identify whether peptides present potential or not. Our prediction model is based on a new data set constructed from the available public data on AMPs and experimental antimicrobial activities. CalcAMP can predict activity against both Gram-positive and Gram-negative bacteria. Different features either concerning general physicochemical properties or sequence composition have been assessed to retrieve higher prediction accuracy. CalcAMP can be used as an promising prediction asset to identify short AMPs among given peptide sequences.

Keywords: antimicrobial peptides; antimicrobial resistance; artificial intelligence; bacteria; drug discovery; machine learning.

PubMed Disclaimer

Conflict of interest statement

Madam Therapeutics is a commercial company that aims to put these type of AMPs on the market.

Figures

Figure 10
Figure 10
Different examples of peptides with their reported experimental activities and their label.
Figure 1
Figure 1
Overview of the variety of 3D AMP structures. (a) Crotamine from PDB code 1Z99; (b) fowlcidin from PDB code 2AMN; (c) circullin B from PDB code 2ERI; (d) LEAP-2 from PDB code 2L1Q. Yellow bonds represent disulfide bridges.
Figure 2
Figure 2
Number of experimental assays retrieved for the top five species by category (a). Venn diagram showing the distribution of peptides per category (b).
Figure 3
Figure 3
Comparison of amino acid composition (a), global net charge (b), and molecular weight (c) between AMPs and Non-AMPs.
Figure 4
Figure 4
Matrix of labels for the common peptides between Gram− and Gram+ categories.
Figure 5
Figure 5
PCA (a) and t-SNE (b) projections of physicochemical descriptors between common peptides of Gram+ and Gram− categories. In blue are peptides are labelled as AMP in both categories, in red are peptides are labelled as Non-AMP in both categories, in green are peptides labelled AMP for Gram+ and Non-AMP for Gram−, and in black is the opposite.
Figure 6
Figure 6
Top 20 features importance plot and their impact on the external test set prediction for CalcAMP+ (a) and CalcAMP- (b). Shown are physicochemical properties such as molecular weight (MW), Charge, or Length. However, the majority of the top features were from CTD descriptors. They are identifiable by their names beginning with an underscore character, followed by the property and finally the component characteristics: composition (C), transition (T), and distribution (D).
Figure 7
Figure 7
Confusion matrix for CalcAMP+ model prediction versus CalcAMP- (a) and AMP probability score by predicted class (b).
Figure 8
Figure 8
Receiver operator characteristic (ROC) curves of the different AMP classifiers and their area under the curve score.
Figure 9
Figure 9
Top 20 feature importance plot and their impact on the external test set prediction for CalcAFP.
Figure 11
Figure 11
PCA projections of the training set (blue) and external test set (red) for Gram+ (a) and Gram− (b) categories.

References

    1. Prestinaci F., Pezzotti P., Pantosti A. Antimicrobial Resistance: A Global Multifaceted Phenomenon. Pathog. Glob. Health. 2015;109:309. doi: 10.1179/2047773215Y.0000000030. - DOI - PMC - PubMed
    1. Tacconelli E., Carrara E., Savoldi A., Harbarth S., Mendelson M., Monnet D.L., Pulcini C., Kahlmeter G., Kluytmans J., Carmeli Y., et al. Discovery, Research, and Development of New Antibiotics: The WHO Priority List of Antibiotic-Resistant Bacteria and Tuberculosis. Lancet Infect. Dis. 2018;18:318–327. doi: 10.1016/S1473-3099(17)30753-3. - DOI - PubMed
    1. Eliopoulos G.M., Meka V.G., Gold H.S. Antimicrobial Resistance to Linezolid. Clin. Infect. Dis. 2004;39:1010–1015. doi: 10.1086/423841. - DOI - PubMed
    1. Munoz-Price L.S., Lolans K., Quinn J.P. Emergence of Resistance to Daptomycin during Treatment of Vancomycin-Resistant Enterococcus Faecalis Infection. Clin. Infect. Dis. 2005;41:565–566. doi: 10.1086/432121. - DOI - PubMed
    1. WHO Publishes List of Bacteria for Which New Antibiotics Are Urgently Needed. [(accessed on 9 November 2022)]. Available online: https://www.who.int/news/item/27-02-2017-who-publishes-list-of-bacteria-....

LinkOut - more resources