Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun 5:20:882-894.
doi: 10.1016/j.omtn.2020.05.006. Epub 2020 May 12.

Deep-AmPEP30: Improve Short Antimicrobial Peptides Prediction with Deep Learning

Affiliations

Deep-AmPEP30: Improve Short Antimicrobial Peptides Prediction with Deep Learning

Jielu Yan et al. Mol Ther Nucleic Acids. .

Abstract

Antimicrobial peptides (AMPs) are a valuable source of antimicrobial agents and a potential solution to the multi-drug resistance problem. In particular, short-length AMPs have been shown to have enhanced antimicrobial activities, higher stability, and lower toxicity to human cells. We present a short-length (≤30 aa) AMP prediction method, Deep-AmPEP30, developed based on an optimal feature set of PseKRAAC reduced amino acids composition and convolutional neural network. On a balanced benchmark dataset of 188 samples, Deep-AmPEP30 yields an improved performance of 77% in accuracy, 85% in the area under the receiver operating characteristic curve (AUC-ROC), and 85% in area under the precision-recall curve (AUC-PR) over existing machine learning-based methods. To demonstrate its power, we screened the genome sequence of Candida glabrata-a gut commensal fungus expected to interact with and/or inhibit other microbes in the gut-for potential AMPs and identified a peptide of 20 aa (P3, FWELWKFLKSLWSIFPRRRP) with strong anti-bacteria activity against Bacillus subtilis and Vibrio parahaemolyticus. The potency of the peptide is remarkably comparable to that of ampicillin. Therefore, Deep-AmPEP30 is a promising prediction tool to identify short-length AMPs from genomic sequences for drug discovery. Our method is available at https://cbbio.cis.um.edu.mo/AxPEP for both individual sequence prediction and genome screening for AMPs.

Keywords: AmPEP; AxPEP; Candida glabrata; ampicillin; antimicrobial peptide; convolutional neural network; drug discovery; machine learning; reduced amino acid composition.

PubMed Disclaimer

Figures

None
Graphical abstract
Figure 1
Figure 1
Size Effect of the Train Dataset on Model Performance
Figure 2
Figure 2
Performance of AMP Classifiers (A) Receiver operator characteristic curves of different AMP classifiers and (B) their run time performances on the benchmark dataset.
Figure 3
Figure 3
Anti-Bacterial Effect of Three Top-Ranked Predicted AMPs against Four Different Bacteria Species Growth assay of Bacillus subtilis, Vibrio parahaemolyticus, Pseudomonas aeruginosa, and Escherichia coli in the absence (H2O) or presence of P3, P10, P26, and a control peptide (Pcontrol) that is known to have no anti-bacterial effect. Ampicillin was used as a positive control. Growth of bacteria was measured by absorbance at OD600 over time. The average of three independent experiments is presented. Treatment showing an inhibitory effect against the assayed bacteria is highlighted by a red box. A pink box indicates a subtle but significant (e.g., consistent in all three biological repeats) effect.
Figure 4
Figure 4
The Architecture of Our CNN-Based Classifier for Short AMP Prediction The model accepts a feature vector of N elements as input. First, the data values are normalized using a batch size of 64; then, the input is transferred into convoluted features by two convolutional layers and two maximum pooling layers. Each convolutional layer applies 128 kernels using a kernel size of 3 × 1 with stride 1, while each maximum pooling layer pools together data using a kernel size of 2 × 1 with stride 2. A dropout rate of 20% is applied in the maximum pooling step to prevent overfitting. Finally, all convoluted features are flattened and fed into a fully connected neural network with 10 hidden nodes and 1 output node. The rectified linear function (ReLU) is used as the activation function in the convolutional layer and by the hidden nodes, but the sigmoid function is used by the output node.

Similar articles

Cited by

References

    1. Hancock R.E.W., Sahl H.G. Antimicrobial and host-defense peptides as new anti-infective therapeutic strategies. Nat. Biotechnol. 2006;24:1551–1557. - PubMed
    1. Afacan N.J., Yeung A.T.Y., Pena O.M., Hancock R.E.W. Therapeutic potential of host defense peptides in antibiotic-resistant infections. Curr. Pharm. Des. 2012;18:807–819. - PubMed
    1. Bechinger B., Gorr S.-U. Antimicrobial peptides: mechanisms of action and resistance. J. Dent. Res. 2017;96:254–260. - PMC - PubMed
    1. Ciumac D., Gong H., Hu X., Lu J.R. Membrane targeting cationic antimicrobial peptides. J. Colloid Interface Sci. 2019;537:163–185. - PubMed
    1. Scocchi M., Mardirossian M., Runti G., Benincasa M. Non-membrane permeabilizing modes of action of antimicrobial peptides on bacteria. Curr. Top. Med. Chem. 2016;16:76–88. - PubMed