Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy
- PMID: 29893128
- DOI: 10.1021/acs.jproteome.8b00148
Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy
Abstract
Cell-penetrating peptides (CPPs) can enter cells as a variety of biologically active conjugates and have various biomedical applications. To offset the cost and effort of designing novel CPPs in laboratories, computational methods are necessitated to identify candidate CPPs before in vitro experimental studies. We developed a two-layer prediction framework called machine-learning-based prediction of cell-penetrating peptides (MLCPPs). The first-layer predicts whether a given peptide is a CPP or non-CPP, whereas the second-layer predicts the uptake efficiency of the predicted CPPs. To construct a two-layer prediction framework, we employed four different machine-learning methods and five different compositions including amino acid composition (AAC), dipeptide composition, amino acid index, composition-transition-distribution, and physicochemical properties (PCPs). In the first layer, hybrid features (combination of AAC and PCP) and extremely randomized tree outperformed state-of-the-art predictors in CPP prediction with an accuracy of 0.896 when tested on independent data sets, whereas in the second layer, hybrid features obtained through feature selection protocol and random forest produced an accuracy of 0.725 that is better than state-of-the-art predictors. We anticipate that our method MLCPP will become a valuable tool for predicting CPPs and their uptake efficiency and might facilitate hypothesis-driven experimental design. The MLCPP server interface along with the benchmarking and independent data sets are freely accessible at www.thegleelab.org/MLCPP .
Keywords: cell-penetrating peptides; extremely randomized tree; feature selection; machine learning; random forest; uptake efficiency.
Similar articles
-
KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides.J Proteome Res. 2018 Sep 7;17(9):3214-3222. doi: 10.1021/acs.jproteome.8b00322. Epub 2018 Aug 13. J Proteome Res. 2018. PMID: 30032609
-
MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28. J Mol Biol. 2022. PMID: 35662468
-
CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency.J Proteome Res. 2017 May 5;16(5):2044-2053. doi: 10.1021/acs.jproteome.7b00019. Epub 2017 Apr 26. J Proteome Res. 2017. PMID: 28436664
-
The Development of Machine Learning Methods in Cell-Penetrating Peptides Identification: A Brief Review.Curr Drug Metab. 2019;20(3):217-223. doi: 10.2174/1389200219666181010114750. Curr Drug Metab. 2019. PMID: 30317992 Review.
-
Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.Brief Bioinform. 2020 Mar 23;21(2):408-420. doi: 10.1093/bib/bby124. Brief Bioinform. 2020. PMID: 30649170 Review.
Cited by
-
MASS: predict the global qualities of individual protein models using random forests and novel statistical potentials.BMC Bioinformatics. 2020 Jul 6;21(Suppl 4):246. doi: 10.1186/s12859-020-3383-3. BMC Bioinformatics. 2020. PMID: 32631256 Free PMC article. Review.
-
The Spectrum of Design Solutions for Improving the Activity-Selectivity Product of Peptide Antibiotics against Multidrug-Resistant Bacteria and Prostate Cancer PC-3 Cells.Molecules. 2020 Aug 1;25(15):3526. doi: 10.3390/molecules25153526. Molecules. 2020. PMID: 32752241 Free PMC article.
-
In-Cell Penetration Selection-Mass Spectrometry Produces Noncanonical Peptides for Antisense Delivery.ACS Chem Biol. 2023 Mar 17;18(3):615-628. doi: 10.1021/acschembio.2c00920. Epub 2023 Mar 1. ACS Chem Biol. 2023. PMID: 36857503 Free PMC article.
-
Viral Prefusion Targeting Using Entry Inhibitor Peptides: The Case of SARS-CoV-2 and Influenza A virus.Int J Pept Res Ther. 2022;28(1):42. doi: 10.1007/s10989-021-10357-y. Epub 2022 Jan 3. Int J Pept Res Ther. 2022. PMID: 35002586 Free PMC article.
-
Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.Mol Ther Nucleic Acids. 2019 Jun 7;16:733-744. doi: 10.1016/j.omtn.2019.04.019. Epub 2019 Apr 30. Mol Ther Nucleic Acids. 2019. PMID: 31146255 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources