Quantitative prediction of logk of peptides in high-performance liquid chromatography based on molecular descriptors by using the heuristic method and support vector machine
- PMID: 15554667
- DOI: 10.1021/ci049891a
Quantitative prediction of logk of peptides in high-performance liquid chromatography based on molecular descriptors by using the heuristic method and support vector machine
Abstract
A new method support vector machine (SVM) and the heuristic method (HM) were used to develop the nonlinear and linear models between the capacity factor (logk) and seven molecular descriptors of 75 peptides for the first time. The molecular descriptors representing the structural features of the compounds only included the constitutional and topological descriptors, which can be obtained easily without optimizing the structure of the molecule. The seven molecular descriptors selected by the heuristic method in CODESSA were used as inputs for SVM. The results obtained by SVM were compared with those obtained by the heuristic method. The prediction result of the SVM model is better than that of heuristic method. For the test set, a predictive correlation coefficient R = 0.9801 and root-mean-square error of 0.1523 were obtained. The prediction results are in very good agreement with the experimental values. But the linear model of the heuristic method is easier to understand and ready to use for a chemist. This paper provided a new and effective method for predicting the chromatography retention of peptides and some insight into the structural features which are related to the capacity factor of peptides.
Similar articles
-
Support vector machine and the heuristic method to predict the solubility of hydrocarbons in electrolyte.J Phys Chem A. 2005 Apr 21;109(15):3485-92. doi: 10.1021/jp0501446. J Phys Chem A. 2005. PMID: 16833686
-
Prediction of retention times for a large set of pesticides or toxicants based on support vector machine and the heuristic method.Toxicol Lett. 2007 Dec 10;175(1-3):136-44. doi: 10.1016/j.toxlet.2007.10.005. Epub 2007 Oct 18. Toxicol Lett. 2007. PMID: 18024009
-
Prediction of surface tension for common compounds based on novel methods using heuristic method and support vector machine.Talanta. 2007 Aug 15;73(1):147-56. doi: 10.1016/j.talanta.2007.03.037. Epub 2007 Mar 24. Talanta. 2007. PMID: 19071862
-
Accurate quantitative structure-property relationship model to predict the solubility of C60 in various solvents based on a novel approach using a least-squares support vector machine.J Phys Chem B. 2005 Nov 3;109(43):20565-71. doi: 10.1021/jp052223n. J Phys Chem B. 2005. PMID: 16853662
-
Analysis of peptides by high-performance liquid chromatography.Methods Enzymol. 1996;271:3-50. doi: 10.1016/s0076-6879(96)71003-0. Methods Enzymol. 1996. PMID: 8782547 Review. No abstract available.
Cited by
-
Prediction of standard Gibbs energies of the transfer of peptide anions from aqueous solution to nitrobenzene based on support vector machine and the heuristic method.J Comput Aided Mol Des. 2006 Jan;20(1):1-11. doi: 10.1007/s10822-005-9031-1. Epub 2006 Apr 19. J Comput Aided Mol Des. 2006. PMID: 16622797
-
Improved peptide elution time prediction for reversed-phase liquid chromatography-MS by incorporating peptide sequence information.Anal Chem. 2006 Jul 15;78(14):5026-39. doi: 10.1021/ac060143p. Anal Chem. 2006. PMID: 16841926 Free PMC article.
-
Prediction of the tissue/blood partition coefficients of organic compounds based on the molecular structure using least-squares support vector machines.J Comput Aided Mol Des. 2005 Jul;19(7):499-508. doi: 10.1007/s10822-005-9003-5. Epub 2005 Nov 30. J Comput Aided Mol Des. 2005. PMID: 16317501
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources