Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents
- PMID: 15446820
- DOI: 10.1021/ci049869h
Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents
Abstract
Statistical-learning methods have been developed for facilitating the prediction of pharmacokinetic and toxicological properties of chemical agents. These methods employ a variety of molecular descriptors to characterize structural and physicochemical properties of molecules. Some of these descriptors are specifically designed for the study of a particular type of properties or agents, and their use for other properties or agents might generate noise and affect the prediction accuracy of a statistical learning system. This work examines to what extent the reduction of this noise can improve the prediction accuracy of a statistical learning system. A feature selection method, recursive feature elimination (RFE), is used to automatically select molecular descriptors for support vector machines (SVM) prediction of P-glycoprotein substrates (P-gp), human intestinal absorption of molecules (HIA), and agents that cause torsades de pointes (TdP), a rare but serious side effect. RFE significantly reduces the number of descriptors for each of these properties thereby increasing the computational speed for their classification. The SVM prediction accuracies of P-gp and HIA are substantially increased and that of TdP remains unchanged by RFE. These prediction accuracies are comparable to those of earlier studies derived from a selective set of descriptors. Our study suggests that molecular feature selection is useful for improving the speed and, in some cases, the accuracy of statistical learning methods for the prediction of pharmacokinetic and toxicological properties of chemical agents.
Copyright 2004 American Chemical Society
Similar articles
-
An integrated scheme for feature selection and parameter setting in the support vector machine modeling and its application to the prediction of pharmacokinetic properties of drugs.Artif Intell Med. 2009 Jun;46(2):155-63. doi: 10.1016/j.artmed.2008.07.001. Epub 2008 Aug 12. Artif Intell Med. 2009. PMID: 18701266
-
Prediction of P-glycoprotein substrates by a support vector machine approach.J Chem Inf Comput Sci. 2004 Jul-Aug;44(4):1497-505. doi: 10.1021/ci049971e. J Chem Inf Comput Sci. 2004. PMID: 15272858
-
Effect of selection of molecular descriptors on the prediction of blood-brain barrier penetrating and nonpenetrating agents by statistical learning methods.J Chem Inf Model. 2005 Sep-Oct;45(5):1376-84. doi: 10.1021/ci050135u. J Chem Inf Model. 2005. PMID: 16180914
-
Prediction of genotoxicity of chemical compounds by statistical learning methods.Chem Res Toxicol. 2005 Jun;18(6):1071-80. doi: 10.1021/tx049652h. Chem Res Toxicol. 2005. PMID: 15962942
-
Prediction of compounds with specific pharmacodynamic, pharmacokinetic or toxicological property by statistical learning methods.Mini Rev Med Chem. 2006 Apr;6(4):449-59. doi: 10.2174/138955706776361501. Mini Rev Med Chem. 2006. PMID: 16613581 Review.
Cited by
-
In Silico Prediction of Chemical Toxicity for Drug Design Using Machine Learning Methods and Structural Alerts.Front Chem. 2018 Feb 20;6:30. doi: 10.3389/fchem.2018.00030. eCollection 2018. Front Chem. 2018. PMID: 29515993 Free PMC article. Review.
-
Synthesis, single crystal (XRD), Hirshfeld surface analysis, computational study (DFT) and molecular docking studies of (E)-4-((2-hydroxy-3,5-diiodobenzylidene)amino)-N-(pyrimidine)-2-yl) benzenesulfonamide.Heliyon. 2021 Aug 6;7(8):e07724. doi: 10.1016/j.heliyon.2021.e07724. eCollection 2021 Aug. Heliyon. 2021. PMID: 34458601 Free PMC article.
-
Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning.Molecules. 2021 Apr 23;26(9):2474. doi: 10.3390/molecules26092474. Molecules. 2021. PMID: 33922736 Free PMC article.
-
Prediction of Molecular Properties Using Molecular Topographic Map.Molecules. 2021 Jul 24;26(15):4475. doi: 10.3390/molecules26154475. Molecules. 2021. PMID: 34361624 Free PMC article.
-
ADMET evaluation in drug discovery. 12. Development of binary classification models for prediction of hERG potassium channel blockage.Mol Pharm. 2012 Apr 2;9(4):996-1010. doi: 10.1021/mp300023x. Epub 2012 Mar 16. Mol Pharm. 2012. PMID: 22380484 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous