A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets
- PMID: 21504223
- DOI: 10.1021/tx200099j
Abstract
The human ether-a-go-go related gene (hERG) potassium ion channel plays a central role in cardiotoxicity and is therefore a key target in preclinical drug discovery toxicity screening. The PubChem hERG Bioassay data set, composed of 1668 compounds, was used to construct an in silico screening model. Trial models were constructed from a descriptor pool composed of 4D fingerprints (4D-FP) and traditional 2D and 3D VolSurf-like molecular descriptors. A final binary classification model was constructed with a support vector machine (SVM). The resulting model was then validated against the PubChem hERG Bioassay data set (AID 376) and an external hERG data set by evaluating its ability to distinguish hERG blockers from nonblockers. The external data set (the test set) consisted of 356 compounds collected from the literature: 287 actives and 69 inactives. Four different sampling protocols and a 10-fold cross-validation analysis, used in the validation process to evaluate classification models, explored the impact of the active-inactive imbalance in the PubChem high-throughput data set. Four different data sets were explored, and the one employing Lipinski's rule-of-five coupled with measures of relative molecular lipophilicity performed best in both the 10-fold cross-validation of the training data set and the overall prediction accuracy on the external test sets. The linear SVM binary classification model building strategy was applied to different combinations of MOE (traditional 2D, "2½D", and 3D VolSurf-like) and 4D-FP molecular descriptors to further explore and refine previously proposed key descriptors, identify new significant features that contribute to the prediction of hERG toxicity, and construct the optimal SVM binary classification model from a shrunken descriptor pool.
The accuracy, sensitivity, and specificity of the best model determined from 10-fold cross-validation are 95, 90, and 96%, respectively; the overall accuracy is near 87% for the external set. The models constructed in this study demonstrate (i) robustness, based on accuracy across the structural diversity of the training set; (ii) the ability to predict a compound's "predisposition" to block hERG ion channels; and (iii) the ability to define and illustrate structural features that can be overlaid onto chemical structures to aid the 3D structure-activity interpretation of the hERG blocking effect.
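The three reported metrics can be related to confusion-matrix counts with a minimal Python sketch. This is an illustration only, not the authors' code: the names `ConfusionCounts` and `binary_metrics` are hypothetical, and the only figures taken from the abstract are the external set sizes (287 actives, 69 inactives). The baseline shown, a trivial rule that labels every compound a blocker, is an assumption introduced to show why accuracy alone can mislead on an imbalanced set.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary blocker/nonblocker classifier (hypothetical helper)."""
    tp: int  # blockers predicted as blockers
    fn: int  # blockers predicted as nonblockers
    tn: int  # nonblockers predicted as nonblockers
    fp: int  # nonblockers predicted as blockers


def binary_metrics(c: ConfusionCounts) -> dict:
    """Return accuracy, sensitivity, and specificity as fractions in [0, 1]."""
    total = c.tp + c.fn + c.tn + c.fp
    positives = c.tp + c.fn
    negatives = c.tn + c.fp
    return {
        "accuracy": (c.tp + c.tn) / total,
        # Sensitivity: fraction of true blockers that are recovered.
        "sensitivity": c.tp / positives if positives else float("nan"),
        # Specificity: fraction of true nonblockers that are recovered.
        "specificity": c.tn / negatives if negatives else float("nan"),
    }


# Assumed baseline (not from the paper): call every external compound a blocker.
# Class sizes are the 287 actives and 69 inactives quoted in the abstract.
baseline = binary_metrics(ConfusionCounts(tp=287, fn=0, tn=0, fp=69))

if __name__ == "__main__":
    for name, value in baseline.items():
        print(f"{name}: {value:.3f}")
```

Under this assumed baseline the accuracy is 287/356, roughly 0.81, with a specificity of zero, which is why sensitivity and specificity are worth reading alongside accuracy when the classes are this imbalanced.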