A general approach for developing system-specific functions to score protein-ligand docked complexes using support vector inductive logic programming
- PMID: 17910057
- DOI: 10.1002/prot.21782
A general approach for developing system-specific functions to score protein-ligand docked complexes using support vector inductive logic programming
Abstract
Despite the increased recent use of protein-ligand and protein-protein docking in the drug discovery process due to the increases in computational power, the difficulty of accurately ranking the binding affinities of a series of ligands or a series of proteins docked to a protein receptor remains largely unsolved. This problem is of major concern in lead optimization procedures and has lead to the development of scoring functions tailored to rank the binding affinities of a series of ligands to a specific system. However, such methods can take a long time to develop and their transferability to other systems remains open to question. Here we demonstrate that given a suitable amount of background information a new approach using support vector inductive logic programming (SVILP) can be used to produce system-specific scoring functions. Inductive logic programming (ILP) learns logic-based rules for a given dataset that can be used to describe properties of each member of the set in a qualitative manner. By combining ILP with support vector machine regression, a quantitative set of rules can be obtained. SVILP has previously been used in a biological context to examine datasets containing a series of singular molecular structures and properties. Here we describe the use of SVILP to produce binding affinity predictions of a series of ligands to a particular protein. We also for the first time examine the applicability of SVILP techniques to datasets consisting of protein-ligand complexes. Our results show that SVILP performs comparably with other state-of-the-art methods on five protein-ligand systems as judged by similar cross-validated squares of their correlation coefficients. A McNemar test comparing SVILP to CoMFA and CoMSIA across the five systems indicates our method to be significantly better on one occasion. The ability to graphically display and understand the SVILP-produced rules is demonstrated and this feature of ILP can be used to derive hypothesis for future ligand design in lead optimization procedures. The approach can readily be extended to evaluate the binding affinities of a series of protein-protein complexes.
(c) 2007 Wiley-Liss, Inc.
Similar articles
-
DOCKGROUND system of databases for protein recognition studies: unbound structures for docking.Proteins. 2007 Dec 1;69(4):845-51. doi: 10.1002/prot.21714. Proteins. 2007. PMID: 17803215
-
Docking and scoring protein complexes: CAPRI 3rd Edition.Proteins. 2007 Dec 1;69(4):704-18. doi: 10.1002/prot.21804. Proteins. 2007. PMID: 17918726
-
Protein-protein docking in CAPRI using ATTRACT to account for global and local flexibility.Proteins. 2007 Dec 1;69(4):774-80. doi: 10.1002/prot.21735. Proteins. 2007. PMID: 17803217
-
Comparing protein-ligand docking programs is difficult.Proteins. 2005 Aug 15;60(3):325-32. doi: 10.1002/prot.20497. Proteins. 2005. PMID: 15937897 Review.
-
Protein-ligand docking: current status and future challenges.Proteins. 2006 Oct 1;65(1):15-26. doi: 10.1002/prot.21082. Proteins. 2006. PMID: 16862531 Review.
Cited by
-
Improving molecular docking through eHiTS' tunable scoring function.J Comput Aided Mol Des. 2011 Nov;25(11):1033-51. doi: 10.1007/s10822-011-9482-5. Epub 2011 Nov 11. J Comput Aided Mol Des. 2011. PMID: 22076470
-
Physicochemical Heuristics for Identifying High Fidelity, Near-Native Structural Models of Peptide/MHC Complexes.Front Immunol. 2022 Apr 25;13:887759. doi: 10.3389/fimmu.2022.887759. eCollection 2022. Front Immunol. 2022. PMID: 35547730 Free PMC article.
-
Discovering rules for protein-ligand specificity using support vector inductive logic programming.Protein Eng Des Sel. 2009 Sep;22(9):561-7. doi: 10.1093/protein/gzp035. Epub 2009 Jul 2. Protein Eng Des Sel. 2009. PMID: 19574295 Free PMC article.
-
Automated site preparation in physics-based rescoring of receptor ligand complexes.Proteins. 2009 Oct;77(1):52-61. doi: 10.1002/prot.22415. Proteins. 2009. PMID: 19382204 Free PMC article.
-
Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening.Wiley Interdiscip Rev Comput Mol Sci. 2015 Nov-Dec;5(6):405-424. doi: 10.1002/wcms.1225. Epub 2015 Aug 28. Wiley Interdiscip Rev Comput Mol Sci. 2015. PMID: 27110292 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources