Extraction and validation of substructure profiles for enriching compound libraries
- PMID: 22983491
- DOI: 10.1007/s10822-012-9604-8
Extraction and validation of substructure profiles for enriching compound libraries
Abstract
Compounds known to be potent against a specific protein target may potentially contain a signature profile of common substructures that is highly correlated to their potency. These substructure profiles may be useful in enriching compound libraries or for prioritizing compounds against a specific protein target. With this objective in mind, a set of compounds with known potency against six selected kinases (2 each from 3 kinase families) was used to generate binary molecular fingerprints. Each fingerprint key represents a substructure that is found within a compound and the frequency with which the fingerprint occurs was then tabulated. Thereafter, a frequent pattern mining technique was applied with the aim of uncovering substructures that are not only well represented among known potent inhibitors but are also unrepresented among known inactive compounds and vice versa. Substructure profiles that are representative of potent inhibitors against each of the 3 kinase families were thus extracted. Based on our validation results, these substructure profiles demonstrated significant enrichment for highly potent compounds against their respective kinase targets. The advantages of using our approach over conventional methods in analyzing such datasets and its application in the mining of substructures for enriching compound libraries are presented.
Similar articles
-
Substructural Connectivity Fingerprint and Extreme Entropy Machines-A New Method of Compound Representation and Analysis.Molecules. 2018 May 23;23(6):1242. doi: 10.3390/molecules23061242. Molecules. 2018. PMID: 29789513 Free PMC article.
-
Substructure mining using elaborate chemical representation.J Chem Inf Model. 2006 Mar-Apr;46(2):597-605. doi: 10.1021/ci0503715. J Chem Inf Model. 2006. PMID: 16562988
-
Protocols for the Design of Kinase-focused Compound Libraries.Mol Inform. 2018 May;37(5):e1700119. doi: 10.1002/minf.201700119. Epub 2017 Nov 8. Mol Inform. 2018. PMID: 29116686
-
Fragment-based approaches to the discovery of kinase inhibitors.Methods Enzymol. 2014;548:69-92. doi: 10.1016/B978-0-12-397918-6.00003-3. Methods Enzymol. 2014. PMID: 25399642 Review.
-
Kinase-targeted libraries: the design and synthesis of novel, potent, and selective kinase inhibitors.Drug Discov Today. 2009 Mar;14(5-6):291-7. doi: 10.1016/j.drudis.2008.12.002. Epub 2009 Jan 21. Drug Discov Today. 2009. PMID: 19121409 Review.
Cited by
-
Machine learning in chemoinformatics and drug discovery.Drug Discov Today. 2018 Aug;23(8):1538-1546. doi: 10.1016/j.drudis.2018.05.010. Epub 2018 May 8. Drug Discov Today. 2018. PMID: 29750902 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources