Sharing chemical information without sharing chemical structure
- PMID: 18254609
- DOI: 10.1021/ci600383v
Abstract
Studies to assess the risks of revealing chemical structures by sharing various chemical descriptor data are presented. Descriptors examined include "Lipinski-like" properties, 2D-BCUT descriptors, and a high-dimensional "fingerprint-like" descriptor (MACCS-vector). We demonstrate that unless sufficient precautions are taken, de novo design software such as EA-Inventor is able to derive a unique chemical structure or a set of closely related analogs from some commonly used descriptors. Based on the results of our studies, a set of guidelines or recommendations for safely sharing chemical information without revealing chemical structure is presented. A procedure for assessing the risk of revealing chemical structure when exchanging chemical descriptor information was also developed. The procedure is generic and can be applied to any chemical descriptor or combination of descriptors and to any set of structures to enable a decision about whether the exchange of information can be done without revealing the chemical structures.
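The idea behind such a risk assessment can be illustrated with a small sketch. This is not the authors' procedure, which relies on de novo design software (EA-Inventor) to search chemical space; it is a simplified stand-in that counts how many structures in a hypothetical reference pool are consistent with a shared descriptor vector. The function names, compound IDs, descriptor values, and tolerances below are all assumptions made for illustration. A single matching candidate suggests the descriptors pinpoint the structure, while many candidates suggest the structure stays hidden.

```python
from typing import Dict, List, Sequence


def matches_within_tolerance(
    a: Sequence[float], b: Sequence[float], tol: Sequence[float]
) -> bool:
    """True if every descriptor in `a` is within its tolerance of `b`."""
    return all(abs(x - y) <= t for x, y, t in zip(a, b, tol))


def candidate_matches(
    shared: Sequence[float],
    reference_pool: Dict[str, Sequence[float]],
    tol: Sequence[float],
) -> List[str]:
    """Return IDs of pool structures consistent with the shared descriptors.

    Fewer candidates means a higher risk that the shared values
    reveal the underlying structure.
    """
    return [
        cid
        for cid, desc in reference_pool.items()
        if matches_within_tolerance(shared, desc, tol)
    ]


# Hypothetical "Lipinski-like" descriptors (illustrative values only):
# (molecular weight, logP, H-bond donors, H-bond acceptors)
pool = {
    "cmpd_A": (180.2, 1.2, 1, 4),
    "cmpd_B": (181.0, 1.3, 1, 4),
    "cmpd_C": (312.4, 3.5, 2, 5),
    "cmpd_D": (310.9, 3.4, 2, 5),
    "cmpd_E": (450.6, 4.8, 0, 7),
}

shared = (180.2, 1.2, 1, 4)       # descriptor vector one might share
tight = (0.05, 0.05, 0, 0)        # values shared at full precision
coarse = (5.0, 0.5, 0, 0)         # values rounded or binned before sharing

print(candidate_matches(shared, pool, tight))
print(candidate_matches(shared, pool, coarse))
```

In this toy setup, sharing coarser values enlarges the set of consistent candidates, which is one kind of precaution the abstract alludes to. A real assessment would need a far larger candidate space than a five-compound pool.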