De Novo Direct Inverse QSPR/QSAR: Chemical Variational Autoencoder and Gaussian Mixture Regression Models
- PMID: 36635071
- DOI: 10.1021/acs.jcim.2c01298
De Novo Direct Inverse QSPR/QSAR: Chemical Variational Autoencoder and Gaussian Mixture Regression Models
Abstract
Herein, we propose a de novo direct inverse quantitative structure-property relationship/quantitative structure-activity relationship (QSPR/QSAR) analysis method, based on the chemical variational autoencoder (VAE) and Gaussian mixture regression (GMR) models, to generate molecules with the desired target variables of interest for properties and activities (y). A data set of molecules was analyzed, and an encoder was used to transform the simplified molecular input line entry system (SMILES) strings to latent variables (x), while a decoder was used to transform x to SMILES strings. A chemical VAE model was used for analysis and a GMR model (between x and y) was constructed for direct inverse analysis. The target y values were input into the GMR model to directly predict the x values. Following this, the predicted x values were input into the decoder associated with the chemical VAE model and the SMILES string representations (or chemical structures of molecules) were obtained as the output, indicating that the proposed method could be used to selectively obtain the molecules that were characterized by the target y values. We confirmed that the proposed method can be used to generate molecules within the target y ranges even when the conventional chemical VAE model failed to generate the target molecules.
Similar articles
-
Improving Molecular Design with Direct Inverse Analysis of QSAR/QSPR Model.Mol Inform. 2025 Jan;44(1):e202400227. doi: 10.1002/minf.202400227. Mol Inform. 2025. PMID: 39797757 Free PMC article.
-
Improving Chemical Autoencoder Latent Space and Molecular De Novo Generation Diversity with Heteroencoders.Biomolecules. 2018 Oct 30;8(4):131. doi: 10.3390/biom8040131. Biomolecules. 2018. PMID: 30380783 Free PMC article.
-
Inverse QSPR/QSAR Analysis for Chemical Structure Generation (from y to x).J Chem Inf Model. 2016 Feb 22;56(2):286-99. doi: 10.1021/acs.jcim.5b00628. Epub 2016 Feb 8. J Chem Inf Model. 2016. PMID: 26818135
-
[Ring-system-based Chemical Structure Enumeration for de Novo Design].Yakugaku Zasshi. 2016;136(1):101-6. doi: 10.1248/yakushi.15-00230-2. Yakugaku Zasshi. 2016. PMID: 26725676 Review. Japanese.
-
Deep Generative Models for Molecular Science.Mol Inform. 2018 Jan;37(1-2). doi: 10.1002/minf.201700133. Epub 2018 Feb 6. Mol Inform. 2018. PMID: 29405647 Review.
Cited by
-
Improving Molecular Design with Direct Inverse Analysis of QSAR/QSPR Model.Mol Inform. 2025 Jan;44(1):e202400227. doi: 10.1002/minf.202400227. Mol Inform. 2025. PMID: 39797757 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials