Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 7;16(6):776.
doi: 10.3390/pharmaceutics16060776.

Predicting ADMET Properties from Molecule SMILE: A Bottom-Up Approach Using Attention-Based Graph Neural Networks

Affiliations

Predicting ADMET Properties from Molecule SMILE: A Bottom-Up Approach Using Attention-Based Graph Neural Networks

Alessandro De Carlo et al. Pharmaceutics. .

Abstract

Understanding the pharmacokinetics, safety and efficacy of candidate drugs is crucial for their success. One key aspect is the characterization of absorption, distribution, metabolism, excretion and toxicity (ADMET) properties, which require early assessment in the drug discovery and development process. This study aims to present an innovative approach for predicting ADMET properties using attention-based graph neural networks (GNNs). The model utilizes a graph-based representation of molecules directly derived from Simplified Molecular Input Line Entry System (SMILE) notation. Information is processed sequentially, from substructures to the whole molecule, employing a bottom-up approach. The developed GNN is tested and compared with existing approaches using six benchmark datasets and by encompassing regression (lipophilicity and aqueous solubility) and classification (CYP2C9, CYP2C19, CYP2D6 and CYP3A4 inhibition) tasks. Results show the effectiveness of our model, which bypasses the computationally expensive retrieval and selection of molecular descriptors. This approach provides a valuable tool for high-throughput screening, facilitating early assessment of ADMET properties and enhancing the likelihood of drug success in the development pipeline.

Keywords: ADMET prediction; attention-based architecture; graph neural network; model-based drug development.

PubMed Disclaimer

Conflict of interest statement

The authors confirm that there are no conflicts of interest.

Figures

Figure 1
Figure 1
Example of how adjacency matrices are extracted from molecular SMILE. For each type of bond (i.e., single, double, triple or aromatic), a specific adjacency matrix is derived in order to focus on molecular substructures.
Figure 2
Figure 2
Schematic representation of the GNN adopted. The architecture is organized as a stack of three main modules, each with a specific function.
Figure 3
Figure 3
Schematic representation of the implemented five-fold cross validation. At each step, one fold (orange) is used as an external test set; the remaining four are used for training and validation. And 20% of the four folds are used as validation data.
Figure 4
Figure 4
Distributions of regression variables in two benchmark datasets. Histograms of Lipophilicity AZ panel (A) and AqSolDB panel (B) data.
Figure 5
Figure 5
Example of the weighting strategy adopted for both regression tasks. Panel (A) shows the weights introduced for training the GNN on LogD prediction. Panel (B) focuses on LogS. For both tasks, α was set to 0.55.
Figure 6
Figure 6
Models used in the ablation study to benchmark the implemented GNN architecture. Panel (A) illustrates the ‘Whole Molecule’ GNN, which does not consider molecular substructures. Panel (B) represents the ‘Convolutional’ GNN, in which the attention mechanism for the entire molecule is replaced by a graph convolutional (GC) layer.
Figure 7
Figure 7
Results of the ablation study on the regression tasks.
Figure 8
Figure 8
Results of the ablation study on the classification tasks.

Similar articles

Cited by

References

    1. Wouters O.J., McKee M., Luyten J. Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009–2018. JAMA. 2020;323:844–853. doi: 10.1001/jama.2020.1166. - DOI - PMC - PubMed
    1. Cook D., Brown D., Alexander R., March R., Morgan P., Satterthwaite G., Pangalos M.N. Lessons learned from the fate of AstraZeneca’s drug pipeline: A five-dimensional framework. Nat. Rev. Drug Discov. 2014;13:419–431. doi: 10.1038/nrd4309. - DOI - PubMed
    1. Mohamed M.E., Trueman S., Othman A.A., Han J.H., Ju T.R., Marroum P. Development of In Vitro–In Vivo Correlation for Upadacitinib Extended-Release Tablet Formulation. AAPS J. 2019;21:108. doi: 10.1208/s12248-019-0378-y. - DOI - PMC - PubMed
    1. Hanif M., Shoaib M.H., Yousuf R.I., Zafar F. Development of in vitro-in vivo correlations for newly optimized Nimesulide formulations. PLoS ONE. 2018;13:e0203123. doi: 10.1371/journal.pone.0203123. - DOI - PMC - PubMed
    1. Kapungu N.N., Li X., Nhachi C., Masimirembwa C., Thelingwani R.S. In vitro and in vivo human metabolism and pharmacokinetics of S- and R-praziquantel. Pharmacol. Res. Perspect. 2020;8:e00618. doi: 10.1002/prp2.618. - DOI - PMC - PubMed

LinkOut - more resources