Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May;17(5):e13824.
doi: 10.1111/cts.13824.

Machine learning framework to predict pharmacokinetic profile of small molecule drugs based on chemical structure

Affiliations

Machine learning framework to predict pharmacokinetic profile of small molecule drugs based on chemical structure

Nikhil Pillai et al. Clin Transl Sci. 2024 May.

Abstract

Accurate prediction of a new compound's pharmacokinetic (PK) profile is pivotal for the success of drug discovery programs. An initial assessment of PK in preclinical species and humans is typically performed through allometric scaling and mathematical modeling. These methods use parameters estimated from in vitro or in vivo experiments, which although helpful for an initial estimation, require extensive animal experiments. Furthermore, mathematical models are limited by the mechanistic underpinning of the drugs' absorption, distribution, metabolism, and elimination (ADME) which are largely unknown in the early stages of drug discovery. In this work, we propose a novel methodology in which concentration versus time profile of small molecules in rats is directly predicted by machine learning (ML) using structure-driven molecular properties as input and thus mitigating the need for animal experimentation. The proposed framework initially predicts ADME properties based on molecular structure and then uses them as input to a ML model to predict the PK profile. For the compounds tested, our results demonstrate that PK profiles can be adequately predicted using the proposed algorithm, especially for compounds with Tanimoto score greater than 0.5, the average mean absolute percentage error between predicted PK profile and observed PK profile data was found to be less than 150%. The suggested framework aims to facilitate PK predictions and thus support molecular screening and design earlier in the drug discovery process.

PubMed Disclaimer

Conflict of interest statement

All authors were employed by Sanofi while the manuscript was written. The authors declared no competing interests for this work.

Figures

FIGURE 1
FIGURE 1
Modeling framework utilized in this study. (1) Chemical structures represented as SMILES string were used as input. (2) RDKit package was utilized to extract Fingerprints and Descriptors based on the SMILES string. (3) Feature selection performed to identify key features. (4) Chemical structures were then used as an input to a machine learning model to predict PK parameters identified in step 3, which were then used as an input to a second machine learning model to predict the PK profile.
FIGURE 2
FIGURE 2
Visualization of the chemical space of compounds used to train the model and the compounds which were used in the test set. T distributed stochastic neighbor embedding (t‐SNE) approach is used to perform dimensionality reduction to help facilitate visualization of chemical space.
FIGURE 3
FIGURE 3
Scatter plot of concentrations predicted using machine learning framework (y‐axis) versus observed concentrations (x‐axis) for the test dataset.
FIGURE 4
FIGURE 4
PK profiles (concentration vs. time) predicted using ML framework (red line), PBPK modeling (Green region) overlayed over observed data (Blue dots).
FIGURE 5
FIGURE 5
Distribution of ratio of AUC24h observed versus AUC24h predicted, and ratio of C max observed versus C max predicted for predictions generated using ML framework.

Similar articles

Cited by

References

    1. Hughes JP, Rees S, Kalindjian SB, Philpott KL. Principles of early drug discovery. Br J Pharmacol. 2011;162(6):1239‐1249. - PMC - PubMed
    1. Sou T, Hansen J, Liepinsh E, et al. Model‐informed drug development for antimicrobials: translational PK and PK/PD modeling to predict an efficacious human dose for Apramycin. Clin Pharmacol Ther. 2021;109(4):1063‐1073. - PMC - PubMed
    1. Cella M, Gorter de Vries F, Burger D, Danhof M, Della Pasqua O. A model‐based approach to dose selection in early pediatric development. Clin Pharmacol Ther. 2010;87(3):294‐302. - PubMed
    1. Mavroudis PD, Pillai N, Wang Q, Pouzin C, Greene B, Fretland J. A multi‐model approach to predict efficacious clinical dose for an anti‐TGF‐β antibody (GC2008) in the treatment of osteogenesis imperfecta. CPT Pharmacometrics Syst Pharmacol. 2022;11(11):1485‐1496. - PMC - PubMed
    1. Stewart A, Denoyer D, Gao X, Toh YC. The FDA modernisation act 2.0: bringing non‐animal technologies to the regulatory table. Drug Discov Today. 2023;28(4):103496. - PubMed

Publication types

LinkOut - more resources