Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb 12;10(4):633-638.
doi: 10.1021/acsmedchemlett.8b00603. eCollection 2019 Apr 11.

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database

Affiliations

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database

Angelica Mazzolari et al. ACS Med Chem Lett. .

Abstract

Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Model data sets: relative proportions between the two classes undergoing binary classification modeling. Panel e shows the ratio of first-generation substrates (I-GEN SUBs) and second-or more generation substrates (II-GEN SUBs) of the Model 1 data set.
Figure 2
Figure 2
Principal component analysis (a,b) and molecular similarity analysis based on Skelspheres descriptors (c) of the Model 1 data set. The data set is composed of 1400 UGT non-substrates and 792 UGT-substrates comprising 400 first-generation substrates (I-GEN SUBs) and 392 second-or more generation substrates (II-GEN SUBs). See the Supporting Information for a more detailed description of the images.
Figure 3
Figure 3
MCCV performance for Model 1, Model 2, and random undersampled Model 2 based on 100 iterations of model generation and evaluation. The performance for the single class is measured in terms of Precision, Recall, and F1 score, while the overall performance is measured in terms of Matthew Correlation Coefficient (MCC) and Area Under ROC Curve (AUC).
Figure 4
Figure 4
Distribution of the probability measures assigned by the RF algorithm during Model 1 LOO validation. The analysis involves the probabilities assigned only to the retrieved true positives and true negatives.
Figure 5
Figure 5
Analysis of the feature importance in Model 1. Distribution of the number of features (a), distribution of the category importance (b), and distribution of the category importance weighted on the total number of features in each category (c).

References

    1. Kirchmair J.; Williamson M. J.; Tyzack J. D.; Tan L.; Bond P. J.; Bender A.; Glen R. C. Computational Prediction of Metabolism: Sites, Products, SAR, P450 Enzyme Dynamics, and Mechanisms. J. Chem. Inf. Model. 2012, 52 (3), 617–648. 10.1021/ci200542m. - DOI - PMC - PubMed
    1. Sun H.; Scott D. O. Structure-Based Drug Metabolism Predictions for Drug Design. Chem. Biol. Drug Des. 2010, 75 (1), 3–17. 10.1111/j.1747-0285.2009.00899.x. - DOI - PubMed
    1. Almond L.; Yang J.; Jamei M.; Tucker G.; Rostami-Hodjegan A. Towards a Quantitative Framework for the Prediction of DDIs Arising from Cytochrome P450 Induction. Curr. Drug Metab. 2009, 10 (4), 420–432. 10.2174/138920009788498978. - DOI - PubMed
    1. Cascorbi I. Drug Interactions--Principles, Examples and Clinical Consequences. Dtsch. Arztebl. Int. 2012, 109 (33–34), 546–555. - PMC - PubMed
    1. Caron G.; Ermondi G.; Testa B. Predicting the Oxidative Metabolism of Statins: An Application of the MetaSite®. Pharm. Res. 2007, 24 (3), 480–501. 10.1007/s11095-006-9199-7. - DOI - PubMed