Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 10:14:1099093.
doi: 10.3389/fphar.2023.1099093. eCollection 2023.

DEEPCYPs: A deep learning platform for enhanced cytochrome P450 activity prediction

Affiliations

DEEPCYPs: A deep learning platform for enhanced cytochrome P450 activity prediction

Daiqiao Ai et al. Front Pharmacol. .

Abstract

Cytochrome P450 (CYP) is a superfamily of heme-containing oxidizing enzymes involved in the metabolism of a wide range of medicines, xenobiotics, and endogenous compounds. Five of the CYPs (1A2, 2C9, 2C19, 2D6, and 3A4) are responsible for metabolizing the vast majority of approved drugs. Adverse drug-drug interactions, many of which are mediated by CYPs, are one of the important causes for the premature termination of drug development and drug withdrawal from the market. In this work, we reported in silicon classification models to predict the inhibitory activity of molecules against these five CYP isoforms using our recently developed FP-GNN deep learning method. The evaluation results showed that, to the best of our knowledge, the multi-task FP-GNN model achieved the best predictive performance with the highest average AUC (0.905), F1 (0.779), BA (0.819), and MCC (0.647) values for the test sets, even compared to advanced machine learning, deep learning, and existing models. Y-scrambling testing confirmed that the results of the multi-task FP-GNN model were not attributed to chance correlation. Furthermore, the interpretability of the multi-task FP-GNN model enables the discovery of critical structural fragments associated with CYPs inhibition. Finally, an online webserver called DEEPCYPs and its local version software were created based on the optimal multi-task FP-GNN model to detect whether compounds bear potential inhibitory activity against CYPs, thereby promoting the prediction of drug-drug interactions in clinical practice and could be used to rule out inappropriate compounds in the early stages of drug discovery and/or identify new CYPs inhibitors.

Keywords: CYPs inhibitors; cytochrome P450; deep learning; multi-task FP-GNN; online webserver.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Model construction pipeline.
FIGURE 2
FIGURE 2
The data occupation distribution for the five isoforms in the CYPs modelling datasets.
FIGURE 3
FIGURE 3
The distribution of molecular chemical space of CYP1A2 (A), CYP2C9 (B), CYP2C19 (C), CYP2D6 (D), and CYP3A4 (E). LogP (X-axis) and molecular weight (MW, Y-axis) were used to define chemical space. RDKit software was used to calculate MW and LogP.
FIGURE 4
FIGURE 4
The Bemis Murcko scaffold analysis of CYP1A2 (orange), CYP2C19 (green), CYP2C9 (purple), CYP2D6 (yellow), and CYP3A4 (blue) inhibitors. The five CYPs isoforms (X-axis) and the fraction of scaffolds in the modeling datasets (scaffolds/compounds (%), Y-axis) were used to determine structural diversity.
FIGURE 5
FIGURE 5
The importance of molecular structures during the prediction process of the GNN module of the multi-task FP-GNN model on the CYPs dataset. The darker the color, the more important are for the structures. (A) Represents an active molecule on the five isoforms. (B) Represents an inactive molecule on the five isoforms.
FIGURE 6
FIGURE 6
(A) Represents the bioactivity prediction diagram of the DEEPCYPs. (B) Represents a case result display of the model applicability domain module of the DEEPCYPs. The chemical structure of thiabendazole is used as an example.

Similar articles

Cited by

References

    1. Ai D., Wu J., Cai H., Zhao D., Chen Y., Wei J., et al. (2022). A multi-task FP-GNN framework enables accurate prediction of selective PARP inhibitors. Front. Pharmacol. 13, 971369. 10.3389/fphar.2022.971369 - DOI - PMC - PubMed
    1. Arimoto R. (2006). Computational models for predicting interactions with cytochrome p450 enzyme. Curr. Top. Med. Chem. 6, 1609–1618. 10.2174/156802606778108951 - DOI - PubMed
    1. Banerjee P., Dunkel M., Kemmler E., Preissner R. (2020). SuperCYPsPred—A web server for the prediction of cytochrome activity. Nucleic Acids Res. 48, W580–W585. 10.1093/nar/gkaa166 - DOI - PMC - PubMed
    1. Bemis G. W., Murcko M. A. (1996). The properties of known drugs. 1. Molecular frameworks. J. Med. Chem. 39, 2887–2893. 10.1021/jm9602928 - DOI - PubMed
    1. Bojić M., Kondža M., Rimac H., Benković G., Maleš Ž. (2019). The effect of flavonoid aglycones on the CYP1A2, CYP2A6, CYP2C8 and CYP2D6 enzymes activity. Molecules 24, 3174. 10.3390/molecules24173174 - DOI - PMC - PubMed