Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 20;27(6):110032.
doi: 10.1016/j.isci.2024.110032. eCollection 2024 Jun 21.

Predicting therapeutic and side effects from drug binding affinities to human proteome structures

Affiliations

Predicting therapeutic and side effects from drug binding affinities to human proteome structures

Ryusuke Sawada et al. iScience. .

Abstract

Evaluation of the binding affinities of drugs to proteins is a crucial process for identifying drug pharmacological actions, but it requires three dimensional structures of proteins. Herein, we propose novel computational methods to predict the therapeutic indications and side effects of drug candidate compounds from the binding affinities to human protein structures on a proteome-wide scale. Large-scale docking simulations were performed for 7,582 drugs with 19,135 protein structures revealed by AlphaFold (including experimentally unresolved proteins), and machine learning models on the proteome-wide binding affinity score (PBAS) profiles were constructed. We demonstrated the usefulness of the method for predicting the therapeutic indications for 559 diseases and side effects for 285 toxicities. The method enabled to predict drug indications for which the related protein structures had not been experimentally determined and to successfully extract proteins eliciting the side effects. The proposed method will be useful in various applications in drug discovery.

Keywords: Biocomputational method; Bioinformatics; Biological sciences; Natural sciences; Pharmacoinformatics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Workflow of the proposed drug discovery methods using AlphaFold (A) Structures of the drugs were obtained from KEGG DRUG. (B) 3D structural data for all human proteins were retrieved from AlphaFoldDB. (C) Ligand-binding pockets were detected for all human protein structures, and the binding affinities for all drug-protein pairs were assessed. (D) Estimated binding affinities were compiled to generate the proteome-wide binding affinity score (PBAS) profiles. (E and F) Using PBAS profiles, the prediction of potential drug therapeutic indications (E) and side effects (F) was conducted.
Figure 2
Figure 2
Proteome-wide binding affinity score (PBAS) profile and clustering analysis The PBAS profiles created from the results of docking simulations for all drugs with all human proteins are shown in the center of the heatmap. The estimated binding affinities are expressed as a gradient, with the horizontal and vertical axes representing proteins (19,135 structures) and drugs (7,582 drugs), respectively. Proteins and drugs are clustered separately, and the labels are shown for each cluster. The dendrogram on the left side of and under the heatmap shows the clusters of drugs (D1–D8) and proteins (P1–P5), respectively. Enrichment analysis was performed for the protein and drug clusters. The table above the heatmap reflects the results of the enrichment analysis of the structural domain groups of InterPro for the protein clusters. The number of proteins in each structural domain group is shown as a gradient. The table on the right side of the heatmap contains the outcomes of the enrichment analysis for the Anatomical Therapeutic Chemical (ATC) classification system groups for the drug clusters. The number of drugs enriched in the ATC group is marked as a gradient.
Figure 3
Figure 3
Small part of the drug-protein-disease network predicted by the proposed methods using proteome-wide binding affinity score profiles Blue circles, green rhombuses, and red rectangles stand for drugs, diseases, and proteins, respectively. Orange broken and gray lines indicate the relationships between the drugs and proteins obtained from the docking simulation results and those between proteins and possible disease indications, respectively.
Figure 4
Figure 4
Protein-compound binding structures estimated by docking simulation Compounds and proteins are shown as a red rod and an ivory ribbon model, respectively. (A) Overall structure of PTGFR docked with cloprostenol. (B) Overall structure of SLC5A2 conjugated with ertugliflozin. (C) Overall structure of OPRM1 merged with buprenorphine. All protein structures were estimated using AlphaFold.
Figure 5
Figure 5
Performance evaluation on the side effect prediction Asterisks indicate level of statistical significance: ∗∗p < 0.01. Upper two panels show the AUC (left) and AUPR (right) scores for the SIDER dataset, and lower two panels show the scores for the FARES dataset. The results of replicates (n = 30) of cross-validation experiments for each of the seven profiles are shown. PBAS + Fingerprint, PBAS + TESS, and PBAS + TELR represent the integrated profiles that combine PBAS with the other three profiles.
Figure 6
Figure 6
Small part of the drug-protein-side effect network predicted using our method Blue circles, red rectangles, and green rhombuses indicate drugs, proteins, and side effects, respectively. Orange-dashed and green-dotted lines show the drug-protein relationships estimated from the docking simulation and the relationships between side effects and proteins with high weights in the predictive model, respectively.

Similar articles

Cited by

References

    1. DiMasi J.A., Grabowski H.G. The cost of biopharmaceutical R&D: is biotech different? MDE. Manage. Decis. Econ. 2007;28:469–479. doi: 10.1002/mde.1360. - DOI
    1. Sawada R., Iwata H., Mizutani S., Yamanishi Y. Target-Based Drug Repositioning Using Large-Scale Chemical-Protein Interactome Data. J. Chem. Inf. Model. 2015;55:2717–2730. doi: 10.1021/acs.jcim.5b00330. - DOI - PubMed
    1. Sawada R., Iwata M., Tabei Y., Yamato H., Yamanishi Y. Predicting inhibitory and activatory drug targets by chemically and genetically perturbed transcriptome signatures. Sci. Rep. 2018;8:156. doi: 10.1038/s41598-017-18315-9. - DOI - PMC - PubMed
    1. Mizutani S., Pauwels E., Stoven V., Goto S., Yamanishi Y. Relating drug-protein interaction network with drug side effects. Bioinformatics. 2012;28:i522–i528. doi: 10.1093/bioinformatics/bts383. - DOI - PMC - PubMed
    1. Amano Y., Honda H., Sawada R., Nukada Y., Yamane M., Ikeda N., Morita O., Yamanishi Y. In silico systems for predicting chemical-induced side effects using known and potential chemical protein interactions, enabling mechanism estimation. J. Toxicol. Sci. 2020;45:137–149. doi: 10.2131/jts.45.137. - DOI - PubMed

LinkOut - more resources