Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 May 18:12:169.
doi: 10.1186/1471-2105-12-169.

Predicting drug side-effect profiles: a chemical fragment-based approach

Affiliations

Predicting drug side-effect profiles: a chemical fragment-based approach

Edouard Pauwels et al. BMC Bioinformatics. .

Abstract

Background: Drug side-effects, or adverse drug reactions, have become a major public health concern. It is one of the main causes of failure in the process of drug development, and of drug withdrawal once they have reached the market. Therefore, in silico prediction of potential side-effects early in the drug discovery process, before reaching the clinical stages, is of great interest to improve this long and expensive process and to provide new efficient and safe therapies for patients.

Results: In the present work, we propose a new method to predict potential side-effects of drug candidate molecules based on their chemical structures, applicable on large molecular databanks. A unique feature of the proposed method is its ability to extract correlated sets of chemical substructures (or chemical fragments) and side-effects. This is made possible using sparse canonical correlation analysis (SCCA). In the results, we show the usefulness of the proposed method by predicting 1385 side-effects in the SIDER database from the chemical structures of 888 approved drugs. These predictions are performed with simultaneous extraction of correlated ensembles formed by a set of chemical substructures shared by drugs that are likely to have a set of side-effects. We also conduct a comprehensive side-effect prediction for many uncharacterized drug molecules stored in DrugBank, and were able to confirm interesting predictions using independent source of information.

Conclusions: The proposed method is expected to be useful in various stages of the drug development process.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Characteristics of side-effect data. The left panel shows the index-plot of the number of associated drugs for each side-effect, and the right panel shows the histogram of the number of associated drugs for each side-effect.
Figure 2
Figure 2
ROC curves in the 5-fold cross-validation. Comparison of the performance between nearest neighbor (NN), support vector machine (SVM), ordinary canonical correlation analysis (OCCA) and sparse canonical correlation analysis (SCCA).
Figure 3
Figure 3
Boxplot of the prediction accuracy of predicted side-effects for each drug. Prediction accuract of top 10 ranked predictions (top panels) and top 100 ranked predictions (bottom panels). Comparison of the performance between nearest neighbor (NN), support vector machine (SVM), ordinary canonical correlation analysis (OCCA) and sparse canonical correlation analysis (SCCA).
Figure 4
Figure 4
Boxplot of the AUC (under the ROC curve) scores for individual side-effects. Comparison of the performance between nearest neighbor (NN), support vector machine (SVM), ordinary canonical correlation analysis (OCCA) and sparse canonical correlation analysis (SCCA).
Figure 5
Figure 5
Index-plot of weight vectors for drug substructures and side-effects in OCCA. Index-plot of weight vectors for drug substructures (left) and side-effects (right) extracted by ordinary canonical correlation analysis (OCCA).
Figure 6
Figure 6
Index-plot of weight vectors for drug substructures and side-effects in SCCA. Index-plot of weight vectors for drug substructures (left) and side-effects (right) extracted by sparse canonical correlation analysis (SCCA).
Figure 7
Figure 7
Computational cost. Total execution time of the cross-validation experiment for the four methods (log10 scale).
Figure 8
Figure 8
Nitrogen-containing rings of size 5. (A) Porphyrin group, (B) Proline residue, (C) Histidine residue, (D) Tryptophane residue.
Figure 9
Figure 9
Chemical structure of risperidone. Two dimensional graph structure of risperidone.
Figure 10
Figure 10
Rimonabant substructure selected by the proposed method to be a clue of psychoacticity. The substructure of Rimonabant is selected to be a clue of psychoacticity.

References

    1. Giacomini KM, Krauss RM, Roden DM, Eichelbaum M, Hayden MR, Nakamura Y. When good drugs go bad. Nature. 2007;446(7139):975–977. doi: 10.1038/446975a. - DOI - PubMed
    1. Houtsma D, Guchelaar H, Gelderblom H. Pharmacogenetics in oncology: a promising field. Curr Pharm Des. 2010;16(2):155–163. doi: 10.2174/138161210790112719. - DOI - PubMed
    1. McWhinney S, Goldberg R, McLeod H. Platinum neurotoxicity pharmacogenetics. Mol Cancer Ther. 2009. pp. 10–16. - PMC - PubMed
    1. Tatonetti N, Liu T, Altman R. Predicting drug side-effects by chemical systems biology. Genome Biol. 2009;10:238. doi: 10.1186/gb-2009-10-9-238. - DOI - PMC - PubMed
    1. Scheiber J, Chen B, Milik M, Sukuru S, Bender A, Mikhailov D, Whitebread S, Hamon J, Azzaoui K, Urban L, Glick M, Davies J, Jenkins J. Gaining insight into off-target mediated effects of drug candidates with a comprehensive systems chemical biology analysis. J Chem Inf Model. 2009;49(2):308–17. doi: 10.1021/ci800344p. - DOI - PubMed

MeSH terms

Substances