A deep learning approach to predict blood-brain barrier permeability

Shrooq Alsenan¹, Isra Al-Turaiki², Alaaeldin Hafez³

Affiliations

¹ Information Systems Department, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
² Information Technology Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.
³ Information Systems Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.

PMID: 34179448
PMCID: PMC8205267
DOI: 10.7717/peerj-cs.515

Shrooq Alsenan et al. PeerJ Comput Sci. 2021 Jun 10;7:e515. doi: 10.7717/peerj-cs.515. eCollection 2021.

Abstract

The blood-brain barrier plays a crucial role in regulating the passage of 98% of the compounds that enter the central nervous system (CNS). Compounds with high permeability must be identified to enable the synthesis of brain medications for the treatment of various brain diseases, such as Parkinson's, Alzheimer's, and brain tumors. Over the years, several models have been developed to address this problem and have achieved acceptable accuracy in predicting compounds that penetrate the blood-brain barrier. However, predicting compounds with "low" permeability has remained a challenging task. In this study, we present a deep learning (DL) classification model to predict blood-brain barrier permeability. The proposed model addresses the fundamental issues of earlier models: high dimensionality, class imbalance, and low specificity scores. To improve the high-dimensional, imbalanced dataset before developing the classification model, we handle class imbalance using oversampling techniques and high dimensionality using a non-linear dimensionality reduction technique known as kernel principal component analysis (KPCA). This technique transforms the high-dimensional dataset into a low-dimensional Euclidean space while retaining invaluable information. For the classification task, we developed an enhanced feed-forward deep learning model and a convolutional neural network model. In terms of specificity (i.e., correctly predicting compounds with low permeability), the enhanced feed-forward deep learning model outperformed other models in the literature that were developed using the same technique. In addition, the proposed convolutional neural network model surpassed models used in other studies on multiple performance measures, including overall accuracy and specificity. The proposed approach addresses the recurring problem of low specificity, which results in a high false positive rate.
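The link between low specificity and a high false positive rate follows from their definitions: specificity is TN / (TN + FP) and the false positive rate is FP / (TN + FP), so the two always sum to 1. The following is a minimal sketch of that arithmetic, not the authors' evaluation code. The function names and the toy labels are hypothetical, and the encoding 1 = permeable, 0 = non-permeable is an assumption made for illustration.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels.

    Assumed encoding (illustrative): 1 = permeable, 0 = non-permeable.
    """
    tp = tn = fp = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1 and predicted == 1:
            tp += 1
        elif actual == 0 and predicted == 0:
            tn += 1
        elif actual == 0 and predicted == 1:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    """Fraction of all compounds that were classified correctly."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)


def specificity(y_true, y_pred):
    """Fraction of truly non-permeable compounds predicted as non-permeable."""
    _, tn, fp, _ = confusion_counts(y_true, y_pred)
    if tn + fp == 0:
        raise ValueError("no negative-class samples in y_true")
    return tn / (tn + fp)


def false_positive_rate(y_true, y_pred):
    """Fraction of truly non-permeable compounds predicted as permeable."""
    _, tn, fp, _ = confusion_counts(y_true, y_pred)
    if tn + fp == 0:
        raise ValueError("no negative-class samples in y_true")
    return fp / (tn + fp)


# Hypothetical, imbalanced toy labels (not data from the study):
# eight permeable compounds, two non-permeable ones.
y_true = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 1, 1, 1, 1, 1, 1, 0]

print("accuracy:", accuracy(y_true, y_pred))
print("specificity:", specificity(y_true, y_pred))
print("false positive rate:", false_positive_rate(y_true, y_pred))
```

On this toy input the accuracy is 0.9 while the specificity is only 0.5, which illustrates how an imbalanced dataset can hide poor performance on the low-permeability class behind a good overall accuracy.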

Keywords: Blood Brain Barrier (BBB) permeability; Chemoinformatics; Convolutional Neural Network (CNN); Quantitative Structure-Activity Relationships (QSAR).

Conflict of interest statement

The authors declare they have no competing interests.

Figures

**Figure 1. The four phases of developing the BBB permeability model.**

**Figure 3. Block diagram of FFDNN model.**

**Figure 5. Transforming network shape from 2D to 3D.**

**Figure 7. SMOTE oversampling technique.**
(A) Class labels transformation. (B) Synthesizing new instance.
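Panel (B) of Figure 7 refers to synthesizing a new instance. In the standard SMOTE formulation, a synthetic minority sample is created by picking a minority-class point, choosing one of its k nearest minority neighbours, and interpolating at a random position on the line segment between them. The sketch below illustrates only that interpolation step under stated assumptions. It is not the authors' implementation, and the function names, the value of `k`, and the toy points are hypothetical.

```python
import math
import random


def nearest_neighbors(point, others, k):
    """Return the k points in `others` closest to `point` (Euclidean distance)."""
    return sorted(others, key=lambda other: math.dist(point, other))[:k]


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by SMOTE-style interpolation.

    minority: list of equal-length tuples (minority-class feature vectors);
              at least two samples are required.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        others = [m for m in minority if m is not base]
        neighbor = rng.choice(nearest_neighbors(base, others, k))
        gap = rng.random()  # random position in [0, 1) along the segment
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbor))
        )
    return synthetic


# Hypothetical 2-D minority-class points (not data from the study).
minority_points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
new_points = smote_like_oversample(minority_points, n_new=5, k=2)
for p in new_points:
    print(p)
```

Because each synthetic point lies on a segment between two existing minority samples, the new points stay inside the region already covered by the minority class rather than being exact duplicates of existing samples.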

**Figure 8. Dataset transformation with Kernel PCA.**
(A) Original dataset. (B) After kernel PCA.
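The transformation summarized in Figure 8 can be illustrated with a small kernel PCA sketch. Kernel PCA builds a kernel (similarity) matrix over the samples, centers it in feature space, and projects the samples onto the leading eigenvectors of the centered matrix. The code below is a minimal NumPy sketch under assumptions: the source does not state the kernel or its parameters here, so the RBF kernel, the `gamma` value, the function name, and the random input data are all illustrative choices rather than the authors' configuration.

```python
import numpy as np


def rbf_kernel_pca(X, n_components, gamma):
    """Project the rows of X onto the top kernel principal components.

    Uses an RBF kernel k(x, y) = exp(-gamma * ||x - y||^2) (assumed choice).
    """
    # Pairwise squared Euclidean distances between all samples.
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negatives

    K = np.exp(-gamma * sq_dists)

    # Center the kernel matrix in the implicit feature space.
    n = K.shape[0]
    one_n = np.full((n, n), 1.0 / n)
    K_centered = K - one_n @ K - K @ one_n + one_n @ K @ one_n

    # Eigendecomposition of the symmetric centered kernel matrix.
    eigvals, eigvecs = np.linalg.eigh(K_centered)
    order = np.argsort(eigvals)[::-1][:n_components]
    eigvals = eigvals[order]
    eigvecs = eigvecs[:, order]

    # Training-point projections: unit eigenvectors scaled by sqrt(eigenvalue).
    return eigvecs * np.sqrt(np.maximum(eigvals, 0.0))


# Hypothetical high-dimensional input (random numbers, not study data):
# 30 samples with 10 descriptors each, reduced to 2 components.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 10))
Z = rbf_kernel_pca(X, n_components=2, gamma=0.1)
print(Z.shape)
```

In practice a library implementation such as scikit-learn's `KernelPCA` would normally be used, since it also supports projecting unseen samples. The sketch only shows the steps applied to the training data.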

**Figure 10. ROC plots for DL models.**
(A) ROC Enhanced FFDNN. (B) ROC CNN.

**Figure 11. ROC plots for ML models.**
(A) ROC XGBoost. (B) ROC SVM. (C) ROC RF.
