Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 10:7:e515.
doi: 10.7717/peerj-cs.515. eCollection 2021.

A deep learning approach to predict blood-brain barrier permeability

Affiliations

A deep learning approach to predict blood-brain barrier permeability

Shrooq Alsenan et al. PeerJ Comput Sci. .

Abstract

The blood-brain barrier plays a crucial role in regulating the passage of 98% of the compounds that enter the central nervous system (CNS). Compounds with high permeability must be identified to enable the synthesis of brain medications for the treatment of various brain diseases, such as Parkinson's, Alzheimer's, and brain tumors. Throughout the years, several models have been developed to solve this problem and have achieved acceptable accuracy scores in predicting compounds that penetrate the blood-brain barrier. However, predicting compounds with "low" permeability has been a challenging task. In this study, we present a deep learning (DL) classification model to predict blood-brain barrier permeability. The proposed model addresses the fundamental issues presented in former models: high dimensionality, class imbalances, and low specificity scores. We address these issues to enhance the high-dimensional, imbalanced dataset before developing the classification model: the imbalanced dataset is addressed using oversampling techniques and the high dimensionality using a non-linear dimensionality reduction technique known as kernel principal component analysis (KPCA). This technique transforms the high-dimensional dataset into a low-dimensional Euclidean space while retaining invaluable information. For the classification task, we developed an enhanced feed-forward deep learning model and a convolutional neural network model. In terms of specificity scores (i.e., predicting compounds with low permeability), the results obtained by the enhanced feed-forward deep learning model outperformed those obtained by other models in the literature that were developed using the same technique. In addition, the proposed convolutional neural network model surpassed models used in other studies in multiple accuracy measures, including overall accuracy and specificity. The proposed approach solves the problem inevitably faced with obtaining low specificity resulting in high false positive rate.

Keywords: Blood Brain Barrier (BBB) permeability; Chemoinformatics; Convolutional Neural Network (CNN); Quantitative Structure-Activity Relationships (QSAR).

PubMed Disclaimer

Conflict of interest statement

The authors declare they have no competing interests.

Figures

Figure 1
Figure 1. The four phases of developing the BBB permeability model.
Figure 2
Figure 2. Dataset analysis.
Figure 3
Figure 3. Block diagram of FFDNN model.
Figure 4
Figure 4. Convolutional layer.
Figure 5
Figure 5. Transforming network shape from 2D to 3D.
Figure 6
Figure 6. CNN architecture.
Figure 7
Figure 7. SMOTE oversampling technique.
(A) Class labels transformation. (B) Synthesizing new instance.
Figure 8
Figure 8. Dataset transformation with Kernel PCA.
(A) Original dataset. (B) After kernel PCA.
Figure 9
Figure 9. DL vs. ML models.
Figure 10
Figure 10. ROC plots for DL models.
(A) ROC Enhanced FFDNN. (B) ROC CNN.
Figure 11
Figure 11. ROC plots for ML models.
(A) ROC XGboost. (B) ROC SVM. C) ROC RF.

References

    1. Alsenan S, Al-Turaiki I, Hafez A. Autoencoder-based dimensionality reduction for QSAR modeling. 3rd international conference on computer applications and information security riyadh, Saudi Arabia, March 19-21, 2020; 2020. p. 65.
    1. Alvascience Srl alvaDesc. 2019. https://www.alvascience.com. [16 October 2019]. https://www.alvascience.com
    1. Arlot S, Celisse A. A survey of cross-validation procedures for model selection. Statistics Surveys. 2010;4:40–79.
    1. Bagchi S, Chhibber T, Lahooti B, Verma A, Borse V, Jayant RD. In-vitro blood-brain barrier models for drug screening and permeation studies: an overview. Drug Design, Development and Therapy. 2019;13:3591–3605. doi: 10.2147/DDDT.S218708. - DOI - PMC - PubMed
    1. Blagus R, Lusa L. Improved shrunken centroid classifiers for high-dimensional class-imbalanced data. BMC Bioinformatics. 2013;14(1):64–78. doi: 10.1186/1471-2105-14-64. - DOI - PMC - PubMed

LinkOut - more resources