Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 17;14(1):8908.
doi: 10.1038/s41598-024-59734-9.

Non-animal models for blood-brain barrier permeability evaluation of drug-like compounds

Affiliations

Non-animal models for blood-brain barrier permeability evaluation of drug-like compounds

Frederic O Dehnbostel et al. Sci Rep. .

Abstract

Diseases related to the central nervous system (CNS) are major health concerns and have serious social and economic impacts. Developing new drugs for CNS-related disorders presents a major challenge as it actively involves delivering drugs into the CNS. Therefore, it is imperative to develop in silico methodologies to reliably identify potential lead compounds that can penetrate the blood-brain barrier (BBB) and help to thoroughly understand the role of different physicochemical properties fundamental to the BBB permeation of molecules. In this study, we have analysed the chemical space of the CNS drugs and compared it to the non-CNS-approved drugs. Additionally, we have collected a feature selection dataset from Muehlbacher et al. (J Comput Aided Mol Des 25(12):1095-1106, 2011. 10.1007/s10822-011-9478-1) and an in-house dataset. This information was utilised to design a molecular fingerprint that was used to train machine learning (ML) models. The best-performing models reported in this study achieved accuracies of 0.997 and 0.98, sensitivities of 1.0 and 0.992, specificities of 0.971 and 0.962, MCCs of 0.984 and 0.958, and ROC-AUCs of 0.997 and 0.999 on an imbalanced and a balanced dataset, respectively. They demonstrated overall good accuracies and sensitivities in the blind validation dataset. The reported models can be applied for fast and early screening drug-like molecules with BBB potential. Furthermore, the bbbPythoN package can be used by the research community to both produce the BBB-specific molecular fingerprints and employ the models mentioned earlier for BBB-permeability prediction.

Keywords: Blood–brain barrier; CNS; CNS drug discovery; Central nervous system; Computational prediction; Machine learning; Model; Permeability.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The figure above shows scatterplots of six combinations of the descriptors rotatable bonds, H-bond acceptors, H-bond donors, Log P, molecular weight and TPSA, and displays a clear conservation of CNS drugs around specific values of the aforementioned descriptors. The blue and black lines correspond to the median of the respective descriptor.
Figure 2
Figure 2
An example of histograms resulting from binning of TopoPSA(NO) values for the class-wise split of the dataset as well as the combined histogram, in which each bins probability is the mean of the respective active and inactive class bins. As observable from the histograms and the given SE, the information content of a descriptor is larger for a wide and equal distribution of values.
Figure 3
Figure 3
Left: Scatterplot of Top 6 descriptors, black crosses correspond to test set, while gray crosses correspond to training set instances. Right: Distribution of TopoPSA(NO) and IC1 descriptors in training, test and complete dataset. Shows a strong similarity between value distributions of training and complete dataset.
Figure 4
Figure 4
Fingerprint Creation: Pseudocode showing encoding of descriptor values.
Figure 5
Figure 5
Table showing the three best performing models based on the imbalanced dataset and MI-DSE fingerprint and the best performing model based on the imbalanced dataset and Morgan 2 circular fingerprints. Both internal cross-validation (average with standard deviation) and external validation results are depicted. The highest values of each performance metric are marked in red.
Figure 6
Figure 6
Table Showing the three best performing models based on the balanced dataset and MI-DSE fingerprints. Both internal cross-validation (average with standard deviation) and external validation results are depicted.The highest values of each performance metric are marked in red.
Figure 7
Figure 7
2D descriptors: Scatterplots of approved drugs and CNS drugs via six combinations of best performing 2D descriptors, AATSC0s, AATSC0c, AATSC1c, TopoPSA(NO) and ATSC1c. Stark conservation of CNS drugs around values indicating low polarity can be observed.
Figure 8
Figure 8
3D descriptor: Scatterplots of approved drugs and CNS drugs via six combinations of best performing 3D descriptors, FPSA4, TPSA, PPSA4, PNSA3 and RASA. When compared with Fig. 5(2D) the CNS drugs are less conserved but tend to lower values for TPSA, FPSA4 and PPSA4—all relating to charged surface area—and higher values for RASA being a measure of hydrophobic surface area.
Figure 9
Figure 9
Histograms showing value distributions of TopoPSA(NO) descriptor for the positive and negative class and the respective MI-DSE scores. Left: Distributions for the original feature selection set. Right: distributions for dataset with an extended negative class.
Figure 10
Figure 10
Histograms of IC1 and ATSC1se values for active and inactive classes. As described above it can be seen that the active class samples are conserved around specific values, while the inactive class is more widely distributed. This indicates the ability of these descriptors to discern between the respective activity classes while also showing that active compounds tend to be both simpler (IC1) and less polar (ATSC1se) than inactive samples.

Similar articles

Cited by

References

    1. Deuschl G, Beghi E, Fazekas F, Varga T, Christoforidi KA, Sipido E, Bassetti CL, Vos T, Feigin VL. The burden of neurological diseases in Europe: An analysis for the global burden of disease study 2017. Lancet Public Health. 2020;5(10):e551–e567. doi: 10.1016/S2468-2667(20)30190-0. - DOI - PubMed
    1. Zhao YH, Abraham MH, Ibrahim A, Fish PV, Cole S, Lewis ML, de Groot MJ, Reynolds DP. Predicting penetration across the blood-brain barrier from simple descriptors and fragmentation schemes. J. Chem. Inf. Model. 2007;47(1):170–175. doi: 10.1021/ci600312d. - DOI - PubMed
    1. Barar J, Rafi MA, Pourseif MM, Omidi Y. Blood-brain barrier transport machineries and targeted therapy of brain diseases. BioImpacts BI. 2016;6(4):225–248. doi: 10.15171/bi.2016.30. - DOI - PMC - PubMed
    1. Ghose AK, Herbertz T, Hudkins RL, Dorsey BD, Mallamo JP. Knowledge-based, central nervous system (CNS) lead selection and lead optimization for CNS drug discovery. ACS Chem. Neurosci. 2012;3(1):50–68. doi: 10.1021/cn200100h. - DOI - PMC - PubMed
    1. Rankovic Z. CNS drug design: Balancing physicochemical properties for optimal brain exposure. J. Med. Chem. 2015;58(6):2584–2608. doi: 10.1021/jm501535r. - DOI - PubMed