Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 25:14:931729.
doi: 10.3389/fnagi.2022.931729. eCollection 2022.

Alzheimer-Compound Identification Based on Data Fusion and forgeNet_SVM

Affiliations

Alzheimer-Compound Identification Based on Data Fusion and forgeNet_SVM

Bin Yang et al. Front Aging Neurosci. .

Erratum in

Abstract

Rapid screening and identification of potential candidate compounds are very important to understand the mechanism of drugs for the treatment of Alzheimer's disease (AD) and greatly promote the development of new drugs. In order to greatly improve the success rate of screening and reduce the cost and workload of research and development, this study proposes a novel Alzheimer-related compound identification algorithm namely forgeNet_SVM. First, Alzheimer related and unrelated compounds are collected using the data mining method from the literature databases. Three molecular descriptors (ECFP6, MACCS, and RDKit) are utilized to obtain the feature sets of compounds, which are fused into the all_feature set. The all_feature set is input to forgeNet_SVM, in which forgeNet is utilized to provide the importance of each feature and select the important features for feature extraction. The selected features are input to support vector machines (SVM) algorithm to identify the new compounds in Traditional Chinese Medicine (TCM) prescription. The experiment results show that the selected feature set performs better than the all_feature set and three single feature sets (ECFP6, MACCS, and RDKit). The performances of TPR, FPR, Precision, Specificity, F1, and AUC reveal that forgeNet_SVM could identify more accurately Alzheimer-related compounds than other classical classifiers.

Keywords: Alzheimer; data fusion; feature selection; machine learning; network pharmacology; virtual screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
The flowchart of forgeNet algorithm.
Figure 2
Figure 2
Flowchart of Alzheimer-related active compound identification by forgeNet_SVM.
Figure 3
Figure 3
Performances of forgeNet_SVM with the different numbers of features.
Figure 4
Figure 4
ROC curves and AUC performances of our method with different feature sets for Alzheimer-related compound identification with Dat1.
Figure 5
Figure 5
ROC curves and AUC performances of our method with different feature sets for Alzheimer-related compound identification with Dat2.

Similar articles

Cited by

References

    1. Almeida O. P., Crocco E. I. (2000). Perception of cognitive deficits and behavior disorders in patients with Alzheimer's disease. Arq. Neuropsiquiatr. 58, 292–299. 10.1590/S0004-282X2000000200015 - DOI - PubMed
    1. An H. M., Huang D. R., Yang H., Liu X. G., Du J., Li Y., et al. . (2020). Comprehensive chemical profiling of Jia-Wei-Qi-Fu-Yin and its network pharmacology-based analysis on Alzheimer's disease. J. Pharm. Biomed. Anal. 189, 113467. 10.1016/j.jpba.2020.113467 - DOI - PubMed
    1. Berger S. I., Iyengar R. (2009). Network analyses in systems pharmacology. Bioinformatics. 25, 2466–2472. 10.1093/bioinformatics/btp465 - DOI - PMC - PubMed
    1. Breiman L. (2001). Random forest. Mach. Learn. 45, 5–32. 10.1023/A:1010933404324 - DOI
    1. Breiman L., Friedman J. H., Olshen R. A., Stone C. J. (1984). Classification and Regression Trees (CART). Biometrics. 40, 358. 10.2307/2530946 - DOI

LinkOut - more resources