HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m7 G Site Disease Association Prediction
- PMID: 33747055
- PMCID: PMC7970120
- DOI: 10.3389/fgene.2021.655284
HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m7 G Site Disease Association Prediction
Abstract
N7-methylguanosine (m7G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m7G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffective for the identification of disease-related m7G sites. Thus, a heterogeneous network method based on Convolutional Neural Networks (HN-CNN) has been proposed to predict unknown associations between m7G sites and diseases. HN-CNN constructs a heterogeneous network with m7G site similarity, disease similarity, and disease-associated m7G sites to formulate features for m7G site-disease pairs. Next, a convolutional neural network (CNN) obtains multidimensional and irrelevant features prominently. Finally, XGBoost is adopted to predict the association between m7G sites and diseases. The performance of HN-CNN is compared with Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), as well as Gradient Boosting Decision Tree (GBDT) through 10-fold cross-validation. The average AUC of HN-CNN is 0.827, which is superior to others.
Keywords: XGBoost; convolutional neural network; diseases; heterogeneous network; m7G sites.
Copyright © 2021 Zhang, Chen, Ma and Liu.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
References
-
- Chang C. C., Lin C. J. (2011). LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2:27. 10.1145/1961189.1961199 - DOI
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
