circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier
- PMID: 34571539
- DOI: 10.1093/bib/bbab394
circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier
Abstract
Circular RNAs (circRNAs) generally bind to RNA-binding proteins (RBPs) to play an important role in the regulation of autoimmune diseases. Thus, it is crucial to study the binding sites of RBPs on circRNAs. Although many methods, including traditional machine learning and deep learning, have been developed to predict the interactions between RNAs and RBPs, and most of them are focused on linear RNAs. At present, few studies have been done on the binding relationships between circRNAs and RBPs. Thus, in-depth research is urgently needed. In the existing circRNA-RBP binding site prediction methods, circRNA sequences are the main research subjects, but the relevant characteristics of circRNAs have not been fully exploited, such as the structure and composition information of circRNA sequences. Some methods have extracted different views to construct recognition models, but how to efficiently use the multi-view data to construct recognition models is still not well studied. Considering the above problems, this paper proposes a multi-view classification method called DMSK based on multi-view deep learning, subspace learning and multi-view classifier for the identification of circRNA-RBP interaction sites. In the DMSK method, first, we converted circRNA sequences into pseudo-amino acid sequences and pseudo-dipeptide components for extracting high-dimensional sequence features and component features of circRNAs, respectively. Then, the structure prediction method RNAfold was used to predict the secondary structure of the RNA sequences, and the sequence embedding model was used to extract the context-dependent features. Next, we fed the above four views' raw features to a hybrid network, which is composed of a convolutional neural network and a long short-term memory network, to obtain the deep features of circRNAs. Furthermore, we used view-weighted generalized canonical correlation analysis to extract four views' common features by subspace learning. Finally, the learned subspace common features and multi-view deep features were fed to train the downstream multi-view TSK fuzzy system to construct a fuzzy rule and fuzzy inference-based multi-view classifier. The trained classifier was used to predict the specific positions of the RBP binding sites on the circRNAs. The experiments show that the prediction performance of the proposed method DMSK has been improved compared with the existing methods. The code and dataset of this study are available at https://github.com/Rebecca3150/DMSK.
Keywords: WGCCA; circRNA-RBP binding site prediction; deep feature learning; multi-view TSK fuzzy system.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
RNA-binding protein recognition based on multi-view deep feature and multi-label learning.Brief Bioinform. 2021 May 20;22(3):bbaa174. doi: 10.1093/bib/bbaa174. Brief Bioinform. 2021. PMID: 32808039
-
Collaborative deep learning improves disease-related circRNA prediction based on multi-source functional information.Brief Bioinform. 2023 Mar 19;24(2):bbad069. doi: 10.1093/bib/bbad069. Brief Bioinform. 2023. PMID: 36847701
-
Prediction of RBP binding sites on circRNAs using an LSTM-based deep sequence learning architecture.Brief Bioinform. 2021 Nov 5;22(6):bbab342. doi: 10.1093/bib/bbab342. Brief Bioinform. 2021. PMID: 34415289
-
Identification of circRNA-disease associations via multi-model fusion and ensemble learning.J Cell Mol Med. 2024 Apr;28(7):e18180. doi: 10.1111/jcmm.18180. J Cell Mol Med. 2024. PMID: 38506066 Free PMC article. Review.
-
Deep learning models for disease-associated circRNA prediction: a review.Brief Bioinform. 2022 Nov 19;23(6):bbac364. doi: 10.1093/bib/bbac364. Brief Bioinform. 2022. PMID: 36130259 Review.
Cited by
-
Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network.Front Genet. 2023 Oct 6;14:1283404. doi: 10.3389/fgene.2023.1283404. eCollection 2023. Front Genet. 2023. PMID: 37867600 Free PMC article.
-
CircGNB1 facilitates the malignant phenotype of GSCs by regulating miR-515-5p/miR-582-3p-XPR1 axis.Cancer Cell Int. 2023 Jul 5;23(1):132. doi: 10.1186/s12935-023-02970-2. Cancer Cell Int. 2023. PMID: 37407973 Free PMC article.
-
Research Progress of circRNAs in Glioblastoma.Front Cell Dev Biol. 2021 Nov 22;9:791892. doi: 10.3389/fcell.2021.791892. eCollection 2021. Front Cell Dev Biol. 2021. PMID: 34881248 Free PMC article. Review.
-
Emerging roles of circ_NRIP1 in tumor development and cancer therapy (Review).Oncol Lett. 2023 Jun 8;26(1):321. doi: 10.3892/ol.2023.13907. eCollection 2023 Jul. Oncol Lett. 2023. PMID: 37332333 Free PMC article. Review.
-
CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.BMC Bioinformatics. 2023 May 30;24(1):220. doi: 10.1186/s12859-023-05352-7. BMC Bioinformatics. 2023. PMID: 37254080 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources