Prediction of the RBP binding sites on lncRNAs using the high-order nucleotide encoding convolutional neural network
- PMID: 31323206
- DOI: 10.1016/j.ab.2019.113364
Abstract
Long non-coding RNAs (lncRNAs) play an important role in cells through their interactions with RNA-binding proteins (RBPs). Identifying RBP binding sites on lncRNA chains can help to elucidate post-transcriptional regulatory mechanisms and to explore the pathogenesis of cancers and their possible roles in other diseases. Although many genome-wide experimental techniques can identify RNA-protein interactions and detect binding sites on RNA chains, they remain time-consuming, labor-intensive, and costly. Many computational methods have therefore been developed to predict RBP binding sites by integrating RNA sequence, structure, and domain-specific features. However, current approaches to predicting RBP binding sites on RNA chains do not consider the dependencies among nucleotides. In this work, we propose a higher-order nucleotide encoding convolutional neural network-based method (HOCNNLB) to predict RBP binding sites on lncRNA chains. HOCNNLB first applies a higher-order one-hot encoding strategy that encodes lncRNA sequences while accounting for the dependence among nucleotides; the encoded sequences are then fed into a convolutional neural network (CNN) to predict the RBP binding sites. We evaluate HOCNNLB on 31 experimental datasets of 12 lncRNA-binding proteins. HOCNNLB achieves an average AUC of 0.953, which is 0.247 and 0.175 higher than those of iDeepS and DeepBind, respectively. Its average accuracy is 90.2%, which is 26.8% and 19.5% higher than those of iDeepS and DeepBind, respectively. These results demonstrate that HOCNNLB can reliably predict RBP binding sites on lncRNA chains and that it outperforms state-of-the-art methods. The source code of HOCNNLB and the datasets used in this work are available to academic users at https://github.com/NWPU-903PR/HOCNNLB.
Keywords: Binding site; Convolutional neural network; Higher-order one-hot encoding; lncRNA-binding protein.
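The higher-order one-hot encoding described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the order used, how sequence ends are padded, or how ambiguous bases are handled, so the function names (`kmer_index`, `high_order_one_hot`), the default order `k=2`, the overlapping-window scheme, and the all-zero row for unknown symbols are all assumptions made here for illustration. The idea shown is that each position is encoded as a one-hot vector over all 4^k possible k-mers rather than over single nucleotides, so that neighboring-nucleotide dependence is visible to the network input.

```python
from itertools import product

ALPHABET = "ACGU"  # RNA alphabet; T is mapped to U below (assumption)


def kmer_index(k):
    """Map every possible k-mer over ALPHABET to a column index."""
    return {"".join(p): i for i, p in enumerate(product(ALPHABET, repeat=k))}


def high_order_one_hot(seq, k=2):
    """Encode seq as a list of one-hot rows, one per overlapping k-mer.

    Each row has 4**k entries. A k-mer containing a symbol outside
    ALPHABET (for example N) becomes an all-zero row (assumption).
    """
    seq = seq.upper().replace("T", "U")
    index = kmer_index(k)
    width = len(ALPHABET) ** k
    rows = []
    for start in range(len(seq) - k + 1):
        row = [0] * width
        col = index.get(seq[start:start + k])
        if col is not None:
            row[col] = 1
        rows.append(row)
    return rows


if __name__ == "__main__":
    encoded = high_order_one_hot("ACGU", k=2)
    # One row per overlapping dinucleotide, each row 16 entries wide.
    print(len(encoded), len(encoded[0]))
```

With `k=1` this reduces to ordinary one-hot encoding over A, C, G, U. A matrix of this shape (sequence positions by 4^k channels) is the kind of input that a one-dimensional CNN could then scan with convolutional filters.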
Copyright © 2019 Elsevier Inc. All rights reserved.