Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 4;46(2):1360-1373.
doi: 10.3390/cimb46020087.

Role of Optimization in RNA-Protein-Binding Prediction

Affiliations

Role of Optimization in RNA-Protein-Binding Prediction

Shrooq Alsenan et al. Curr Issues Mol Biol. .

Abstract

RNA-binding proteins (RBPs) play an important role in regulating biological processes, such as gene regulation. Understanding their behaviors, for example, their binding site, can be helpful in understanding RBP-related diseases. Studies have focused on predicting RNA binding by means of machine learning algorithms including deep convolutional neural network models. One of the integral parts of modeling deep learning is achieving optimal hyperparameter tuning and minimizing a loss function using optimization algorithms. In this paper, we investigate the role of optimization in the RBP classification problem using the CLIP-Seq 21 dataset. Three optimization methods are employed on the RNA-protein binding CNN prediction model; namely, grid search, random search, and Bayesian optimizer. The empirical results show an AUC of 94.42%, 93.78%, 93.23% and 92.68% on the ELAVL1C, ELAVL1B, ELAVL1A, and HNRNPC datasets, respectively, and a mean AUC of 85.30 on 24 datasets. This paper's findings provide evidence on the role of optimizers in improving the performance of RNA-protein binding prediction.

Keywords: Bayesian optimizer; RNA-binding proteins; artificial intelligence; bioinformatics; convolutional neural network (CNN); deep learning; grid search; machine learning; optimization; proteins; random search optimizer.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 4
Figure 4
Random optimization vs. Bayesian optimizer.
Figure 1
Figure 1
RBP binding sites as a binary classification problem.
Figure 2
Figure 2
Model architecture.
Figure 3
Figure 3
One-hot encoding.
Figure 5
Figure 5
CNN model architecture.
Figure 6
Figure 6
Model performance comparison prior to and after optimization.
Figure 7
Figure 7
Training vs. testing loss for HNRNPC dataset.
Figure 8
Figure 8
Training vs. Testing loss for C22ORF28 dataset.
Figure 9
Figure 9
Training vs. testing loss for ELAVL1A dataset.
Figure 10
Figure 10
Training vs. testing AUC for AGO2 dataset.

Similar articles

References

    1. Ma H., Wen H., Xue Z., Li G., Zhang Z. RNANetMotif: Identifying sequence-structure RNA network motifs in RNA-protein binding sites. PLoS Comput. Biol. 2022;18:e1010293. doi: 10.1371/journal.pcbi.1010293. - DOI - PMC - PubMed
    1. Oliveira C., Faoro H., Alves L.R., Goldenberg S. RNA-binding proteins and their role in the regulation of gene expression in Trypanosoma cruzi and Saccharomyces cerevisiae. Genet. Mol. Biol. 2017;40:22–30. doi: 10.1590/1678-4685-gmb-2016-0258. - DOI - PMC - PubMed
    1. Qin H., Ni H., Liu Y., Yuan Y., Xi T., Li X., Zheng L. RNA-binding proteins in tumor progression. J. Hematol. Oncol. 2020;13:90. doi: 10.1186/s13045-020-00927-w. - DOI - PMC - PubMed
    1. Gebauer F., Schwarzl T., Valcárcel J., Hentze M.W. RNA-binding proteins in human genetic disease. Nat. Rev. Genet. 2021;22:185–198. doi: 10.1038/s41576-020-00302-y. - DOI - PubMed
    1. Li D., Zhang J., Li X., Chen Y., Yu F., Liu Q. Insights into lncRNAs in Alzheimer’s disease mechanisms. RNA Biol. 2021;18:1037–1047. doi: 10.1080/15476286.2020.1788848. - DOI - PMC - PubMed

LinkOut - more resources