Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct 7:9:1000205.
doi: 10.3389/fmolb.2022.1000205. eCollection 2022.

Artificial intelligence methods enhance the discovery of RNA interactions

Affiliations
Review

Artificial intelligence methods enhance the discovery of RNA interactions

G Pepe et al. Front Mol Biosci. .

Abstract

Understanding how RNAs interact with proteins, RNAs, or other molecules remains a challenge of main interest in biology, given the importance of these complexes in both normal and pathological cellular processes. Since experimental datasets are starting to be available for hundreds of functional interactions between RNAs and other biomolecules, several machine learning and deep learning algorithms have been proposed for predicting RNA-RNA or RNA-protein interactions. However, most of these approaches were evaluated on a single dataset, making performance comparisons difficult. With this review, we aim to summarize recent computational methods, developed in this broad research area, highlighting feature encoding and machine learning strategies adopted. Given the magnitude of the effect that dataset size and quality have on performance, we explored the characteristics of these datasets. Additionally, we discuss multiple approaches to generate datasets of negative examples for training. Finally, we describe the best-performing methods to predict interactions between proteins and specific classes of RNA molecules, such as circular RNAs (circRNAs) and long non-coding RNAs (lncRNAs), and methods to predict RNA-RNA or RNA-RBP interactions independently of the RNA type.

Keywords: RNA; RNA interaction predictors; RNA secondary structure; RNA sequence; deep learning; embedding; machine learning; natural language processing.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Overlap between the four RPI datasets generated from PDB RNA-protein complexes.

References

    1. Adinolfi M., Pietrosanto M., Parca L., Ausiello G., Ferrè F., Helmer-Citterich M. (2019). Discovering sequence and structure landscapes in RNA interaction motifs. Nucleic Acids Res. 47, 4958–4969. 10.1093/nar/gkz250 - DOI - PMC - PubMed
    1. Akbaripour-Elahabad M., Zahiri J., Rafeh R., Eslami M., Azari M. (2016). rpiCOOL: A tool for in silico RNA–protein interaction detection using random forest. J. Theor. Biol. 402, 1–8. 10.1016/j.jtbi.2016.04.025 - DOI - PubMed
    1. Alipanahi B., Delong A., Weirauch M. T., Frey B. J. (2015). Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 33, 831–838. 10.1038/nbt.3300 - DOI - PubMed
    1. Bai Y., Dai X., Ye T., Zhang P., Yan X., Gong X., et al. (2019). PlncRNADB: A repository of plant lncRNAs and lncRNA-RBP protein interactions. Curr. Bioinform. 14, 621–627. 10.2174/1574893614666190131161002 - DOI
    1. Burley S. K., Bhikadiya C., Bi C., Bittrich S., Chen L., Crichlow G. V., et al. (2021). RCSB protein data bank: Powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic Acids Res. 49, D437–D451. 10.1093/nar/gkaa1038 - DOI - PMC - PubMed

LinkOut - more resources