Review

. 2023 May 19;24(3):bbad153.

doi: 10.1093/bib/bbad153.

Machine learning for RNA 2D structure prediction benchmarked on experimental data

Marek Justyna¹, Maciej Antczak^{1

2}, Marta Szachniuk^{1

2}

Affiliations

¹ Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.
² Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland.

PMID: 37096592
PMCID: PMC10199776
DOI: 10.1093/bib/bbad153

Review

Machine learning for RNA 2D structure prediction benchmarked on experimental data

Marek Justyna et al. Brief Bioinform. 2023.

. 2023 May 19;24(3):bbad153.

doi: 10.1093/bib/bbad153.

Authors

Marek Justyna¹, Maciej Antczak^{1

2}, Marta Szachniuk^{1

2}

Affiliations

¹ Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.
² Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland.

PMID: 37096592
PMCID: PMC10199776
DOI: 10.1093/bib/bbad153

Abstract

Since the 1980s, dozens of computational methods have addressed the problem of predicting RNA secondary structure. Among them are those that follow standard optimization approaches and, more recently, machine learning (ML) algorithms. The former were repeatedly benchmarked on various datasets. The latter, on the other hand, have not yet undergone extensive analysis that could suggest to the user which algorithm best fits the problem to be solved. In this review, we compare 15 methods that predict the secondary structure of RNA, of which 6 are based on deep learning (DL), 3 on shallow learning (SL) and 6 control methods on non-ML approaches. We discuss the ML strategies implemented and perform three experiments in which we evaluate the prediction of (I) representatives of the RNA equivalence classes, (II) selected Rfam sequences and (III) RNAs from new Rfam families. We show that DL-based algorithms (such as SPOT-RNA and UFold) can outperform SL and traditional methods if the data distribution is similar in the training and testing set. However, when predicting 2D structures for new RNA families, the advantage of DL is no longer clear, and its performance is inferior or equal to that of SL and non-ML methods.

Keywords: RNA 2D structure prediction; algorithm benchmarking; deep learning; machine learning.

PubMed Disclaimer

Figures

**Figure 1**
Interaction Network Fidelity (INF) computed for canonical base pairs, in Experiment I (predicting representatives of equivalence classes). Colors refer to groups of algorithms: blue – deep learning (DL), orange – shallow learning (SL) and green – non-ML algorithms.

**Figure 2**
Interaction Network Fidelity (INF) computed for canonical base pairs, in Experiment II (predicting selected Rfam sequences). Colors refer to groups of algorithms: blue – deep learning (DL), orange – shallow learning (SL) and green – non-ML algorithms.

**Figure 3**
Interaction Network Fidelity (INF) computed for canonical base pairs, in Experiment III (predicting new Rfam families). Colors refer to groups of algorithms: blue – deep learning (DL), orange – shallow learning (SL) and green – non-ML algorithms.

See this image and copyright information in PMC

Cited by

RNA secondary structure prediction by conducting multi-class classifications.
Yang J, Sato K, Loza M, Park SJ, Nakai K. Yang J, et al. Comput Struct Biotechnol J. 2025 Apr 4;27:1449-1459. doi: 10.1016/j.csbj.2025.04.001. eCollection 2025. Comput Struct Biotechnol J. 2025. PMID: 40256169 Free PMC article.
Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction.
Bahai A, Kwoh CK, Mu Y, Li Y. Bahai A, et al. PLoS Comput Biol. 2024 Dec 30;20(12):e1012715. doi: 10.1371/journal.pcbi.1012715. eCollection 2024 Dec. PLoS Comput Biol. 2024. PMID: 39775239 Free PMC article.
Analysis of natural structures and chemical mapping data reveals local stability compensation in RNA.
Cornwell-Arquitt RL, Nigh R, Hathaway MT, Yesselman JD, Hendrix DA. Cornwell-Arquitt RL, et al. Nucleic Acids Res. 2025 Jun 20;53(12):gkaf565. doi: 10.1093/nar/gkaf565. Nucleic Acids Res. 2025. PMID: 40568944 Free PMC article.
Comprehensive datasets for RNA design, machine learning, and beyond.
Badura J, Rybarczyk A, Zok T. Badura J, et al. Sci Rep. 2025 Jul 1;15(1):21417. doi: 10.1038/s41598-025-07041-2. Sci Rep. 2025. PMID: 40594473 Free PMC article.
Comparative analysis of RNA 3D structure prediction methods: towards enhanced modeling of RNA-ligand interactions.
Nithin C, Kmiecik S, Błaszczyk R, Nowicka J, Tuszyńska I. Nithin C, et al. Nucleic Acids Res. 2024 Jul 22;52(13):7465-7486. doi: 10.1093/nar/gkae541. Nucleic Acids Res. 2024. PMID: 38917327 Free PMC article.

See all "Cited by" articles

References

1. Mortimer SA, Kidwell MA, Doudna JA. Insights into RNA structure and function from genome-wide studies. Nat Rev Genet 2014;15(7):469–79. - PubMed
1. Meister G, Tuschl T. Mechanisms of gene silencing by double-stranded RNA. Nature 2004;431:343–9. - PubMed
1. Serganov A, Nudler E. A decade of riboswitches. Cell 2013;152(1–2):17–24. - PMC - PubMed
1. Wu L, Belasco JG. Let me count the ways: mechanisms of gene regulation by miRNAs and siRNAs. Mol Cell 2008;29(1):1–7. - PubMed
1. Zou Q, Li J, Hong Q, et al. . Prediction of microRNA-disease associations based on social network analysis methods. Biomed Res Int 2015;2015:810514. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine learning for RNA 2D structure prediction benchmarked on experimental data

Affiliations

Machine learning for RNA 2D structure prediction benchmarked on experimental data

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources