Wfold: A new method for predicting RNA secondary structure with deep learning
- PMID: 39341115
- DOI: 10.1016/j.compbiomed.2024.109207
Wfold: A new method for predicting RNA secondary structure with deep learning
Abstract
Precise estimations of RNA secondary structures have the potential to reveal the various roles that non-coding RNAs play in regulating cellular activity. However, the mainstay of traditional RNA secondary structure prediction methods relies on thermos-dynamic models via free energy minimization, a laborious process that requires a lot of prior knowledge. Here, RNA secondary structure prediction using Wfold, an end-to-end deep learning-based approach, is suggested. Wfold is trained directly on annotated data and base-pairing criteria. It makes use of an image-like representation of RNA sequences, which an enhanced U-net incorporated with a transformer encoder can process effectively. Wfold eventually increases the accuracy of RNA secondary structure prediction by combining the benefits of self-attention mechanism's mining of long-range information with U-net's ability to gather local information. We compare Wfold's performance using RNA datasets that are within and across families. When trained and evaluated on different RNA families, it achieves a similar performance as the traditional methods, but dramatically outperforms the state-of-the-art methods on within-family datasets. Moreover, Wfold can also reliably forecast pseudoknots. The findings imply that Wfold may be useful for improving sequence alignment, functional annotations, and RNA structure modeling.
Keywords: Deep learning; Image-like representation; RNA secondary structure prediction; Self-attention mechanism; Unet.
Copyright © 2024 Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declaration of competing interest None Declared.
Similar articles
-
UFold: fast and accurate RNA secondary structure prediction with deep learning.Nucleic Acids Res. 2022 Feb 22;50(3):e14. doi: 10.1093/nar/gkab1074. Nucleic Acids Res. 2022. PMID: 34792173 Free PMC article.
-
A divide-and-conquer approach based on deep learning for long RNA secondary structure prediction: Focus on pseudoknots identification.PLoS One. 2025 Apr 25;20(4):e0314837. doi: 10.1371/journal.pone.0314837. eCollection 2025. PLoS One. 2025. PMID: 40279361 Free PMC article.
-
REDalign: accurate RNA structural alignment using residual encoder-decoder network.BMC Bioinformatics. 2024 Nov 5;25(1):346. doi: 10.1186/s12859-024-05956-7. BMC Bioinformatics. 2024. PMID: 39501155 Free PMC article.
-
Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.Methods Mol Biol. 2014;1097:125-41. doi: 10.1007/978-1-62703-709-9_7. Methods Mol Biol. 2014. PMID: 24639158 Review.
-
RNA structure prediction using deep learning - A comprehensive review.Comput Biol Med. 2025 Apr;188:109845. doi: 10.1016/j.compbiomed.2025.109845. Epub 2025 Feb 20. Comput Biol Med. 2025. PMID: 39983363 Review.
Cited by
-
Transformers in RNA structure prediction: A review.Comput Struct Biotechnol J. 2025 Mar 17;27:1187-1203. doi: 10.1016/j.csbj.2025.03.021. eCollection 2025. Comput Struct Biotechnol J. 2025. PMID: 40213272 Free PMC article. Review.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources