ELMo4m6A: A Contextual Language Embedding-Based Predictor for Detecting RNA N6-Methyladenosine Sites
- PMID: 35536814
- DOI: 10.1109/TCBB.2022.3173323
ELMo4m6A: A Contextual Language Embedding-Based Predictor for Detecting RNA N6-Methyladenosine Sites
Abstract
N6-methyladenosine (m6A) is a universal post-transcriptional modification of RNAs, and it is widely involved in various biological processes. Identifying m6A modification sites accurately is indispensable to further investigate m6A-mediated biological functions. How to better represent RNA sequences is crucial for building effective computational methods for detecting m6A modification sites. However, traditional encoding methods require complex biological prior knowledge and are time-consuming. Furthermore, most of the existing m6A sites prediction methods are limited to single species, and few methods are able to predict m6A sites across different species and tissues. Thus, it is necessary to design a more efficient computational method to predict m6A sites across multiple species and tissues. In this paper, we proposed ELMo4m6A, a contextual language embedding-based method for predicting m6A sites from RNA sequences without any prior knowledge. ELMo4m6A first learns embeddings of RNA sequences using a language model ELMo, then uses a hybrid convolutional neural network (CNN) and long short-term memory (LSTM) to identify m6A sites. The results of 5-fold cross-validation and independent testing demonstrate that ELMo4m6A is superior to state-of-the-art methods. Moreover, we applied integrated gradients to find potential sequence patterns contributing to m6A sites.
Similar articles
-
DNN-m6A: A Cross-Species Method for Identifying RNA N6-Methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion.Genes (Basel). 2021 Feb 28;12(3):354. doi: 10.3390/genes12030354. Genes (Basel). 2021. PMID: 33670877 Free PMC article.
-
MST-m6A: A Novel Multi-Scale Transformer-based Framework for Accurate Prediction of m6A Modification Sites Across Diverse Cellular Contexts.J Mol Biol. 2025 Mar 15;437(6):168856. doi: 10.1016/j.jmb.2024.168856. Epub 2024 Nov 6. J Mol Biol. 2025. PMID: 39510345
-
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112. Brief Bioinform. 2020. PMID: 31714956 Review.
-
bCNN-Methylpred: Feature-Based Prediction of RNA Sequence Modification Using Branch Convolutional Neural Network.Genes (Basel). 2021 Jul 28;12(8):1155. doi: 10.3390/genes12081155. Genes (Basel). 2021. PMID: 34440330 Free PMC article.
-
[Recent progresses in RNA N6-methyladenosine research].Yi Chuan. 2013 Dec;35(12):1340-51. doi: 10.3724/sp.j.1005.2013.01340. Yi Chuan. 2013. PMID: 24645343 Review. Chinese.
Cited by
-
Hybrid representation learning for human m6A modifications with chromosome-level generalizability.Bioinform Adv. 2025 Jul 14;5(1):vbaf170. doi: 10.1093/bioadv/vbaf170. eCollection 2025. Bioinform Adv. 2025. PMID: 40708868 Free PMC article.
-
RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30. Heliyon. 2025. PMID: 39897847 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources