MST-m6A: A Novel Multi-Scale Transformer-based Framework for Accurate Prediction of m6A Modification Sites Across Diverse Cellular Contexts
- PMID: 39510345
- DOI: 10.1016/j.jmb.2024.168856
MST-m6A: A Novel Multi-Scale Transformer-based Framework for Accurate Prediction of m6A Modification Sites Across Diverse Cellular Contexts
Abstract
N6-methyladenosine (m6A) modification, a prevalent epigenetic mark in eukaryotic cells, is crucial in regulating gene expression and RNA metabolism. Accurately identifying m6A modification sites is essential for understanding their functions within biological processes and the intricate mechanisms that regulate them. Recent advances in high-throughput sequencing technologies have enabled the generation of extensive datasets characterizing m6A modification sites at single-nucleotide resolution, leading to the development of computational methods for identifying m6A RNA modification sites. However, most current methods focus on specific cell lines, limiting their generalizability and practical application across diverse biological contexts. To address the limitation, we propose MST-m6A, a novel approach for identifying m6A modification sites with higher accuracy across various cell lines and tissues. MST-m6A utilizes a multi-scale transformer-based architecture, employing dual k-mer tokenization to capture rich feature representations and global contextual information from RNA sequences at multiple levels of granularity. These representations are then effectively combined using a channel fusion mechanism and further processed by a convolutional neural network to enhance prediction accuracy. Rigorous validation demonstrates that MST-m6A significantly outperforms conventional machine learning models, deep learning models, and state-of-the-art predictors. We anticipate that the high precision and cross-cell-type adaptability of MST-m6A will provide valuable insights into m6A biology and facilitate advancements in related fields. The proposed approach is available at https://github.com/cbbl-skku-org/MST-m6A/ for prediction and reproducibility purposes.
Keywords: BERT; N6-methyladenosine modification; convolutional neural network; feature fusion; transformer.
Copyright © 2024 Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
m6A-SPP: Identification of RNA N6-methyladenosine modification sites through multi-source biological features and a hybrid deep learning architecture.Int J Biol Macromol. 2025 Jun;316(Pt 2):144789. doi: 10.1016/j.ijbiomac.2025.144789. Epub 2025 May 29. Int J Biol Macromol. 2025. PMID: 40449782
-
Injecting structure-aware insights for the learning of RNA sequence representations to identify m6A modification sites.PeerJ. 2025 Feb 24;13:e18878. doi: 10.7717/peerj.18878. eCollection 2025. PeerJ. 2025. PMID: 40017651 Free PMC article.
-
DNN-m6A: A Cross-Species Method for Identifying RNA N6-Methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion.Genes (Basel). 2021 Feb 28;12(3):354. doi: 10.3390/genes12030354. Genes (Basel). 2021. PMID: 33670877 Free PMC article.
-
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112. Brief Bioinform. 2020. PMID: 31714956 Review.
-
Functions of RNA N6-methyladenosine modification in cancer progression.Mol Biol Rep. 2019 Apr;46(2):2567-2575. doi: 10.1007/s11033-019-04655-4. Epub 2019 Mar 25. Mol Biol Rep. 2019. PMID: 30911972 Review.
Cited by
-
Hybrid representation learning for human m6A modifications with chromosome-level generalizability.Bioinform Adv. 2025 Jul 14;5(1):vbaf170. doi: 10.1093/bioadv/vbaf170. eCollection 2025. Bioinform Adv. 2025. PMID: 40708868 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous