Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 31;26(5):bbaf447.
doi: 10.1093/bib/bbaf447.

MCAMEF-BERT: an efficient deep learning method for RNA N7-methylguanosine site prediction via multi-branch feature integration

Affiliations

MCAMEF-BERT: an efficient deep learning method for RNA N7-methylguanosine site prediction via multi-branch feature integration

Junlei Yu et al. Brief Bioinform. .

Abstract

Accurate identification of N7-methylguanosine (m7G) modification sites plays a critical role in uncovering the regulatory mechanisms of various biological processes, including human development, tumor initiation, and progression. However, existing prediction methods still suffer from limited representational power, redundant feature fusion, insufficient utilization of biological prior knowledge, and poor interpretability. In this study, we propose a novel deep learning model named MCAMEF-BERT. This model adopts a parallel architecture that integrates both a DNABERT-2-based pretrained model branch and multiple traditional feature encoding branches, enabling comprehensive multi-perspective sequence feature extraction. To address the redundancy issue in feature fusion, we introduce a multi-channel attention module. Our model demonstrates superior accuracy and effectiveness on datasets from m7GHub, outperforming other state-of-the-art classifiers. Furthermore, we validate the interpretability of MCAMEF-BERT through in silico saturation mutagenesis experiments, and confirm its robustness in motif recognition. Moreover, its generalization capability is validated across diverse RNA modification site prediction tasks.

Keywords: in silico saturation mutagenesis experiments; DNABERT-2 pretrained model; N7-methylguanosine modification; multi-channel attention; multi-encoding fusion.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The architectures of MCAMEF-BERT. (A) The development process of MCAMEF-BERT. (B) Encoding module. Transforming the input sequences. (C) fine-tuned DNABERT-2 module. Fine-tuning the DNABERT-2 pre-trained model. (D) MEF module. Extracting feature vectors of various encoding methods. (E) Multi-Channel attention block. Removing redundancy and achieving effective feature fusion of embedded vectors. (F) Representation alignment module. Reconciling the differences in feature representations between the two branches. (G) Classification. Predicting the RNA modifications.
Figure 2
Figure 2
Comparison results of different input sequence lengths. Six input length (51 bp, 101 bp, 201 bp, 301 bp, 401 bp, and 501 bp) were utilized.
Figure 3
Figure 3
Comparison results of different classifiers. iRNA-m7G was evaluated with 41 bp and 201 bp inputs, m7GPredictor with 101 bp and 201 bp, moss-m7G with 201 bp and 501 bp, and MCAMEF-BERT with 201 bp input.
Figure 4
Figure 4
Comparison results of different model variants and UMAP visualization of different model variants. (A) The performances results of different model variants. (B-H) The UMAP visualization of different model variants (MCAMEF-BERT, W/o pretrain, W/o mca, W/o fuse, W/o LSTM, W/o transformer, W/o CNN).
Figure 5
Figure 5
Base-wise sensitivity and importance scores derived from the ISM experiment, along with representative sequence motifs identified by the MEME Suite tool. (A) Base sensitivity of the model at each position derived from the ISM experiment. Dashed boxes indicate regions overlapping with motifs identified in (C), while solid boxes highlight motifs to which the model exhibits high sensitivity. (B) Position-wise importance scores in model predictions derived from the ISM experiment. (C) Motifs widely identified in the dataset by STREME from the MEME Suite.

Similar articles

References

    1. Liu W-W, Zheng SQ, Li T. et al. RNA modifications in cellular metabolism: implications for metabolism-targeted therapy and immunotherapy. Signal Transduct Target Ther 2024;9:70. 10.1038/s41392-024-01777-5 - DOI - PMC - PubMed
    1. Enroth C, Poulsen LD, Iversen S. et al. Detection of internal N7-methylguanosine (m7G) RNA modifications by mutational profiling sequencing. Nucleic Acids Res 2019;47:e126–6. 10.1093/nar/gkz736 - DOI - PMC - PubMed
    1. Marchand V, Ayadi L, Ernst FGM. et al. AlkAniline-Seq: profiling of m7G and m3C RNA modifications at single nucleotide resolution. Angew Chem Int Ed 2018;57:16785–90. 10.1002/anie.201810946 - DOI - PubMed
    1. Huang H, Weng H, Deng X. et al. RNA modifications in cancer: functions, mechanisms, and therapeutic implications. Annual Review of Cancer Biology 2020;4:221–40. 10.1146/annurev-cancerbio-030419-033357 - DOI
    1. Zhang X, Zhu WY, Shen SY. et al. Biological roles of RNA m7G modification and its implications in cancer. Biol Direct 2023;18:58. 10.1186/s13062-023-00414-5 - DOI - PMC - PubMed

LinkOut - more resources