Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 12;13(4):677.
doi: 10.3390/genes13040677.

m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features

Affiliations

m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features

Zhizhou He et al. Genes (Basel). .

Abstract

5-methylcytosine (m5C) is a common post-transcriptional modification observed in a variety of RNAs. m5C has been demonstrated to be important in a variety of biological processes, including RNA structural stability and metabolism. Driven by the importance of m5C modification, many projects focused on the m5C sites prediction were reported before. To better understand the upstream and downstream regulation of m5C, we present a bioinformatics framework, m5CRegpred, to predict the substrate of m5C writer NSUN2 and m5C readers YBX1 and ALYREF for the first time. After features comparison, window lengths selection and algorism comparison on the mature mRNA model, our model achieved AUROC scores 0.869, 0.724 and 0.889 for NSUN2, YBX1 and ALYREF, respectively in an independent test. Our work suggests the substrate of m5C regulators can be distinguished and may help the research of m5C regulators in a special condition, such as substrates prediction of hyper- or hypo-expressed m5C regulators in human disease.

Keywords: 5-methylcytosine; machine learning; readers.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The workflow for m5CRegpred. The methylation sites and RNA Binding proteins (RBP) sites were obtained from four and two types of sequencing techniques, respectively. Eight kinds of encoding methods were considered in the project.
Figure 2
Figure 2
Full transcript model and mature mRNA model. To select negative sites, the unmodified sites and methylated sites un-regulated by NSUN2/YBX1/ALYREF from the intron and exons were both considered in the full transcript model; whereas the mature mRNA model only considered sites from exons. As most captured sequences during library preparation are exons (mature mRNA) due to polyA selection, the performance of full transcript model will be overestimated.
Figure 3
Figure 3
Performance of different length windows with PSNP encoding method.
Figure 4
Figure 4
Performance analysis on different machine learning algorithms. SVM represents for support vector machine, RF represents for random forest and GLMs represent for generalize linear model.
Figure 5
Figure 5
Motif discovery for the training data andfalse-negative sites of independent test data. For the training data, only the positive sites were used for motif discovery. The motif with width of 4 bp to 8 bp was scanned by STREME. The different motifs of ALYREF in training data may be due to the different data size, which contained 296 sequences in the full transcript model whereas only 137 in mature mRNA model.

Similar articles

Cited by

References

    1. Boccaletto P., Stefaniak F., Ray A., Cappannini A., Mukherjee S., Purta E., Kurkowska M., Shirvanizadeh N., Destefanis E., Groza P., et al. MODOMICS: A database of RNA modification pathways. 2021 update. Nucleic Acids Res. 2022;50:D231–D235. doi: 10.1093/nar/gkab1083. - DOI - PMC - PubMed
    1. Trixl L., Lusser A. The dynamic RNA modification 5-methylcytosine and its emerging role as an epitranscriptomic mark. Wiley Interdiscip. Rev. RNA. 2019;10:e1510. doi: 10.1002/wrna.1510. - DOI - PMC - PubMed
    1. Tang Y., Gao C.C., Gao Y., Yang Y., Shi B., Yu J.L., Lyu C., Sun B.F., Wang H.L., Xu Y., et al. OsNSUN2-Mediated 5-Methylcytosine mRNA Modification Enhances Rice Adaptation to High Temperature. Dev. Cell. 2020;53:272–286.e277. doi: 10.1016/j.devcel.2020.03.009. - DOI - PubMed
    1. Heissenberger C., Liendl L., Nagelreiter F., Gonskikh Y., Yang G., Stelzer E.M., Krammer T.L., Micutkova L., Vogt S., Kreil D.P., et al. Loss of the ribosomal RNA methyltransferase NSUN5 impairs global protein synthesis and normal growth. Nucleic Acids Res. 2019;47:11807–11825. doi: 10.1093/nar/gkz1043. - DOI - PMC - PubMed
    1. Tuorto F., Liebers R., Musch T., Schaefer M., Hofmann S., Kellner S., Frye M., Helm M., Stoecklin G., Lyko F. RNA cytosine methylation by Dnmt2 and NSun2 promotes tRNA stability and protein synthesis. Nat. Struct. Mol. Biol. 2012;19:900–905. doi: 10.1038/nsmb.2357. - DOI - PubMed

Publication types

LinkOut - more resources