Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 May 3:2023.05.03.539298.
doi: 10.1101/2023.05.03.539298.

Common Analysis of Direct RNA SequencinG CUrrently Leads to Misidentification of 5-Methylcytosine Modifications at GCU Motifs

Affiliations

Common Analysis of Direct RNA SequencinG CUrrently Leads to Misidentification of 5-Methylcytosine Modifications at GCU Motifs

Kaylee J Watson et al. bioRxiv. .

Update in

Abstract

RNA modifications, such as méthylation, can be detected with Oxford Nanopore Technologies direct RNA sequencing. One commonly used tool for detecting 5-methylcytosine (m5C) modifications is Tombo, which uses an "Alternative Model" to detect putative modifications from a single sample. We examined direct RNA sequencing data from diverse taxa including virus, bacteria, fungi, and animals. The algorithm consistently identified a 5-methylcytosine at the central position of a GCU motif. However, it also identified a 5-methylcytosine in the same motif in fully unmodified in vitro transcribed RNA, suggesting that this a frequent false prediction. In the absence of further validation, several published predictions of 5-methylcytosine in human coronavirus and human cerebral organoid RNA in a GCU context should be reconsidered.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
ONT direct RNA sequencing and GCU motif detection. A Schematic showing how RNA molecules are directly sequenced with ONT, followed by basecalling of the signal data produced by changes in ionic current, mapping to a reference, and detection of modified bases. Adapted from “Nanopore Sequencing”, by BioRender.com (2022). Retrieved from https://app.biorender.com/biorender-templates. B MEME Suite motifs overrepresented in the 10-nucleotide sequences surrounding the top 1000 putative modifications detected by the Tombo m5C “Alternative Model” for B. malayi, D. ananassae, C. albicans, and E. coli.
Figure 2.
Figure 2.
Methylated fractions predicted by Tombo “Alternative Model” for m5C. A Density plots of the methylated fraction at all 3-mers containing a central cytosine in native and IVT viral RNA. Cytosine positions were filtered for depth > 10 and methylated fraction > 0 in both IVT and native samples. Histograms available in supplemental information. B Boxplot showing distributions of the methylated fractions detected by Tombo m5C “Alternative Model.” The methylated fraction was extracted for cytosines in animal, fungal, and bacterial RNA with depth > 100 and cytosines in native and IVT viral RNA with sequencing depth > 10. Methylated fractions were grouped based on non-GCU and GCU sequence context. Statistical significance based on p-value from two-tailed Z-test and Cohen’s d effect size.

References

    1. Garalde DR, Snell EA, Jachimowicz D, Sipos B, Lloyd JH, Bruce M, Pantic N, Admassu T, James P, Warland A, Jordan M, Ciccone J, Serra S, Keenan J, Martin S, McNeill L, Wallace EJ, Jayasinghe L, Wright C, Blasco J, Young S, Brocklebank D, Juul S, Clarke J, Heron AJ, Turner DJ. 2018. Highly parallel direct RNA sequencing on an array of nanopores. Nat Methods 15:201–206. - PubMed
    1. Smith AM, Jain M, Mulroney L, Garalde DR, Akeson M. 2019. Reading canonical and modified nucleobases in 16S ribosomal RNA using nanopore native RNA sequencing. PLoS One 14:e0216709. - PMC - PubMed
    1. Workman RE, Tang AD, Tang PS, Jain M, Tyson JR, Razaghi R, Zuzarte PC, Gilpatrick T, Payne A, Quick J, Sadowski N, Holmes N, de Jesus JG, Jones KL, Soulette CM, Snutch TP, Loman N, Paten B, Loose M, Simpson JT, Olsen HE, Brooks AN, Akeson M, Timp W. 2019. Nanopore native RNA sequencing of a human poly(A) transcriptome. Nat Methods 16:1297–1305. - PMC - PubMed
    1. Zhang S, Li R, Zhang L, Chen S, Xie M, Yang L, Xia Y, Foyer CH, Zhao Z, Lam HM. 2020. New insights into Arabidopsis transcriptome complexity revealed by direct sequencing of native RNAs. Nucleic Acids Res 48:7700–7711. - PMC - PubMed
    1. Kim D, Lee JY, Yang JS, Kim JW, Kim VN, Chang H. 2020. The Architecture of SARS-CoV-2 Transcriptome. Cell 181:914–921 e10. - PMC - PubMed

Publication types