Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Sep 2;14(9):1124-1137.
doi: 10.1080/15476286.2016.1251543. Epub 2016 Oct 28.

Next-generation sequencing technologies for detection of modified nucleotides in RNAs

Affiliations
Review

Next-generation sequencing technologies for detection of modified nucleotides in RNAs

Schraga Schwartz et al. RNA Biol. .

Abstract

Our ability to map and quantify RNA modifications at a genome-wide scale have revolutionized our understanding of the pervasiveness and dynamic regulation of diverse RNA modifications. Recent efforts in the field have demonstrated the presence of modified residues in almost any type of cellular RNA. Next-generation sequencing (NGS) technologies are the primary choice for transcriptome-wide RNA modification mapping. Here we provide an overview of approaches for RNA modification detection based on their RT-signature, specific chemicals, antibody-dependent (Ab) enrichment, or combinations thereof. We further discuss sources of artifacts in genome-wide modification maps, and experimental and computational considerations to overcome them. The future in this field is tightly linked to the development of new specific chemical reagents, highly specific Ab against RNA modifications and use of single-molecule RNA sequencing techniques.

Keywords: Bioinformatics; RNA maturation; RNA modifications; epitranscriptome; next-generation sequencing.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Second- and third-generation sequencing technologies (NGS and NNGS). Second generation sequencing uses cluster amplification of DNA strands prior to fluorescent or potentiometric sequencing (A). Multiple molecules generate detectable signal. Third-generation sequencing technologies use single molecule sequencing with specially designed fluorescent detection systems (Zero-mode Waveguides, ZMW), or nanopore sequencing using exonuclease or DNA polymerase activities (B).
Figure 2.
Figure 2.
Library preparation issues. Detection of RNA modification relies on exact determination of cDNA 3′-end (A). Priming at the 3′-end of RNA depends on its size. Short RNAs (typically < 100 - 200 nt) require adapter ligation step prior to primer annealing (left), while random priming (right) can be used for long RNA species (B). Methods for treatment of cDNA 3′-end for exact determination of RT stop. Four common techniques have been developed. One - single-strand DNA adapter ligation, 2 - oligonucleotide tailing terminal transferase followed by dsDNA primer ligation, 3 - CircLigase protocol with 5′-phosphorylated cDNA followed by second strand synthesis, and 4 - templated cDNA extension with 3′-blocked NNNNNN primer. In all cases the final step includes PCR amplification and appropriate barcoding.
Figure 3.
Figure 3.
Three classes of NGS-based techniques for detection and mapping of RNA modifications. Class I methods are based on “natural” RT-signature generated by the modified nucleotide. Such signature may include RT-arrest (interpreted as a coverage drop) at the modification site or/and nucleotide misincorporation at the same position (left). Class II methods are similar to Class I, but the RNA modification is RT-silent and the visible signature is induced via appropriate chemical treatment (middle). Class III techniques are based on enrichment of RNA fragments containing the RNA modification with a specific Ab. In these cases, the position of modification is typically not determined at single nucleotide resolution, but rather as an enriched region (right).
Figure 4.
Figure 4.
Applications of Class (I)techniques to A-to-(I)editing and m1A. Inosine, which is derived from A by RNA editing, generates “misincorporation” into cDNA, due to its base-pairing to C. The RT-signature contains only the “misincorporated” nucleotide compared to the reference sequence (left). m1A and other modified nucleotides with altered Watson-Crick edge generate complex RT-signatures composed of both RT-arrest and misincorporation at the modification site (right).
Figure 5.
Figure 5.
Applications of Class II techniques to RT-silent pseudouridine (Ψ) and m5C. Pseudouridine is reactive with water-soluble carbodiimide (CMCT) and forms stable adduct, while U-CMC adducts are removed by alkaline treatment. The resulting Ψ-CMC generates RT-arrest, detectable in the sequencing profile (left). 5-methylcytosine (m5C) is RT silent, but its detection is based on its insensitivity to bisulfite deamination. All C residues in RNA are deaminated by bisulfite, while m5C remains non-deaminated after treatment. The presence of residual C is thus detected by sequencing (right).
Figure 6.
Figure 6.
Complex cases of Class II protocols. Application for 2′-O-methylated residues, m1A and Inosine. Left: Detection of 2′-O-methylation is based on the 2′-O-Me-mediated protection of 3′-adjacent phosphodiester bond from nucleophilic cleavage. RNA is first randomly fragmented and a sequencing library is prepared from all generated fragments. Calculation of 5′-end (or cumulated 5′- and 3′-end) coverage reveals characteristic “gaps” one nucleotide downstream of the 2′-O-methylated site. The depth of the gap is proportional to methylation level of the nucleotide. Center: Conversion of m1A to m6A by Dimroth rearrangement. m1A residues present in RNA generate characteristic RT-signature, which disappears after alkaline treatment catalyzing m1A-to-m6A conversion. Comparison of 2 profiles allows identification of m1A site. Right: Specific Inosine detection by reaction with acrylonitrile (ICE-Seq). As discussed above, Inosine generates a simple RT-signature, which can be confounded with SNPs or with sequencing error. Acrylonitrile treatment converts Inosine into an RT-arresting residue, detected by characteristic coverage drop and misincorporation, thus confirming the presence of modified residue (right).
Figure 7.
Figure 7.
Approaches combining pulldown and specific chemistries. Left: Nucleotide-resolution detection of m1A residues by combination of MeRIP and iCLIP. m1A-containing RNA fragments are enriched after covalent cross-linking with specific anti m1A-Ab. After Ab removal by protease, the resulting covalent adducts are used for primer extension. Abortive RT products correspond to neighboring nucleotides around m1A site. Right: Enrichment of pseudouridine-containing RNA fragments using bi-functional “clickable” CMCT. RNA is first randomly fragmented and subjected to reaction with soluble carbodiimide CMCT bearing “clickable” azide (N3) group. After “click” reaction with alkyne-modified biotin, modified fragments are enriched by avidin-beads pull-out. Modified residues are detected based on their RT-arrest signature.
Figure 8.
Figure 8.
Sources of artifacts in Class I, II and III techniques. Left: Identification of RT-signatures in Class I methods may be affected by the presence of genomic SNPs, by possible mis-aligned nucleotides at exon-intron borders and by errors in the sequencing data set. Center: Class II approaches may generate false-positive signals due to strong non-specific cleavage sites in RNAs, due to mis-alignment of some reads to repetitive RNA sequences, due to unannotated transcription sites. Right: Class III approaches based on Ab enrichment may suffer from non-specific enrichment signals, from Ab promiscuity or mis-alignments. Important controls for these approaches include the coverage profile in the input (top trace), and the enrichment profile for KO or deleted strain (middle). Only specific peaks present in WT sample and absent in control traces should be considered as candidates. The insert shows that the position of the modified residue does not always correspond to the maximal enrichment.

References

    1. Lee Y, Rio DC. Mechanisms and Regulation of Alternative Pre-mRNA Splicing. Annu Rev Biochem 2015; 84:291-323; PMID:25784052; http://dx.doi.org/10.1146/annurev-biochem-060614-034316 - DOI - PMC - PubMed
    1. Licht K, Jantsch MF. Rapid and dynamic transcriptome regulation by RNA editing and RNA modifications. J Cell Biol 2016; 213:15-22; PMID:27044895; http://dx.doi.org/10.1083/jcb.201511041 - DOI - PMC - PubMed
    1. Wu R, Jiang D, Wang Y, Wang X. N (6)-methyladenosine (m(6)A) methylation in mRNA with A dynamic and reversible epigenetic modification. Mol Biotechnol 2016; 58:450-9; PMID:27179969; http://dx.doi.org/10.1007/s12033-016-9947-9 - DOI - PubMed
    1. Jia G, Fu Y, He C. Reversible RNA adenosine methylation in biological regulation. Trends Genet 2013; 29:108-15; PMID:23218460; http://dx.doi.org/10.1016/j.tig.2012.11.003 - DOI - PMC - PubMed
    1. El Yacoubi B, Bailly M, de Crécy-Lagard V. Biosynthesis and function of posttranscriptional modifications of transfer RNAs. Annu Rev Genet 2012; 46:69-95; PMID:22905870; http://dx.doi.org/10.1146/annurev-genet-110711-155641 - DOI - PubMed

Publication types