Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 May 30;3(1):4.
doi: 10.1186/1758-907X-3-4.

Reducing ligation bias of small RNAs in libraries for next generation sequencing

Affiliations

Reducing ligation bias of small RNAs in libraries for next generation sequencing

Karim Sorefan et al. Silence. .

Abstract

Background: The use of nucleic acid-modifying enzymes has driven the rapid advancement in molecular biology. Understanding their function is important for modifying or improving their activity. However, functional analysis usually relies upon low-throughput experiments. Here we present a method for functional analysis of nucleic acid-modifying enzymes using next generation sequencing.

Findings: We demonstrate that sequencing data of libraries generated by RNA ligases can reveal novel secondary structure preferences of these enzymes, which are used in small RNA cloning and library preparation for NGS. Using this knowledge we demonstrate that the cloning bias in small RNA libraries is RNA ligase-dependent. We developed a high definition (HD) protocol that reduces the RNA ligase-dependent cloning bias. The HD protocol doubled read coverage, is quantitative and found previously unidentified microRNAs. In addition, we show that microRNAs in miRBase are those preferred by the adapters of the main sequencing platform.

Conclusions: Sequencing bias of small RNAs partially influenced which microRNAs have been studied in depth; therefore most previous small RNA profiling experiments should be re-evaluated. New microRNAs are likely to be found, which were selected against by existing adapters. Preference of currently used adapters towards known microRNAs suggests that the annotation of all existing small RNAs, including miRNAs, siRNAs and piRNAs, has been biased.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Scheme depicting the experimental approach and HD adapters. a Data were generated to analyse the sequence preferences of T4 Rnl1 and T4 Rnl2 using a degenerate RNA library (N21 RNA). b HD adapters include degenerate tags at the end of the adapters that allow the formation of stable secondary structures for more sequences and reduce RNA ligase-dependent sequence bias. Panel (c) shows the structure of miR-29b with the Illumina adapters (top) and some of the structures formed by HD adapters (bottom). We found 1,031 distinct structures originating from 12,479 tag combinations.
Figure 2
Figure 2
Sequencing cDNA generated from N21 RNA libraries. a Number of reads for the 100 most abundant sequences in the N21 libraries, prepared with Illumina (red) or HD adapters (blue). b-d Frequencies of predicted nucleotide base-pairing per position for N21 insert (b), N21 insert and 3’ adapter (c) and 5’ adapter, insert and 3’ adapter (d). In (c) and (d) vertical dotted line indicates ligation point. Red line denotes data obtained with Illumina protocol, blue line with HD protocol and grey line randomly generated sets of 21nt sequences. Bars indicate minimum and maximum values in all replicates. Horizontal bars at bottom indicate sequence region: green, insert; red, 3’ adapter; blue, 5’ adapter. For insert folding frequencies obtained with random sequences are more closely matched by HD data (R2 = 0.83) than by Illumina data (R2 = 0.60). e Comparison of T4 Rnl2 ligase activity on substrates with ss flaps of differing nucleotide lengths upstream or downstream of ligation site. In vitro ligation assay of RNA-DNA duplexes with either a nick (0NT) or ss flaps up- or downstream from the ligation site was carried out at 25°C for 30 min. Substrates with ss flaps >2nt in length upstream from the ligation site are inefficiently ligated. The diagram illustrates the position of the flaps, the fluorescein reporter group (star) and the backbone oligonucleotide (black). If ligation occurs the size of the nucleic acid attached to the fluorescein increases as visualised by 15% PAGE.
Figure 3
Figure 3
cDNA library preparation protocols distort miRNA research. a Comparison of change in miRNA level between wild-type and Dicer KO DLD cells obtained in Illumina (x axis) and HD samples (y axis). R2 = 0.62. b Number of known miRNAs found in DLD cells at different thresholds using Illumina or HD adapters. Regardless of chosen threshold, HD adapters identify more miRNAs. c Absolute quantification of eight known miRNAs (let-7i, miR-10a, miR-19b, miR-21, miR-25, miR-29b, miR-93, miR-375) obtained by Northern blot compared with number of times these miRNAs were sequenced using Illumina or HD adapters in DLD cell line. Data obtained with HD adapters correlates better with absolute quantifications (R2 = 0.70) than Illumina data (R2 = 0.12). d Number of PubMed citations and number of reads per experiment (data obtained from miRbase v17) of miRNAs conserved between mouse and human. MiRNAs with higher number of reads tend to be more extensively studied [R2 = 0.58, p-value < 10(−15)]. e-f Distributions of minimum free energy (MFE) of known human miRNAs concatenated with 5’ and 3’ adapter sequences. Using Illumina adapter sequences sRNA cloning kit V1.5 the set of miRNAs found by Illumina has lower average MFE than the set of miRNAs found by 454 (Wilcoxon test p = 0.01). We found the same result using the 3' adapter from sRNA cloning kit V1.0 (data not shown). e Conversely, using 454 adapter sequences average MFE is lower for set of miRNAs found by 454 (p = 0.07). f Analogous results for concatenation of miRNA only with 3’ adapter display a similar trend (see Additional file 5: Figure S7).

References

    1. Kermekchiev MB, Kirilova LI, Vail EE, Barnes WM. Mutants of Taq DNA polymerase resistant to PCR inhibitors allow DNA amplification from whole blood and crude soil samples. Nucleic Acids Res. 2009;37(5):e40. doi: 10.1093/nar/gkn1055. - DOI - PMC - PubMed
    1. Lundberg KS, Shoemaker DD, Adams MWW, Short JM, Sorge JA, Mathur EJ. High-fidelity amplification using a thermostable DNA polymerase isolated from Pyrococcus furiosus. Gene. 1991;108:1–6. doi: 10.1016/0378-1119(91)90480-Y. - DOI - PubMed
    1. Zhelkovsky AM, McReynolds LA. Simple and efficient synthesis of 5вЂI pre-adenylated DNA using thermostable RNA ligase. Nucleic Acids Res. 2011;39(17):e117. doi: 10.1093/nar/gkr544. - DOI - PMC - PubMed
    1. Ho CK, Wang LK, Lima CD, Shuman S. Structure and Mechanism of RNA Ligase. Structure. 2004;12(2):327–339. - PubMed
    1. Nandakumar J, Shuman S. How an RNA Ligase Discriminates RNA versus DNA Damage. Mol Cell. 2004;16(2):211–221. doi: 10.1016/j.molcel.2004.09.022. - DOI - PubMed