Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2014 Feb;42(3):1414-26.
doi: 10.1093/nar/gkt1021. Epub 2013 Nov 5.

Biases in small RNA deep sequencing data

Affiliations
Review

Biases in small RNA deep sequencing data

Carsten A Raabe et al. Nucleic Acids Res. 2014 Feb.

Abstract

High-throughput RNA sequencing (RNA-seq) is considered a powerful tool for novel gene discovery and fine-tuned transcriptional profiling. The digital nature of RNA-seq is also believed to simplify meta-analysis and to reduce background noise associated with hybridization-based approaches. The development of multiplex sequencing enables efficient and economic parallel analysis of gene expression. In addition, RNA-seq is of particular value when low RNA expression or modest changes between samples are monitored. However, recent data uncovered severe bias in the sequencing of small non-protein coding RNA (small RNA-seq or sRNA-seq), such that the expression levels of some RNAs appeared to be artificially enhanced and others diminished or even undetectable. The use of different adapters and barcodes during ligation as well as complex RNA structures and modifications drastically influence cDNA synthesis efficacies and exemplify sources of bias in deep sequencing. In addition, variable specific RNA G/C-content is associated with unequal polymerase chain reaction amplification efficiencies. Given the central importance of RNA-seq to molecular biology and personalized medicine, we review recent findings that challenge small non-protein coding RNA-seq data and suggest approaches and precautions to overcome or minimize bias.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Illustration of the steps involved in cDNA construction, including potential sources of bias. (A) The starting pool of non-protein coding RNAs with different 5′- and 3′-end modifications schematically indicated by different line types. Abbreviations for the various modifications: OH: hydroxyl, OPO3: 2′-3′-cyclic phosphate, ppp: triphosphate, p: monophosphate, cap: cap, and 2′-OCH3: 2′-O-methyl. (B) The left panel depicts different enzymatic pre-treatments prior to RNA 5′-end ligation to enrich for different RNA subtypes. From left to right: RNA without any pretreatment (5′-adapter ligation); RNA pretreated with tobacco acid phosphatase (TAP); RNA pretreated with Terminator™ 5′-phosphate-dependent exonuclease and TAP (Terminator 5′-exo; TAP); RNA treated with T4 polynucleotide kinase (T4 PNK). RNA classes accessible for adapter ligation after the respective 5′-end pretreatments are schematically represented below each pretreatment. The right panel depicts subtypes of RNA classes accessible for 3′-end tailing (-oligo(A) or –oligo(C) tailing) and adapter ligation. (C) Possible biases associated with RNA 5′-(left) and 3′-(right) end-modifications and with the subsequent steps of cDNA construction.
Figure 2.
Figure 2.
Schematic representation of the three-step mechanism of the bacteriophage T4 RNA ligase (T4Rnl) reaction with the potential side products. Encircled Arabic numbers indicate the order of ligation steps. Step 1: adenylation of the T4Rnl active site; Step 2: donor 5′-adenylation (side reaction: circularization of 3′-end-unprotected donor molecules); Step 3: phosphodiester bond formation between 3′-hydroxylated (OH) acceptor and donor molecules (side reaction: reverse adenylation of donor and circularization of acceptor molecules). PPi: pyrophosphate, p: monophosphate, AMP: adenosine monophosphate (Ap), App: 5′-adenylated termini.

References

    1. Brosius J. The fragmented gene. Ann. N. Y. Acad. Sci. 2009;1178:186–193. - PubMed
    1. Brosius J, Tiedge H. RNomenclature. RNA Biol. 2004;1:81–83. - PubMed
    1. Liu Q, Paroo Z. Biochemical principles of small RNA pathways. Annu. Rev. Biochem. 2010;79:295–319. - PubMed
    1. Brosius J. In: From Nucleic Acids Sequences to Molecular Medicine, RNA Technologies. Erdmann VA, Barciszewski J, editors. Berlin Heidelberg: Springer; 2012. pp. 1–18.
    1. Huttenhofer A, Brosius J, Bachellerie JP. RNomics: identification and function of small, non-messenger RNAs. Curr. Opin. Chem. Biol. 2002;6:835–843. - PubMed

Publication types

MeSH terms