Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Apr 19:2021:6647597.
doi: 10.1155/2021/6647597. eCollection 2021.

Bias in RNA-seq Library Preparation: Current Challenges and Solutions

Affiliations
Review

Bias in RNA-seq Library Preparation: Current Challenges and Solutions

Huajuan Shi et al. Biomed Res Int. .

Abstract

Although RNA sequencing (RNA-seq) has become the most advanced technology for transcriptome analysis, it also confronts various challenges. As we all know, the workflow of RNA-seq is extremely complicated and it is easy to produce bias. This may damage the quality of RNA-seq dataset and lead to an incorrect interpretation for sequencing result. Thus, our detailed understanding of the source and nature of these biases is essential for the interpretation of RNA-seq data, finding methods to improve the quality of RNA-seq experimental, or development bioinformatics tools to compensate for these biases. Here, we discuss the sources of experimental bias in RNA-seq. And for each type of bias, we discussed the method for improvement, in order to provide some useful suggestions for researcher in RNA-seq experimental.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts of interest to declare.

Figures

Figure 1
Figure 1
Simplified protocol of RNA-seq experiment and sources of bias. (a) Sample preservation and isolation. These biases can include sample degradation, DNA contamination. (b) Strategies for cDNA library construction. ①: the RNA directly converts to cDNA; then, cDNA was fragmented and library preparation. ②: classical a protocol. One method involves reverse transcription (RT) using random primers first, subsequently adapter ligations and sequencing (left). The other method is to first sequentially ligate 3′ and 5′ adapters, followed by performing cDNA synthesis with a primer complementary to the adapter (RT-primer), subsequently sequencing (right). On using the RT primer with a specific sequence, mispriming could occur due to annealing of the RT-primer to transcript sequences with some complementarity (RT mispriming). (c) RNA-seq platform (including Pyrosequencing, sequencing-by-synthesis, and single-molecule sequencing). These biases can be introduced by insertions and deletions, raw single-pass data, etc.

References

    1. Wang Z., Gerstein M., Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nature Reviews Genetics. 2009;10(1):57–63. doi: 10.1038/nrg2484. - DOI - PMC - PubMed
    1. Dohm J. C., Lottaz C., Borodina T., Himmelbauer H. Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Research. 2008;36(16, article e105) doi: 10.1093/nar/gkn425. - DOI - PMC - PubMed
    1. Li S., Tighe S. W., Nicolet C. M., et al. Multi-platform assessment of transcriptome profiling using RNA-seq in the ABRF next-generation sequencing study. Nature Biotechnology. 2014;32(9):915–925. doi: 10.1038/nbt.2972. - DOI - PMC - PubMed
    1. Van Dijk E. L., Jaszczyszyn Y., Thermes C. Library preparation methods for next-generation sequencing: tone down the bias. Experimental Cell Research. 2014;322(1):12–20. doi: 10.1016/j.yexcr.2014.01.008. - DOI - PubMed
    1. Camacho-Sanchez M., Burraco P., Gomez-Mestre I., Leonard J. A. Preservation of RNA and DNA from mammal samples under field conditions. Molecular Ecology Resources. 2013;13(4):663–673. doi: 10.1111/1755-0998.12108. - DOI - PubMed