Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2014 Oct 20;15(1):912.
doi: 10.1186/1471-2164-15-912.

Comparison of library preparation methods reveals their impact on interpretation of metatranscriptomic data

Affiliations
Comparative Study

Comparison of library preparation methods reveals their impact on interpretation of metatranscriptomic data

Adriana Alberti et al. BMC Genomics. .

Abstract

Background: Metatranscriptomics is rapidly expanding our knowledge of gene expression patterns and pathway dynamics in natural microbial communities. However, to cope with the challenges of environmental sampling, various rRNA removal and cDNA synthesis methods have been applied in published microbial metatranscriptomic studies, making comparisons arduous. Whereas efficiency and biases introduced by rRNA removal methods have been relatively well explored, the impact of cDNA synthesis and library preparation on transcript abundance remains poorly characterized. The evaluation of potential biases introduced at this step is challenging for metatranscriptomic samples, where data analyses are complex, for example because of the lack of reference genomes.

Results: Herein, we tested four cDNA synthesis and Illumina library preparation protocols on a simplified mixture of total RNA extracted from four bacterial species. In parallel, RNA from each microbe was tested individually. cDNA synthesis was performed on rRNA depleted samples using the TruSeq Stranded Total RNA Library Preparation, the SMARTer Stranded RNA-Seq, or the Ovation RNA-Seq V2 System. A fourth experiment was made directly from total RNA using the Encore Complete Prokaryotic RNA-Seq. The obtained sequencing data were analyzed for: library complexity and reproducibility; rRNA removal efficiency and bias; the number of genes detected; coverage uniformity; and the impact of protocols on expression biases. Significant variations, especially in organism representation and gene expression patterns, were observed among the four methods. TruSeq generally performed best, but is limited by its requirement of hundreds of nanograms of total RNA. The SMARTer method appears the best solution for smaller amounts of input RNA. For very low amounts of RNA, the Ovation System provides the only option; however, the observed biases emphasized its limitations for quantitative analyses.

Conclusions: cDNA and library preparation methods may affect the outcome and interpretation of metatranscriptomic data. The most appropriate method should be chosen based on the available quantity of input RNA and the quantitative or non-quantitative objectives of the study. When low amounts of RNA are available, as in most metatranscriptomic studies, the SMARTer method seems to be the best compromise to obtain reliable results. This study emphasized the difficulty in comparing metatranscriptomic studies performed using different methods.

PubMed Disclaimer

Figures

Figure 1
Figure 1
CDS coverage and GC content. (a), (b), (c): box plots distribution of the CDS read counts normalized by the total read count for three categories of GC content (<40%, 40-50% and >50% GC) and for each MIX library. (d): distribution of the cumulated coverage along the length of the annotated CDS for MIX libraries (in 5′- > 3′ orientation).
Figure 2
Figure 2
Gene expression profile of TS versus OV, SMART and ENC methods in MIX samples. Gene expression profile from TS_MIX versus (a) ENC_MIX (b) SMART_MIX (c) OV_MIX samples. This figure shows the log scatter plots and the coefficients of determination (R2) obtained by comparing FPKM values for 14602 annotated CDS in the mix of bacteria.
Figure 3
Figure 3
Evaluation of the GC bias in the differential expression analysis. Genes detected as the most differentially expressed were extracted (5 fold more or less differentially expressed) for ENC and OV MIX libraries. The proportions of CDS for different GC content intervals were plotted to evaluate the bias introduced by the cDNA synthesis method.

References

    1. Gilbert JA, Field D, Huang Y, Edwards R, Li W, Gilna P, Joint I. Detection of large numbers of novel sequences in the metatranscriptomes of complex marine microbial communities. PLoS One. 2008;3(8):e3042. doi: 10.1371/journal.pone.0003042. - DOI - PMC - PubMed
    1. Frias-Lopez J, Shi Y, Tyson GW, Coleman ML, Schuster SC, Chisholm SW, Delong EF. Microbial community gene expression in ocean surface waters. Proc Natl Acad Sci U S A. 2008;105(10):3805–3810. doi: 10.1073/pnas.0708897105. - DOI - PMC - PubMed
    1. Poretsky RS, Hewson I, Sun S, Allen AE, Zehr JP, Moran MA. Comparative day/night metatranscriptomic analysis of microbial communities in the North Pacific subtropical gyre. Environ Microbiol. 2009;11(6):1358–1375. doi: 10.1111/j.1462-2920.2008.01863.x. - DOI - PubMed
    1. Hewson I, Poretsky RS, Beinart RA, White AE, Shi T, Bench SR, Moisander PH, Paerl RW, Tripp HJ, Montoya JP, Moran MA, Zehr JP. In situ transcriptomic analysis of the globally important keystone N2-fixing taxon Crocosphaera watsonii. ISME J. 2009;3(5):618–631. doi: 10.1038/ismej.2009.8. - DOI - PubMed
    1. Hewson I, Poretsky RS, Dyhrman ST, Zielinski B, White AE, Tripp HJ, Montoya JP, Zehr JP. Microbial community gene expression within colonies of the diazotroph, Trichodesmium, from the Southwest Pacific Ocean. ISME J. 2009;3(11):1286–1300. doi: 10.1038/ismej.2009.75. - DOI - PubMed

Publication types